Applying MetaMap to Medline for identifying novel associations in a large clinical dataset: a feasibility analysis.

Affiliations

Department of Pediatrics, University of Michigan Medical School, Ann Arbor, Michigan, USA.

Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, Michigan, USA.

Publication information

J Am Med Inform Assoc. 2014 Sep-Oct;21(5):925-37. doi: 10.1136/amiajnl-2014-002767. Epub 2014 Jun 13.

Abstract

OBJECTIVE

We describe experiments designed to determine whether known associations can be distinguished from novel ones in a clinical dataset consisting of International Classification of Diseases, Ninth Revision (ICD-9) codes from 1.6 million patients. The clinical associations were compared with associations of ICD-9 codes derived from 20.5 million Medline citations processed using MetaMap. Associations that appear in the clinical dataset but not in Medline citations are potentially novel.

METHODS

Pairwise associations of ICD-9 codes were independently identified in both the clinical and Medline datasets, which were then compared to quantify their degree of overlap. We also performed a manual review of a subset of the associations to validate how well MetaMap performed in identifying diagnoses mentioned in Medline citations that formed the basis of the Medline associations.
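
The comparison described above can be illustrated with a minimal sketch. It assumes that an "association" is simply an unordered pair of codes co-occurring in at least one record (a patient or a citation); the abstract does not state the actual association criterion, so this is a simplification. The function names and the toy records are hypothetical and are not taken from the study.

```python
from itertools import combinations


def pairwise_associations(records):
    """Return the set of unordered code pairs that co-occur in any record.

    Assumption: plain co-occurrence stands in for whatever association
    criterion the study actually applied.
    """
    pairs = set()
    for codes in records:
        # Deduplicate and sort so (a, b) and (b, a) map to the same pair.
        for a, b in combinations(sorted(set(codes)), 2):
            pairs.add((a, b))
    return pairs


def overlap_fraction(clinical_pairs, medline_pairs):
    """Fraction of clinical pairs that also appear in the Medline pairs."""
    if not clinical_pairs:
        return 0.0
    return len(clinical_pairs & medline_pairs) / len(clinical_pairs)


# Toy records with illustrative ICD-9-style codes (hypothetical data).
clinical_records = [
    ["250.00", "401.9"],
    ["250.00", "585.9", "401.9"],
    ["493.90", "477.9"],
]
medline_records = [
    ["250.00", "401.9"],
    ["493.90", "530.81"],
]

clinical_pairs = pairwise_associations(clinical_records)
medline_pairs = pairwise_associations(medline_records)

# Pairs seen only in the clinical data are the candidate novel associations.
candidate_novel = clinical_pairs - medline_pairs

print(f"overlap: {overlap_fraction(clinical_pairs, medline_pairs):.1%}")
print(f"candidate novel pairs: {sorted(candidate_novel)}")
```

Sets keep the comparison symmetric and cheap in this sketch. At the scale reported in the abstract, a real pipeline would likely also need frequency counts or a statistical threshold per pair, which this example leaves out.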

RESULTS

The overlap of associations based on ICD-9 codes in the clinical and Medline datasets was low: only 6.6% of the 3.1 million associations found in the clinical dataset were also present in the Medline dataset. Further, a manual review of a subset of the associations that appeared in both datasets revealed that co-occurring diagnoses from Medline citations do not always represent clinically meaningful associations.
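
To put the reported percentage in absolute terms, the short calculation below derives approximate counts from the two rounded figures in the abstract (3.1 million associations, 6.6% overlap). The resulting numbers are back-of-the-envelope estimates, not values reported by the authors, and the variable names are illustrative.

```python
# Rounded figures as stated in the abstract.
clinical_associations = 3_100_000
overlap_rate = 0.066

# Approximate number of clinical associations also found in Medline.
shared = round(clinical_associations * overlap_rate)

# Approximate number found only in the clinical dataset.
clinical_only = clinical_associations - shared

print(f"shared with Medline: about {shared:,}")
print(f"clinical only: about {clinical_only:,}")
```

Under these rounded inputs, roughly 0.2 million associations are shared and about 2.9 million appear only in the clinical dataset, which is far too many to treat each one as a genuinely novel finding.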

DISCUSSION

Identifying novel associations derived from large clinical datasets remains challenging. Medline as a sole data source for existing knowledge may not be adequate to filter out widely known associations.

CONCLUSIONS

In this study, novel associations were not readily identified. Further improvements in accuracy and relevance for tools such as MetaMap are needed to realize their expected utility.
