• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用统一医学语言系统进行下一代表型分析。

Next generation phenotyping using the unified medical language system.

机构信息

Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, United States.

出版信息

JMIR Med Inform. 2014 Mar 18;2(1):e5. doi: 10.2196/medinform.3172.

DOI:10.2196/medinform.3172
PMID:25601137
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4288084/
Abstract

BACKGROUND

Structured information within patient medical records represents a largely untapped treasure trove of research data. In the United States, privacy issues notwithstanding, this has recently become more accessible thanks to the increasing adoption of electronic health records (EHR) and health care data standards fueled by the Meaningful Use legislation. The other side of the coin is that it is now becoming increasingly more difficult to navigate the profusion of many disparate clinical terminology standards, which often span millions of concepts.

OBJECTIVE

The objective of our study was to develop a methodology for integrating large amounts of structured clinical information that is both terminology agnostic and able to capture heterogeneous clinical phenotypes including problems, procedures, medications, and clinical results (such as laboratory tests and clinical observations). In this context, we define phenotyping as the extraction of all clinically relevant features contained in the EHR.

METHODS

The scope of the project was framed by the Common Meaningful Use (MU) Dataset terminology standards; the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT), RxNorm, the Logical Observation Identifiers Names and Codes (LOINC), the Current Procedural Terminology (CPT), the Health care Common Procedure Coding System (HCPCS), the International Classification of Diseases Ninth Revision Clinical Modification (ICD-9-CM), and the International Classification of Diseases Tenth Revision Clinical Modification (ICD-10-CM). The Unified Medical Language System (UMLS) was used as a mapping layer among the MU ontologies. An extract, load, and transform approach separated original annotations in the EHR from the mapping process and allowed for continuous updates as the terminologies were updated. Additionally, we integrated all terminologies into a single UMLS derived ontology and further optimized it to make the relatively large concept graph manageable.

RESULTS

The initial evaluation was performed with simulated data from the Clinical Avatars project using 100,000 virtual patients undergoing a 90 day, genotype guided, warfarin dosing protocol. This dataset was annotated with standard MU terminologies, loaded, and transformed using the UMLS. We have deployed this methodology to scale in our in-house analytics platform using structured EHR data for 7931 patients (12 million clinical observations) treated at the Froedtert Hospital. A demonstration limited to Clinical Avatars data is available on the Internet using the credentials user "jmirdemo" and password "jmirdemo".

CONCLUSIONS

Despite its inherent complexity, the UMLS can serve as an effective interface terminology for many of the clinical data standards currently used in the health care domain.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ed0/4288084/5321060c1bc2/medinform_v2i1e5_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ed0/4288084/2552b5b36394/medinform_v2i1e5_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ed0/4288084/7a0b92b1c9d8/medinform_v2i1e5_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ed0/4288084/5321060c1bc2/medinform_v2i1e5_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ed0/4288084/2552b5b36394/medinform_v2i1e5_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ed0/4288084/7a0b92b1c9d8/medinform_v2i1e5_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ed0/4288084/5321060c1bc2/medinform_v2i1e5_fig3.jpg
摘要

背景

患者病历中的结构化信息代表了一个尚未开发的研究数据宝库。在美国,尽管存在隐私问题,但由于电子健康记录 (EHR) 的日益普及以及受“有意义使用”法规推动的医疗保健数据标准,这一点最近变得更加容易实现。另一方面,现在越来越难以驾驭许多不同的临床术语标准,这些标准通常涵盖数百万个概念。

目的

我们研究的目的是开发一种方法来整合大量的结构化临床信息,这种方法既与术语无关,又能够捕获包括问题、程序、药物和临床结果(如实验室检查和临床观察)在内的异构临床表型。在这种情况下,我们将表型定义为从 EHR 中提取所有包含的临床相关特征。

方法

项目的范围由通用有意义使用 (MU) 数据集术语标准、系统命名法医学术语 (SNOMED CT)、RxNorm、逻辑观察标识符名称和代码 (LOINC)、当前程序术语 (CPT)、医疗保健常见程序编码系统 (HCPCS)、国际疾病分类第九修订临床修订版 (ICD-9-CM) 和国际疾病分类第十版临床修订版 (ICD-10-CM) 定义。统一医学语言系统 (UMLS) 被用作 MU 本体之间的映射层。提取、加载和转换方法将 EHR 中的原始注释与映射过程分开,并允许随着术语的更新而进行持续更新。此外,我们将所有术语集成到一个单一的 UMLS 衍生本体中,并进一步对其进行优化,以使其相对较大的概念图易于管理。

结果

最初的评估是使用来自 Clinical Avatars 项目的模拟数据进行的,涉及 10 万名接受 90 天基因指导华法林剂量方案的虚拟患者。该数据集使用标准 MU 术语进行了注释,并使用 UMLS 进行了加载和转换。我们已经将这种方法部署到我们的内部分析平台中,使用 7931 名患者(1200 万条临床观察)的结构化 EHR 数据进行治疗。在 Froedtert 医院。可在 Internet 上使用凭据“jmirdemo”和密码“jmirdemo”访问对 Clinical Avatars 数据的演示。

结论

尽管存在内在的复杂性,但 UMLS 可以作为医疗保健领域目前使用的许多临床数据标准的有效接口术语。

相似文献

1
Next generation phenotyping using the unified medical language system.利用统一医学语言系统进行下一代表型分析。
JMIR Med Inform. 2014 Mar 18;2(1):e5. doi: 10.2196/medinform.3172.
2
Issues in mapping LOINC laboratory tests to SNOMED CT.将LOINC实验室检查映射到SNOMED CT中的问题。
AMIA Annu Symp Proc. 2008 Nov 6;2008:51-5.
3
An alternative database approach for management of SNOMED CT and improved patient data queries.一种用于管理医学系统命名法临床术语(SNOMED CT)及改进患者数据查询的替代数据库方法。
J Biomed Inform. 2015 Oct;57:350-7. doi: 10.1016/j.jbi.2015.08.016. Epub 2015 Aug 21.
4
ELaPro, a LOINC-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials.ELaPro,一个 LOINC 映射的核心数据集,用于临床试验资格筛选的顶级实验室程序。
BMC Med Res Methodol. 2022 May 14;22(1):141. doi: 10.1186/s12874-022-01611-y.
5
Recent Developments in Clinical Terminologies - SNOMED CT, LOINC, and RxNorm.临床术语的最新进展——SNOMED CT、LOINC和RxNorm。
Yearb Med Inform. 2018 Aug;27(1):129-139. doi: 10.1055/s-0038-1667077. Epub 2018 Aug 29.
6
A comparative analysis of the density of the SNOMED CT conceptual content for semantic harmonization.用于语义协调的SNOMED CT概念内容密度的比较分析。
Artif Intell Med. 2015 May;64(1):29-40. doi: 10.1016/j.artmed.2015.03.002. Epub 2015 Apr 2.
7
Proposed Algorithm with Standard Terminologies (SNOMED and CPT) for Automated Generation of Medical Bills for Laboratory Tests.用于自动生成实验室检查医疗账单的、带有标准术语(国际疾病分类标准临床术语和现行程序编码)的建议算法。
Healthc Inform Res. 2010 Sep;16(3):185-90. doi: 10.4258/hir.2010.16.3.185. Epub 2010 Sep 30.
8
Standard Lexicons, Coding Systems and Ontologies for Interoperability and Semantic Computation in Imaging.用于成像中互操作性和语义计算的标准词汇表、编码系统和本体
J Digit Imaging. 2018 Jun;31(3):353-360. doi: 10.1007/s10278-018-0069-8.
9
Medical informatics standards applicable to emergency department information systems: making sense of the jumble.适用于急诊科信息系统的医学信息学标准:理清混乱局面。
Acad Emerg Med. 2004 Nov;11(11):1198-205. doi: 10.1197/j.aem.2004.08.023.
10
Standardizing Clinical Diagnoses: Evaluating Alternate Terminology Selection.标准化临床诊断:评估替代术语的选择
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:71-79. eCollection 2020.

引用本文的文献

1
PheNormGPT: a framework for extraction and normalization of key medical findings.PheNormGPT:一种用于提取和规范关键医学发现的框架。
Database (Oxford). 2024 Oct 23;2024. doi: 10.1093/database/baae103.
2
Using the Unified Medical Language System to Expand the Operative Stress Score - First Use Case.使用统一医学语言系统扩展手术应激评分——第一个用例。
J Surg Res. 2021 Dec;268:552-561. doi: 10.1016/j.jss.2021.07.030. Epub 2021 Aug 28.
3
Electronic Health Records-Based Cardio-Oncology Registry for Care Gap Identification and Pragmatic Research: Procedure and Observational Study.

本文引用的文献

1
EHR-based phenome wide association study in pancreatic cancer.基于电子健康记录的胰腺癌全表型组关联研究。
AMIA Jt Summits Transl Sci Proc. 2014 Apr 7;2014:9-15. eCollection 2014.
2
The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data.人类表型本体论项目:通过表型数据将分子生物学和疾病联系起来。
Nucleic Acids Res. 2014 Jan;42(Database issue):D966-74. doi: 10.1093/nar/gkt1026. Epub 2013 Nov 11.
3
User evaluation of the effects of a text simplification algorithm using term familiarity on perception, understanding, learning, and information retention.
基于电子健康记录的心脏肿瘤学登记系统用于识别照护差距和务实研究:程序与观察性研究
JMIR Cardio. 2021 May 12;5(1):e22296. doi: 10.2196/22296.
4
Linking clinotypes to phenotypes and genotypes from laboratory test results in comprehensive physical exams.将临床表型与实验室检测结果中的基因型联系起来,进行全面的体格检查。
BMC Med Inform Decis Mak. 2021 Feb 24;21(Suppl 3):51. doi: 10.1186/s12911-021-01387-z.
5
Using UMLS for electronic health data standardization and database design.使用 UMLS 进行电子健康数据标准化和数据库设计。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1520-1528. doi: 10.1093/jamia/ocaa176.
6
A transformation-based method for auditing the IS-A hierarchy of biomedical terminologies in the Unified Medical Language System.基于转换的方法审核统一医学语言系统中生物医学术语的 IS-A 层次结构。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1568-1575. doi: 10.1093/jamia/ocaa123.
7
Development and implementation of a dynamically updated big data intelligence platform from electronic health records for nasopharyngeal carcinoma research.开发和实施一个基于电子健康记录的鼻咽癌研究动态更新大数据智能平台。
Br J Radiol. 2019 Oct;92(1102):20190255. doi: 10.1259/bjr.20190255. Epub 2019 Aug 20.
8
Predicting life expectancy with a long short-term memory recurrent neural network using electronic medical records.使用电子病历通过长短期记忆递归神经网络预测预期寿命。
BMC Med Inform Decis Mak. 2019 Feb 28;19(1):36. doi: 10.1186/s12911-019-0775-2.
9
SNOMED CT Concept Hierarchies for Computable Clinical Phenotypes From Electronic Health Record Data: Comparison of Intensional Versus Extensional Value Sets.用于从电子健康记录数据中获取可计算临床表型的SNOMED CT概念层次结构:内涵值集与外延值集的比较
JMIR Med Inform. 2019 Jan 16;7(1):e11487. doi: 10.2196/11487.
10
SNOMED CT Concept Hierarchies for Sharing Definitions of Clinical Conditions Using Electronic Health Record Data.使用电子健康记录数据共享临床病症定义的SNOMED CT概念层次结构。
Appl Clin Inform. 2018 Jul;9(3):667-682. doi: 10.1055/s-0038-1668090. Epub 2018 Aug 29.
用户对一种使用术语熟悉度的文本简化算法在感知、理解、学习和信息保留方面效果的评估。
J Med Internet Res. 2013 Jul 31;15(7):e144. doi: 10.2196/jmir.2569.
4
Accuracy of electronically reported "meaningful use" clinical quality measures.电子报告的“有意义使用”临床质量指标的准确性。
Ann Intern Med. 2013 Jul 2;159(1):73. doi: 10.7326/0003-4819-159-1-201307020-00017.
5
Towards a Universal Clinical Genomics Database: the 2012 International Standards for Cytogenomic Arrays Consortium Meeting.迈向通用临床基因组数据库:2012 年国际细胞遗传学芯片联盟会议。
Hum Mutat. 2013 Jun;34(6):915-9. doi: 10.1002/humu.22306. Epub 2013 Apr 2.
6
Accuracy of electronically reported "meaningful use" clinical quality measures: a cross-sectional study.电子报告的“有意义使用”临床质量指标的准确性:一项横断面研究。
Ann Intern Med. 2013 Jan 15;158(2):77-83. doi: 10.7326/0003-4819-158-2-201301150-00001.
7
Quality assurance in LOINC using Description Logic.使用描述逻辑进行LOINC中的质量保证。
AMIA Annu Symp Proc. 2012;2012:1099-108. Epub 2012 Nov 3.
8
A systems approach to designing effective clinical trials using simulations.系统方法在设计有效的临床试验中的应用模拟。
Circulation. 2013 Jan 29;127(4):517-26. doi: 10.1161/CIRCULATIONAHA.112.123034. Epub 2012 Dec 21.
9
VarioML framework for comprehensive variation data representation and exchange.VarioML 框架用于全面的变异数据表示和交换。
BMC Bioinformatics. 2012 Oct 3;13:254. doi: 10.1186/1471-2105-13-254.
10
Next-generation phenotyping of electronic health records.电子健康记录的下一代表型分析。
J Am Med Inform Assoc. 2013 Jan 1;20(1):117-21. doi: 10.1136/amiajnl-2012-001145. Epub 2012 Sep 6.