• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 UMLS 进行电子健康数据标准化和数据库设计。

Using UMLS for electronic health data standardization and database design.

机构信息

Frances Payne Bolton School of Nursing, Case Western Reserve University, Cleveland, Ohio,USA.

Critical Care Transport, Cleveland Clinic, Cleveland, Ohio,USA.

出版信息

J Am Med Inform Assoc. 2020 Oct 1;27(10):1520-1528. doi: 10.1093/jamia/ocaa176.

DOI:10.1093/jamia/ocaa176
PMID:32940707
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7647352/
Abstract

OBJECTIVE

Patients that undergo medical transfer represent 1 patient population that remains infrequently studied due to challenges in aggregating data across multiple domains and sources that are necessary to capture the entire episode of patient care. To facilitate access to and secondary use of transport patient data, we developed the Transport Data Repository that combines data from 3 separate domains and many sources within our health system.

METHODS

The repository is a relational database anchored by the Unified Medical Language System unique concept identifiers to integrate, map, and standardize the data into a common data model. Primary data domains included sending and receiving hospital encounters, medical transport record, and custom hospital transport log data. A 4-step mapping process was developed: 1) automatic source code match, 2) exact text match, 3) fuzzy matching, and 4) manual matching.

RESULTS

431 090 total mappings were generated in the Transport Data Repository, consisting of 69 010 unique concepts with 77% of the data being mapped automatically. Transport Source Data yielded significantly lower mapping results with only 8% of data entities automatically mapped and a significant amount (43%) remaining unmapped.

DISCUSSION

The multistep mapping process resulted in a majority of data been automatically mapped. Poor matching of transport medical record data is due to the third-party vendor data being generated and stored in a nonstandardized format.

CONCLUSION

The multistep mapping process developed and implemented is necessary to normalize electronic health data from multiple domains and sources into a common data model to support secondary use of data.

摘要

目的

接受医疗转运的患者是一个很少被研究的人群,这是因为在多个领域和来源中汇集数据以捕获患者护理的整个过程存在挑战。为了便于访问和二次使用转运患者数据,我们开发了转运数据存储库,该存储库结合了我们的医疗系统中来自 3 个独立领域和许多来源的数据。

方法

该存储库是一个关系数据库,以统一医学语言系统唯一概念标识符为基础,以整合、映射和标准化数据到通用数据模型中。主要数据领域包括发送和接收医院就诊、医疗转运记录和自定义医院转运日志数据。开发了一个 4 步映射过程:1)自动源代码匹配,2)精确文本匹配,3)模糊匹配和 4)手动匹配。

结果

在转运数据存储库中生成了 431090 个总映射,包括 69010 个唯一概念,其中 77%的数据是自动映射的。转运源数据的映射结果明显较低,只有 8%的数据实体自动映射,而大量(43%)的数据仍未映射。

讨论

多步映射过程导致大部分数据自动映射。转运医疗记录数据匹配不佳是由于第三方供应商的数据是在非标准化格式中生成和存储的。

结论

开发和实施的多步映射过程对于将来自多个领域和来源的电子健康数据规范化到通用数据模型中以支持数据的二次使用是必要的。

相似文献

1
Using UMLS for electronic health data standardization and database design.使用 UMLS 进行电子健康数据标准化和数据库设计。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1520-1528. doi: 10.1093/jamia/ocaa176.
2
Data quality assessment framework to assess electronic medical record data for use in research.用于评估电子病历数据以供研究使用的数据质量评估框架。
Int J Med Inform. 2016 Jun;90:40-7. doi: 10.1016/j.ijmedinf.2016.03.006. Epub 2016 Mar 24.
3
Text Simplification Using Consumer Health Vocabulary to Generate Patient-Centered Radiology Reporting: Translation and Evaluation.使用消费者健康词汇进行文本简化以生成以患者为中心的放射学报告:翻译与评估
J Med Internet Res. 2017 Dec 18;19(12):e417. doi: 10.2196/jmir.8536.
4
A method for cohort selection of cardiovascular disease records from an electronic health record system.一种从电子健康记录系统中选择心血管疾病记录队列的方法。
Int J Med Inform. 2017 Jun;102:138-149. doi: 10.1016/j.ijmedinf.2017.03.015. Epub 2017 Mar 30.
5
Standardized mappings--a framework to combine different semantic mappers into a standardized web-API.标准化映射——一种将不同语义映射器组合成标准化网络应用程序编程接口的框架。
Stud Health Technol Inform. 2015;212:23-6.
6
Extraction of UMLS® Concepts Using Apache cTAKES™ for German Language.使用Apache cTAKES™从德语中提取统一医学语言系统(UMLS®)概念。
Stud Health Technol Inform. 2016;223:71-6.
7
Evaluating MedDRA-to-ICD terminology mappings.评估 MedDRA 到 ICD 的术语映射。
BMC Med Inform Decis Mak. 2024 Feb 7;23(Suppl 4):299. doi: 10.1186/s12911-023-02375-1.
8
Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives.开发和评估 RapTAT:一种用于从医学叙述中映射短语概念的机器学习系统。
J Biomed Inform. 2014 Apr;48:54-65. doi: 10.1016/j.jbi.2013.11.008. Epub 2013 Dec 4.
9
Phase II evaluation of clinical coding schemes: completeness, taxonomy, mapping, definitions, and clarity. CPRI Work Group on Codes and Structures.临床编码方案的II期评估:完整性、分类法、映射、定义及清晰度。CPRI代码与结构工作组
J Am Med Inform Assoc. 1997 May-Jun;4(3):238-51. doi: 10.1136/jamia.1997.0040238.
10
UMLS-Query: a perl module for querying the UMLS.UMLS查询:一个用于查询统一医学语言系统(UMLS)的Perl模块。
AMIA Annu Symp Proc. 2008 Nov 6;2008:652-6.

引用本文的文献

1
Natural language processing and expert follow-up establishes tachycardia association with CDKL5 deficiency disorder.自然语言处理和专家随访确定了心动过速与CDKL5缺乏症之间的关联。
Genet Med Open. 2023 Nov 18;2:100842. doi: 10.1016/j.gimo.2023.100842. eCollection 2024.
2
The challenges and opportunities of continuous data quality improvement for healthcare administration data.医疗管理数据持续数据质量改进的挑战与机遇
JAMIA Open. 2024 Aug 1;7(3):ooae058. doi: 10.1093/jamiaopen/ooae058. eCollection 2024 Oct.
3
Healthcare utilization and clinical characteristics of genetic epilepsy in electronic health records.电子健康记录中遗传性癫痫的医疗保健利用情况及临床特征
Brain Commun. 2024 Mar 14;6(2):fcae090. doi: 10.1093/braincomms/fcae090. eCollection 2024.
4
A GCN-based approach to uncover misaligned synonymous terms in the UMLS Metathesaurus.基于图卷积网络的方法揭示 UMLS Metathesaurus 中未对齐的同义术语。
AMIA Annu Symp Proc. 2024 Jan 11;2023:977-986. eCollection 2023.
5
Nursing Informatics' Contribution to One Health.护理信息学对大健康的贡献。
Yearb Med Inform. 2023 Aug;32(1):65-75. doi: 10.1055/s-0043-1768738. Epub 2023 Dec 26.
6
A Data Transformation Methodology to Create Findable, Accessible, Interoperable, and Reusable Health Data: Software Design, Development, and Evaluation Study.一种创建可发现、可访问、可互操作和可重用健康数据的数据转换方法:软件设计、开发和评估研究。
J Med Internet Res. 2023 Mar 8;25:e42822. doi: 10.2196/42822.
7
High-risk diagnosis combinations in patients undergoing interhospital transfer: a retrospective observational study.高危诊断组合在院内转院患者中的应用:一项回顾性观察研究。
BMC Emerg Med. 2022 Nov 24;22(1):187. doi: 10.1186/s12873-022-00742-1.
8
ELaPro, a LOINC-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials.ELaPro,一个 LOINC 映射的核心数据集,用于临床试验资格筛选的顶级实验室程序。
BMC Med Res Methodol. 2022 May 14;22(1):141. doi: 10.1186/s12874-022-01611-y.
9
Subcategorizing EHR diagnosis codes to improve clinical application of machine learning models.对电子健康记录诊断代码进行细分,以提高机器学习模型的临床应用。
Int J Med Inform. 2021 Dec;156:104588. doi: 10.1016/j.ijmedinf.2021.104588. Epub 2021 Sep 21.
10
The UMLS knowledge sources at 30: indispensable to current research and applications in biomedical informatics.30岁的统一医学语言系统知识源:生物医学信息学当前研究与应用不可或缺的要素
J Am Med Inform Assoc. 2020 Oct 1;27(10):1499-1501. doi: 10.1093/jamia/ocaa208.

本文引用的文献

1
Research-grade data in the real world: challenges and opportunities in data quality from a pragmatic trial in community-based practices.真实世界中的研究级数据:来自社区实践中实用试验的数据质量的挑战和机遇。
J Am Med Inform Assoc. 2019 Aug 1;26(8-9):847-854. doi: 10.1093/jamia/ocz062.
2
Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model.All Of Us 研究计划的数据模型协调:将 i2b2 数据转换为 OMOP 通用数据模型。
PLoS One. 2019 Feb 19;14(2):e0212463. doi: 10.1371/journal.pone.0212463. eCollection 2019.
3
An ontology-guided semantic data integration framework to support integrative data analysis of cancer survival.本体指导的语义数据集成框架,支持癌症生存的综合数据分析。
BMC Med Inform Decis Mak. 2018 Jul 23;18(Suppl 2):41. doi: 10.1186/s12911-018-0636-4.
4
Evaluating Foundational Data Quality in the National Patient-Centered Clinical Research Network (PCORnet®).评估国家以患者为中心的临床研究网络(PCORnet®)中的基础数据质量。
EGEMS (Wash DC). 2018 Apr 13;6(1):3. doi: 10.5334/egems.199.
5
Extracting and utilizing electronic health data from Epic for research.从Epic系统中提取并利用电子健康数据用于研究。
Ann Transl Med. 2018 Feb;6(3):42. doi: 10.21037/atm.2018.01.13.
6
Enhanced LexSynonym Acquisition for Effective UMLS Concept Mapping.用于有效统一医学语言系统(UMLS)概念映射的增强型词汇同义词获取
Stud Health Technol Inform. 2017;245:501-505.
7
Semi-Automatic Mark-Up and UMLS Annotation of Clinical Guidelines.临床指南的半自动标记与统一医学语言系统注释
Stud Health Technol Inform. 2017;245:294-297.
8
Segment convolutional neural networks (Seg-CNNs) for classifying relations in clinical notes.用于在临床笔记中分类关系的分段卷积神经网络(Seg-CNNs)。
J Am Med Inform Assoc. 2018 Jan 1;25(1):93-98. doi: 10.1093/jamia/ocx090.
9
The Agency for Healthcare Research and Quality and the Development of a Learning Health Care System.医疗保健研究与质量机构以及学习型医疗保健系统的发展。
JAMA Intern Med. 2017 Jul 1;177(7):909-910. doi: 10.1001/jamainternmed.2017.2589.
10
Development and validation of a structured query language implementation of the Elixhauser comorbidity index.埃利克斯豪泽共病指数结构化查询语言实现方法的开发与验证
J Am Med Inform Assoc. 2017 Jul 1;24(4):845-850. doi: 10.1093/jamia/ocw181.