电子健康记录大数据深度表型分析。国际医学信息学协会基因组医学工作组的贡献。

EHR Big Data Deep Phenotyping. Contribution of the IMIA Genomic Medicine Working Group.

作者信息

Frey L J, Lenert L, Lopez-Campos G

机构信息

Lewis J Frey, Chair IMIA Genomic Medicine WG, Biomedical Informatics Center, Public Health Sciences, Associate Professor, Hollings Cancer Center, Research Member, Medical University of South Carolina, 135 Cannon Street, Suite 405K, MUSC 200, Charleston, SC 29425. USA, Tel: +1 843 792 4216, Fax: +1 843 792 5587, E-mail:

出版信息

Yearb Med Inform. 2014 Aug 15;9(1):206-11. doi: 10.15265/IY-2014-0006.

DOI:10.15265/IY-2014-0006

PMID:25123744

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4287080/

Abstract

OBJECTIVES

Given the quickening speed of discovery of variant disease drivers from combined patient genotype and phenotype data, the objective is to provide methodology using big data technology to support the definition of deep phenotypes in medical records.

METHODS

As the vast stores of genomic information increase with next generation sequencing, the importance of deep phenotyping increases. The growth of genomic data and adoption of Electronic Health Records (EHR) in medicine provides a unique opportunity to integrate phenotype and genotype data into medical records. The method by which collections of clinical findings and other health related data are leveraged to form meaningful phenotypes is an active area of research. Longitudinal data stored in EHRs provide a wealth of information that can be used to construct phenotypes of patients. We focus on a practical problem around data integration for deep phenotype identification within EHR data. The use of big data approaches are described that enable scalable markup of EHR events that can be used for semantic and temporal similarity analysis to support the identification of phenotype and genotype relationships.

CONCLUSIONS

Stead and colleagues' 2005 concept of using light standards to increase the productivity of software systems by riding on the wave of hardware/processing power is described as a harbinger for designing future healthcare systems. The big data solution, using flexible markup, provides a route to improved utilization of processing power for organizing patient records in genotype and phenotype research.

摘要

目的

鉴于从患者基因型和表型数据中发现疾病驱动变异的速度不断加快，目标是提供使用大数据技术的方法，以支持病历中深度表型的定义。

方法

随着下一代测序技术使基因组信息的海量存储不断增加，深度表型分析的重要性日益凸显。医学领域基因组数据的增长以及电子健康记录（EHR）的采用，为将表型和基因型数据整合到病历中提供了独特的机会。利用临床发现和其他健康相关数据的集合来形成有意义表型的方法是一个活跃的研究领域。EHR中存储的纵向数据提供了丰富的信息，可用于构建患者的表型。我们关注EHR数据中围绕深度表型识别的数据整合这一实际问题。描述了使用大数据方法实现EHR事件的可扩展标记，可用于语义和时间相似性分析，以支持表型和基因型关系的识别。

结论

斯特德及其同事在2005年提出的利用轻量级标准搭乘硬件/处理能力提升的浪潮来提高软件系统生产力的概念，被描述为设计未来医疗保健系统的先驱。使用灵活标记的大数据解决方案为在基因型和表型研究中更好地利用处理能力来组织患者记录提供了一条途径。

相似文献

EHR Big Data Deep Phenotyping. Contribution of the IMIA Genomic Medicine Working Group.电子健康记录大数据深度表型分析。国际医学信息学协会基因组医学工作组的贡献。

Yearb Med Inform. 2014 Aug 15;9(1):206-11. doi: 10.15265/IY-2014-0006.

Deep Phenotyping on Electronic Health Records Facilitates Genetic Diagnosis by Clinical Exomes.电子健康记录的深度表型分析有助于通过临床外显子组进行遗传诊断。

Am J Hum Genet. 2018 Jul 5;103(1):58-73. doi: 10.1016/j.ajhg.2018.05.010. Epub 2018 Jun 28.

The use of electronic health records for psychiatric phenotyping and genomics.电子健康记录在精神表型和基因组学中的应用。

Am J Med Genet B Neuropsychiatr Genet. 2018 Oct;177(7):601-612. doi: 10.1002/ajmg.b.32548. Epub 2017 May 30.

Leveraging electronic healthcare record standards and semantic web technologies for the identification of patient cohorts.利用电子医疗记录标准和语义网技术来确定患者队列。

J Am Med Inform Assoc. 2013 Dec;20(e2):e288-96. doi: 10.1136/amiajnl-2013-001923. Epub 2013 Aug 9.

Deep Phenotyping of Chinese Electronic Health Records by Recognizing Linguistic Patterns of Phenotypic Narratives With a Sequence Motif Discovery Tool: Algorithm Development and Validation.利用序列基序发现工具识别表型叙述的语言模式对中国电子健康记录进行深度表型分析：算法开发与验证

J Med Internet Res. 2022 Jun 3;24(6):e37213. doi: 10.2196/37213.

Integration of genetic and clinical information to improve imputation of data missing from electronic health records.整合遗传和临床信息，以改善电子健康记录中缺失数据的推断。

J Am Med Inform Assoc. 2019 Oct 1;26(10):1056-1063. doi: 10.1093/jamia/ocz041.

Text Mining for Precision Medicine: Bringing Structure to EHRs and Biomedical Literature to Understand Genes and Health.精准医学的文本挖掘：为电子健康记录和生物医学文献构建结构以理解基因与健康。

Adv Exp Med Biol. 2016;939:139-166. doi: 10.1007/978-981-10-1503-8_7.

High-throughput phenotyping with temporal sequences.高通量表型分析与时间序列。

J Am Med Inform Assoc. 2021 Mar 18;28(4):772-781. doi: 10.1093/jamia/ocaa288.

Chapter 13: Mining electronic health records in the genomics era.第十三章：基因组时代的电子健康记录挖掘。

PLoS Comput Biol. 2012;8(12):e1002823. doi: 10.1371/journal.pcbi.1002823. Epub 2012 Dec 27.

Building a robust, scalable and standards-driven infrastructure for secondary use of EHR data: the SHARPn project.为电子健康记录数据的二次利用构建一个强大、可扩展且符合标准的基础架构：SHARPn 项目。

J Biomed Inform. 2012 Aug;45(4):763-71. doi: 10.1016/j.jbi.2012.01.009. Epub 2012 Feb 4.

引用本文的文献

PheNominal: an EHR-integrated web application for structured deep phenotyping at the point of care.PheNominal：一个在医疗护理点进行结构化深度表型分析的 EHR 集成型 Web 应用程序。

BMC Med Inform Decis Mak. 2022 Jul 28;22(Suppl 2):198. doi: 10.1186/s12911-022-01927-1.

Sensitivity and Specificity of Real-World Social Factor Screening Approaches.真实世界社会因素筛查方法的灵敏度和特异性。

J Med Syst. 2021 Nov 12;45(12):111. doi: 10.1007/s10916-021-01788-7.

High-throughput phenotyping with temporal sequences.高通量表型分析与时间序列。

J Am Med Inform Assoc. 2021 Mar 18;28(4):772-781. doi: 10.1093/jamia/ocaa288.

An artificial intelligence approach to COVID-19 infection risk assessment in virtual visits: A case report.人工智能在虚拟就诊中评估 COVID-19 感染风险：病例报告。

J Am Med Inform Assoc. 2020 Aug 1;27(8):1321-1325. doi: 10.1093/jamia/ocaa105.

Automated detection of altered mental status in emergency department clinical notes: a deep learning approach.基于深度学习的急诊科临床记录中意识状态改变的自动检测。

BMC Med Inform Decis Mak. 2019 Aug 19;19(1):164. doi: 10.1186/s12911-019-0894-9.

Artificial Intelligence vs. Natural Stupidity: Evaluating AI readiness for the Vietnamese Medical Information System.人工智能与天然愚笨：评估越南医疗信息系统的人工智能就绪情况。

J Clin Med. 2019 Feb 1;8(2):168. doi: 10.3390/jcm8020168.

Artificial Intelligence and Integrated Genotype⁻Phenotype Identification.人工智能与综合基因型-表型鉴定

Genes (Basel). 2018 Dec 28;10(1):18. doi: 10.3390/genes10010018.

Data integration strategies for predictive analytics in precision medicine.精准医学中预测分析的数据整合策略。

Per Med. 2018 Nov;15(6):543-551. doi: 10.2217/pme-2018-0035. Epub 2018 Nov 2.

Careflow Mining Techniques to Explore Type 2 Diabetes Evolution.用于探索2型糖尿病演变的护理流程挖掘技术

J Diabetes Sci Technol. 2018 Mar;12(2):251-259. doi: 10.1177/1932296818761751.

Clinical Research Informatics for Big Data and Precision Medicine.大数据与精准医学的临床研究信息学

Yearb Med Inform. 2016 Nov 10(1):211-218. doi: 10.15265/IY-2016-019.

本文引用的文献

An early illness recognition framework using a temporal Smith Waterman algorithm and NLP.一种使用时间序列史密斯-沃特曼算法和自然语言处理技术的早期疾病识别框架。

AMIA Annu Symp Proc. 2013 Nov 16;2013:548-57. eCollection 2013.

The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data.人类表型本体论项目：通过表型数据将分子生物学和疾病联系起来。

Nucleic Acids Res. 2014 Jan;42(Database issue):D966-74. doi: 10.1093/nar/gkt1026. Epub 2013 Nov 11.

Improving data and knowledge management to better integrate health care and research.改善数据和知识管理，以更好地整合医疗保健与研究。

J Intern Med. 2013 Oct;274(4):321-8. doi: 10.1111/joim.12105. Epub 2013 Jul 15.

PhenoTips: patient phenotyping software for clinical and research use.PhenoTips：用于临床和研究用途的患者表型分析软件。

Hum Mutat. 2013 Aug;34(8):1057-65. doi: 10.1002/humu.22347. Epub 2013 May 24.

Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network.基于电子病历的表型算法验证：eMERGE 网络的结果和经验教训。

J Am Med Inform Assoc. 2013 Jun;20(e1):e147-54. doi: 10.1136/amiajnl-2012-000896. Epub 2013 Mar 26.

Crossing the omic chasm: a time for omic ancillary systems.跨越组学鸿沟：组学辅助系统的时代

JAMA. 2013 Mar 27;309(12):1237-8. doi: 10.1001/jama.2013.1579.

Next-generation phenotyping of electronic health records.电子健康记录的下一代表型分析。

J Am Med Inform Assoc. 2013 Jan 1;20(1):117-21. doi: 10.1136/amiajnl-2012-001145. Epub 2012 Sep 6.

Data Integration in Genomic Medicine: Trends and Applications. Contribution of the IMIA Working Group on Informatics in Genomic Medicine.基因组医学中的数据整合：趋势与应用。IMIA基因组医学信息学工作组的贡献。

Yearb Med Inform. 2012;7:117-25.

Next-generation sequencing: impact of exome sequencing in characterizing Mendelian disorders.下一代测序：外显子组测序在孟德尔疾病特征分析中的影响。

J Hum Genet. 2012 Oct;57(10):621-32. doi: 10.1038/jhg.2012.91. Epub 2012 Jul 26.

Deep phenotyping for precision medicine.深度表型分析用于精准医学。

Hum Mutat. 2012 May;33(5):777-80. doi: 10.1002/humu.22080.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验