Frey L J, Lenert L, Lopez-Campos G
Lewis J Frey, Chair IMIA Genomic Medicine WG, Biomedical Informatics Center, Public Health Sciences, Associate Professor, Hollings Cancer Center, Research Member, Medical University of South Carolina, 135 Cannon Street, Suite 405K, MUSC 200, Charleston, SC 29425. USA, Tel: +1 843 792 4216, Fax: +1 843 792 5587, E-mail:
Yearb Med Inform. 2014 Aug 15;9(1):206-11. doi: 10.15265/IY-2014-0006.
Given the quickening speed of discovery of variant disease drivers from combined patient genotype and phenotype data, the objective is to provide methodology using big data technology to support the definition of deep phenotypes in medical records.
As the vast stores of genomic information increase with next generation sequencing, the importance of deep phenotyping increases. The growth of genomic data and adoption of Electronic Health Records (EHR) in medicine provides a unique opportunity to integrate phenotype and genotype data into medical records. The method by which collections of clinical findings and other health related data are leveraged to form meaningful phenotypes is an active area of research. Longitudinal data stored in EHRs provide a wealth of information that can be used to construct phenotypes of patients. We focus on a practical problem around data integration for deep phenotype identification within EHR data. The use of big data approaches are described that enable scalable markup of EHR events that can be used for semantic and temporal similarity analysis to support the identification of phenotype and genotype relationships.
Stead and colleagues' 2005 concept of using light standards to increase the productivity of software systems by riding on the wave of hardware/processing power is described as a harbinger for designing future healthcare systems. The big data solution, using flexible markup, provides a route to improved utilization of processing power for organizing patient records in genotype and phenotype research.
鉴于从患者基因型和表型数据中发现疾病驱动变异的速度不断加快,目标是提供使用大数据技术的方法,以支持病历中深度表型的定义。
随着下一代测序技术使基因组信息的海量存储不断增加,深度表型分析的重要性日益凸显。医学领域基因组数据的增长以及电子健康记录(EHR)的采用,为将表型和基因型数据整合到病历中提供了独特的机会。利用临床发现和其他健康相关数据的集合来形成有意义表型的方法是一个活跃的研究领域。EHR中存储的纵向数据提供了丰富的信息,可用于构建患者的表型。我们关注EHR数据中围绕深度表型识别的数据整合这一实际问题。描述了使用大数据方法实现EHR事件的可扩展标记,可用于语义和时间相似性分析,以支持表型和基因型关系的识别。
斯特德及其同事在2005年提出的利用轻量级标准搭乘硬件/处理能力提升的浪潮来提高软件系统生产力的概念,被描述为设计未来医疗保健系统的先驱。使用灵活标记的大数据解决方案为在基因型和表型研究中更好地利用处理能力来组织患者记录提供了一条途径。