Hanauer David A, Rhodes Daniel R, Chinnaiyan Arul M
Department of Pediatrics, University of Michigan, Ann Arbor, Michigan, United States of America.
PLoS One. 2009;4(4):e5203. doi: 10.1371/journal.pone.0005203. Epub 2009 Apr 13.
The vast amounts of clinical data collected in electronic health records (EHR) is analogous to the data explosion from the "-omics" revolution. In the EHR clinicians often maintain patient-specific problem summary lists which are used to provide a concise overview of significant medical diagnoses. We hypothesized that by tapping into the collective wisdom generated by hundreds of physicians entering problems into the EHR we could detect significant associations among diagnoses that are not described in the literature.
METHODOLOGY/PRINCIPAL FINDINGS: We employed an analytic approach original developed for detecting associations between sets of gene expression data, called Molecular Concept Map (MCM), to find significant associations among the 1.5 million clinical problem summary list entries in 327,000 patients from our institution's EHR. An odds ratio (OR) and p-value was calculated for each association. A subset of the 750,000 associations found were explored using the MCM tool. Expected associations were confirmed and recently reported but poorly known associations were uncovered. Novel associations which may warrant further exploration were also found. Examples of expected associations included non-insulin dependent diabetes mellitus and various diagnoses such as retinopathy, hypertension, and coronary artery disease. A recently reported association included irritable bowel and vulvodynia (OR 2.9, p = 5.6x10(-4)). Associations that are currently unknown or very poorly known included those between granuloma annulare and osteoarthritis (OR 4.3, p = 1.1x10(-4)) and pyloric stenosis and ventricular septal defect (OR 12.1, p = 2.0x10(-3)).
CONCLUSIONS/SIGNIFICANCE: Computer programs developed for analyses of "-omic" data can be successfully applied to the area of clinical medicine. The results of the analysis may be useful for hypothesis generation as well as supporting clinical care by reminding clinicians of likely problems associated with a patient's existing problems.
电子健康记录(EHR)中收集的大量临床数据类似于“组学”革命带来的数据爆炸。在电子健康记录中,临床医生通常会维护针对患者的问题总结清单,用于简要概述重要的医学诊断。我们假设,通过利用数百名医生在电子健康记录中输入问题所产生的集体智慧,我们可以检测出文献中未描述的诊断之间的显著关联。
方法/主要发现:我们采用了一种最初为检测基因表达数据集之间的关联而开发的分析方法,称为分子概念图(MCM),以在我们机构电子健康记录中327,000名患者的150万个临床问题总结清单条目中寻找显著关联。计算了每个关联的比值比(OR)和p值。使用MCM工具对发现的750,000个关联中的一个子集进行了探索。确认了预期的关联,并且发现了最近报道但鲜为人知的关联。还发现了可能值得进一步探索的新关联。预期关联的例子包括非胰岛素依赖型糖尿病与各种诊断,如视网膜病变、高血压和冠状动脉疾病。最近报道的一种关联包括肠易激综合征和外阴痛(OR 2.9,p = 5.6×10⁻⁴)。目前未知或知之甚少的关联包括环状肉芽肿与骨关节炎之间的关联(OR 4.3,p = 1.1×10⁻⁴)以及幽门狭窄与室间隔缺损之间的关联(OR 12.1,p = 2.0×10⁻³)。
结论/意义:为分析“组学”数据而开发的计算机程序可以成功应用于临床医学领域。分析结果可能有助于生成假设,并通过提醒临床医生注意与患者现有问题相关的可能问题来支持临床护理。