Department of Biomedical Informatics, Columbia University, New York, New York, USA.
J Am Med Inform Assoc. 2013 Dec;20(e2):e243-52. doi: 10.1136/amiajnl-2013-001930. Epub 2013 Jul 9.
To describe a collaborative approach for developing an electronic health record (EHR) phenotyping algorithm for drug-induced liver injury (DILI).
We analyzed types and causes of differences in DILI case definitions provided by two institutions-Columbia University and Mayo Clinic; harmonized two EHR phenotyping algorithms; and assessed the performance, measured by sensitivity, specificity, positive predictive value, and negative predictive value, of the resulting algorithm at three institutions except that sensitivity was measured only at Columbia University.
Although these sites had the same case definition, their phenotyping methods differed by selection of liver injury diagnoses, inclusion of drugs cited in DILI cases, laboratory tests assessed, laboratory thresholds for liver injury, exclusion criteria, and approaches to validating phenotypes. We reached consensus on a DILI phenotyping algorithm and implemented it at three institutions. The algorithm was adapted locally to account for differences in populations and data access. Implementations collectively yielded 117 algorithm-selected cases and 23 confirmed true positive cases.
Phenotyping for rare conditions benefits significantly from pooling data across institutions. Despite the heterogeneity of EHRs and varied algorithm implementations, we demonstrated the portability of this algorithm across three institutions. The performance of this algorithm for identifying DILI was comparable with other computerized approaches to identify adverse drug events.
Phenotyping algorithms developed for rare and complex conditions are likely to require adaptive implementation at multiple institutions. Better approaches are also needed to share algorithms. Early agreement on goals, data sources, and validation methods may improve the portability of the algorithms.
描述一种用于开发电子病历(EHR)药物性肝损伤(DILI)表型算法的协作方法。
我们分析了两个机构(哥伦比亚大学和梅奥诊所)提供的 DILI 病例定义的类型和差异原因;协调了两种 EHR 表型算法;并评估了除哥伦比亚大学外,在三个机构中使用该算法的性能,通过灵敏度、特异性、阳性预测值和阴性预测值来衡量。
尽管这些站点有相同的病例定义,但它们的表型方法在肝损伤诊断的选择、包括在 DILI 病例中引用的药物、评估的实验室测试、肝损伤的实验室阈值、排除标准和验证表型的方法上存在差异。我们就 DILI 表型算法达成共识,并在三个机构中实施。该算法在本地进行了调整,以考虑到人群和数据访问的差异。实施情况共同产生了 117 个算法选择的病例和 23 个确认的真正阳性病例。
对罕见疾病进行表型分析可从跨机构数据汇集中显著受益。尽管 EHR 存在异质性且算法实施方式各不相同,但我们证明了该算法在三个机构中的可移植性。该算法用于识别 DILI 的性能与其他用于识别药物不良事件的计算机化方法相当。
为罕见和复杂疾病开发的表型算法可能需要在多个机构进行适应性实施。还需要更好的方法来共享算法。尽早就目标、数据源和验证方法达成一致,可能会提高算法的可移植性。