Adapting electronic health records-derived phenotypes to claims data: Lessons learned in using limited clinical data for phenotyping.

Authors

Ostropolets Anna, Reich Christian, Ryan Patrick, Shang Ning, Hripcsak George, Weng Chunhua

Affiliations

Columbia University Medical Center, New York, NY, USA; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, USA.

IQVIA, Cambridge, MA, USA; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, USA.

Publication information

J Biomed Inform. 2020 Feb;102:103363. doi: 10.1016/j.jbi.2019.103363. Epub 2019 Dec 19.

DOI:10.1016/j.jbi.2019.103363

PMID:31866433

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7390483/

Abstract

Algorithms for identifying patients of interest from observational data must address missing and inaccurate data and are desired to achieve comparable performance on both administrative claims and electronic health records data. However, administrative claims data do not contain the necessary information to develop accurate algorithms for disorders that require laboratory results, and this omission can result in insensitive diagnostic code-based algorithms. In this paper, we tested our assertion that the performance of a diagnosis code-based algorithm for chronic kidney disorder (CKD) can be improved by adding other codes indirectly related to CKD (e.g., codes for dialysis, kidney transplant, suspicious kidney disorders). Following the best practices from Observational Health Data Sciences and Informatics (OHDSI), we adapted an electronic health record-based gold standard algorithm for CKD and then created algorithms that can be executed on administrative claims data and account for related data quality issues. We externally validated our algorithms on four electronic health record datasets in the OHDSI network. Compared to the algorithm that uses CKD diagnostic codes only, positive predictive value of the algorithms that use additional codes was slightly increased (47.4% vs. 47.9-48.5% respectively). The algorithms adapted from the gold standard algorithm can be used to infer chronic kidney disorder based on administrative claims data. We succeeded in improving the generalizability and consistency of the CKD phenotypes by using data and vocabulary standardized across the OHDSI network, although performance variability across datasets remains. We showed that identifying and addressing coding and data heterogeneity can improve the performance of the algorithms.
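The comparison described in the abstract (a diagnosis-code-only algorithm versus one that also uses indirectly related codes, each scored by positive predictive value) can be illustrated with a minimal sketch. This is not the authors' algorithm. The code sets, patient records, reference labels, and function names below are hypothetical toy values chosen for illustration, and the resulting numbers have no relation to the PPVs reported above.

```python
# Minimal sketch of a code-set phenotype and its positive predictive value (PPV).
# All code sets and patient data are hypothetical, NOT the paper's concept sets.

# Assumed ICD-10-CM examples: CKD stage 3-5 diagnosis codes.
CKD_DIAGNOSIS_CODES = {"N18.3", "N18.4", "N18.5"}
# Assumed indirectly related codes: dialysis dependence, kidney transplant status.
RELATED_CODES = {"Z99.2", "Z94.0"}


def flag_patients(records, code_set):
    """Return the ids of patients who have at least one code in code_set."""
    return {pid for pid, codes in records.items() if codes & code_set}


def positive_predictive_value(flagged, true_cases):
    """PPV = true positives / all patients flagged by the algorithm."""
    if not flagged:
        raise ValueError("no patients were flagged; PPV is undefined")
    return len(flagged & true_cases) / len(flagged)


# Toy claims-like records: patient id -> set of codes seen for that patient.
records = {
    "p1": {"N18.3"},
    "p2": {"Z99.2"},           # related code only, no CKD diagnosis code
    "p3": {"N18.4", "Z94.0"},
    "p4": {"I10"},             # hypertension only
    "p5": {"N18.5"},           # coded as CKD but not a true case in this toy set
}
# Toy reference standard (for example, from chart review).
true_cases = {"p1", "p2", "p3"}

diagnosis_only = flag_patients(records, CKD_DIAGNOSIS_CODES)
extended = flag_patients(records, CKD_DIAGNOSIS_CODES | RELATED_CODES)

ppv_diagnosis_only = positive_predictive_value(diagnosis_only, true_cases)
ppv_extended = positive_predictive_value(extended, true_cases)

print(f"diagnosis codes only: PPV = {ppv_diagnosis_only:.3f}")
print(f"with related codes:   PPV = {ppv_extended:.3f}")
```

In this toy data the extended code set picks up one additional true case (`p2`) without adding false positives, so PPV rises. On real data the added codes can also flag non-cases, which is why any change in PPV has to be measured against a reference standard.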

Similar articles

Phenotyping in distributed data networks: selecting the right codes for the right patients.

AMIA Annu Symp Proc. 2023 Apr 29;2022:826-835. eCollection 2022.

MixEHR-Guided: A guided multi-modal topic modeling approach for large-scale automatic phenotyping using the electronic health record.

J Biomed Inform. 2022 Oct;134:104190. doi: 10.1016/j.jbi.2022.104190. Epub 2022 Sep 1.

Development and validation of phenotype classifiers across multiple sites in the observational health data sciences and informatics network.

J Am Med Inform Assoc. 2020 Jun 1;27(6):877-883. doi: 10.1093/jamia/ocaa032.

Development and electronic health record validation of an algorithm for identifying patients with Duchenne muscular dystrophy in US administrative claims.

J Manag Care Spec Pharm. 2023 Sep;29(9):1033-1044. doi: 10.18553/jmcp.2023.29.9.1033.

Optimizing research in symptomatic uterine fibroids with development of a computable phenotype for use with electronic health records.

Am J Obstet Gynecol. 2018 Jun;218(6):610.e1-610.e7. doi: 10.1016/j.ajog.2018.02.002. Epub 2018 Feb 9.

Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network.

J Am Med Inform Assoc. 2013 Jun;20(e1):e147-54. doi: 10.1136/amiajnl-2012-000896. Epub 2013 Mar 26.

Development and validation of an electronic phenotyping algorithm for chronic kidney disease.

AMIA Annu Symp Proc. 2014 Nov 14;2014:907-16. eCollection 2014.

Validation of human immunodeficiency virus diagnosis codes among women enrollees of a U.S. health plan.

BMC Health Serv Res. 2024 Feb 22;24(1):234. doi: 10.1186/s12913-024-10685-x.

Accuracy of identifying diagnosis of moderate to severe chronic kidney disease in administrative claims data.

Pharmacoepidemiol Drug Saf. 2022 Apr;31(4):467-475. doi: 10.1002/pds.5398. Epub 2021 Dec 23.

Cited by

Multi-domain rule-based phenotyping algorithms enable improved GWAS signal.

NPJ Digit Med. 2025 Aug 2;8(1):499. doi: 10.1038/s41746-025-01815-8.

Ophthalmol Retina. 2024 Aug;8(8):733-743. doi: 10.1016/j.oret.2024.03.014. Epub 2024 Mar 20.

Electronic health record data quality assessment and tools: a systematic review.

J Am Med Inform Assoc. 2023 Sep 25;30(10):1730-1740. doi: 10.1093/jamia/ocad120.

Classifying Infection Risk Following Pediatric Cardiac Surgery.

AMIA Annu Symp Proc. 2023 Apr 29;2022:1153-1162. eCollection 2022.

Measurement Error and Misclassification in Orthopedics: When Study Subjects are Categorized in the Wrong Exposure or Outcome Groups.

J Arthroplasty. 2022 Oct;37(10):1956-1960. doi: 10.1016/j.arth.2022.05.025. Epub 2022 Sep 6.

Development and validation of algorithms to identify patients with chronic kidney disease and related chronic diseases across the Northern Territory, Australia.

BMC Nephrol. 2022 Sep 23;23(1):320. doi: 10.1186/s12882-022-02947-9.

Data Consult Service: Can we use observational data to address immediate clinical needs?

J Am Med Inform Assoc. 2021 Sep 18;28(10):2139-2146. doi: 10.1093/jamia/ocab122.

Electronic phenotyping of health outcomes of interest using a linked claims-electronic health record database: Findings from a machine learning pilot project.

J Am Med Inform Assoc. 2021 Jul 14;28(7):1507-1517. doi: 10.1093/jamia/ocab036.

Impact of Diverse Data Sources on Computational Phenotyping.

Front Genet. 2020 Jun 3;11:556. doi: 10.3389/fgene.2020.00556. eCollection 2020.

Deep phenotyping: Embracing complexity and temporality-Towards scalability, portability, and interoperability.

J Biomed Inform. 2020 May;105:103433. doi: 10.1016/j.jbi.2020.103433. Epub 2020 Apr 23.
