• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

真实世界数据集中常见眼病的增强型表型识别

Enhanced Phenotype Identification of Common Ocular Diseases in Real-World Datasets.

作者信息

Stein Joshua D, An Hong Su, Andrews Chris A, Pershing Suzann, Mungle Tushar, Bicket Amanda K, Rosenthal Julie M, Zhang Amy D, Lee Wen-Shin, Ludwig Cassie, Mekonnen Bethlehem, Hernandez-Boussard Tina

机构信息

Department of Ophthalmology and Visual Sciences, University of Michigan, Ann Arbor, Michigan.

Department of Health Management and Policy, School of Public Health, University of Michigan, Ann Arbor, Michigan.

出版信息

Ophthalmol Sci. 2025 Jan 24;5(4):100717. doi: 10.1016/j.xops.2025.100717. eCollection 2025 Jul-Aug.

DOI:10.1016/j.xops.2025.100717
PMID:40212931
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11985028/
Abstract

OBJECTIVE

For studies using real-world data, accurately identifying patients with phenotypes of interest is challenging. To identify cohorts of interest, most studies exclusively use the International Classification of Diseases (ICD) billing codes, which can be limiting. We developed a method to accurately identify the presence or absence of 3 common ocular diseases (diabetic retinopathy [DR], age-related macular degeneration [AMD], and glaucoma) using electronic health record (EHR) data.

DESIGN

Database study.

PARTICIPANTS

Three thousand nine hundred fourteen eyes from 1957 patients at 2 Sight OUtcomes Research CollaborativE (SOURCE) Ophthalmology Data Repository sites.

METHODS

We developed enhanced phenotype identification (EPI) algorithms that search EHR fields, including eye examination findings, orders, charges, medication prescriptions, and surgery data for evidence that a patient has glaucoma, DR, or AMD. We trained our EPI models using gold standard assessments of the EHR by ophthalmologists for the presence/absence of these conditions, compared the performance of our EPI models to models developed using ICD codes alone, and validated the performance of model using data from another SOURCE site.

MAIN OUTCOME MEASURES

Area under the receiver operating curve (AUC), area under the precision-recall curve (AUPRC), and model calibration.

RESULTS

The AUCs of our EPI models were better than ICD-only models for glaucoma (0.97 vs. 0.90), DR (0.997 vs. 0.98), and AMD (0.99 vs. 0.95). The AUPRCs of our EPI models were also much better than ICD-only models for glaucoma (0.79 vs. 0.32), DR (0.96 vs. 0.84), and AMD (0.74 vs. 0.55). When testing on patients from a second SOURCE site, the AUC and AUPRC for glaucoma (0.93, 0.74), DR (0.98, 0.77), and AMD (0.96, 0.64) were slightly worse than the primary site but still quite high. However, for all 3 conditions, model calibration was worse at the second site.

CONCLUSIONS

Leveraging machine learning, we developed EPI models to accurately identify most patients with glaucoma, DR, and AMD in real-world datasets. The EPI models significantly outperform ICD-only models in identifying patients confirmed to have these conditions. These findings underscore the potential of using comprehensive EHR data combined with advanced machine learning techniques to improve the accuracy of patient phenotype identification, leading to better patient management and clinical outcomes.

FINANCIAL DISCLOSURES

Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.

摘要

目的

对于使用真实世界数据的研究而言,准确识别具有感兴趣表型的患者具有挑战性。为了识别感兴趣的队列,大多数研究仅使用国际疾病分类(ICD)计费代码,这可能存在局限性。我们开发了一种方法,可利用电子健康记录(EHR)数据准确识别3种常见眼病(糖尿病性视网膜病变[DR]、年龄相关性黄斑变性[AMD]和青光眼)的存在与否。

设计

数据库研究。

参与者

来自2个视力结果研究协作组(SOURCE)眼科数据存储库站点的1957名患者的3914只眼睛。

方法

我们开发了增强型表型识别(EPI)算法,该算法在EHR字段中搜索,包括眼部检查结果、医嘱、费用、药物处方和手术数据,以寻找患者患有青光眼、DR或AMD的证据。我们使用眼科医生对EHR的金标准评估来训练我们的EPI模型,以确定这些疾病的存在与否,将我们的EPI模型的性能与仅使用ICD代码开发的模型进行比较,并使用来自另一个SOURCE站点的数据验证模型的性能。

主要结局指标

受试者工作特征曲线下面积(AUC)、精确召回率曲线下面积(AUPRC)和模型校准。

结果

我们的EPI模型在青光眼(0.97对0.90)、DR(0.997对0.98)和AMD(0.99对0.95)方面的AUC优于仅使用ICD的模型。我们的EPI模型在青光眼(0.79对0.32)、DR(0.96对0.84)和AMD(0.74对0.55)方面的AUPRC也远优于仅使用ICD的模型。在对来自第二个SOURCE站点的患者进行测试时,青光眼(0.93,0.74)、DR(0.98,0.77)和AMD(0.96,0.64)的AUC和AUPRC略低于主要站点,但仍然相当高。然而,对于所有3种疾病,模型校准在第二个站点更差。

结论

利用机器学习,我们开发了EPI模型,以准确识别真实世界数据集中大多数患有青光眼、DR和AMD的患者。在识别确诊患有这些疾病的患者方面,EPI模型显著优于仅使用ICD的模型。这些发现强调了使用综合EHR数据结合先进机器学习技术来提高患者表型识别准确性的潜力,从而实现更好的患者管理和临床结局。

财务披露

在本文末尾的脚注和披露中可能会发现专有或商业披露信息。

相似文献

1
Enhanced Phenotype Identification of Common Ocular Diseases in Real-World Datasets.真实世界数据集中常见眼病的增强型表型识别
Ophthalmol Sci. 2025 Jan 24;5(4):100717. doi: 10.1016/j.xops.2025.100717. eCollection 2025 Jul-Aug.
2
Evaluation of an Algorithm for Identifying Ocular Conditions in Electronic Health Record Data.评估一种在电子健康记录数据中识别眼部疾病的算法。
JAMA Ophthalmol. 2019 May 1;137(5):491-497. doi: 10.1001/jamaophthalmol.2018.7051.
3
Prediction Models for Glaucoma in a Multicenter Electronic Health Records Consortium: The Sight Outcomes Research Collaborative.多中心电子健康记录联盟中青光眼的预测模型:视力结果研究协作组
Ophthalmol Sci. 2023 Dec 6;4(3):100445. doi: 10.1016/j.xops.2023.100445. eCollection 2024 May-Jun.
4
Deep-Learning-Aided Diagnosis of Diabetic Retinopathy, Age-Related Macular Degeneration, and Glaucoma Based on Structural and Angiographic OCT.基于结构和血管造影光学相干断层扫描的深度学习辅助诊断糖尿病性视网膜病变、年龄相关性黄斑变性和青光眼。
Ophthalmol Sci. 2022 Nov 9;3(1):100245. doi: 10.1016/j.xops.2022.100245. eCollection 2023 Mar.
5
Comparison of Diagnosis Codes to Clinical Notes in Classifying Patients with Diabetic Retinopathy.糖尿病视网膜病变患者分类中诊断编码与临床记录的比较
Ophthalmol Sci. 2024 Jun 14;4(6):100564. doi: 10.1016/j.xops.2024.100564. eCollection 2024 Nov-Dec.
6
Improving the Identification of Diabetic Retinopathy and Related Conditions in the Electronic Health Record Using Natural Language Processing Methods.使用自然语言处理方法改善电子健康记录中糖尿病视网膜病变及相关病症的识别
Ophthalmol Sci. 2024 Jul 18;4(6):100578. doi: 10.1016/j.xops.2024.100578. eCollection 2024 Nov-Dec.
7
Machine Learning Methods Using Artificial Intelligence Deployed on Electronic Health Record Data for Identification and Referral of At-Risk Patients From Primary Care Physicians to Eye Care Specialists: Retrospective, Case-Controlled Study.利用人工智能的机器学习方法应用于电子健康记录数据,以识别有风险的患者并将其从初级保健医生转诊至眼科专科医生:回顾性病例对照研究。
JMIR AI. 2024 Mar 12;3:e48295. doi: 10.2196/48295.
8
Validity of Administrative Claims and Electronic Health Registry Data From a Single Practice for Eye Health Surveillance.单一实践的行政索赔和电子健康记录数据在眼部健康监测中的有效性。
JAMA Ophthalmol. 2023 Jun 1;141(6):534-541. doi: 10.1001/jamaophthalmol.2023.1263.
9
Development and Validation of a Deep Learning System for Diabetic Retinopathy and Related Eye Diseases Using Retinal Images From Multiethnic Populations With Diabetes.使用来自多民族糖尿病患者群体的视网膜图像开发并验证用于糖尿病视网膜病变及相关眼病的深度学习系统
JAMA. 2017 Dec 12;318(22):2211-2223. doi: 10.1001/jama.2017.18152.
10
Variations in Electronic Health Record-Based Definitions of Diabetic Retinopathy Cohorts: A Literature Review and Quantitative Analysis.基于电子健康记录的糖尿病视网膜病变队列定义的差异:文献综述与定量分析
Ophthalmol Sci. 2024 Jan 24;4(4):100468. doi: 10.1016/j.xops.2024.100468. eCollection 2024 Jul-Aug.

引用本文的文献

1
Ensemble learning to enhance accurate identification of patients with glaucoma using electronic health records.使用电子健康记录的集成学习以提高青光眼患者的准确识别
JAMIA Open. 2025 Aug 10;8(4):ooaf080. doi: 10.1093/jamiaopen/ooaf080. eCollection 2025 Aug.

本文引用的文献

1
Prediction Models for Glaucoma in a Multicenter Electronic Health Records Consortium: The Sight Outcomes Research Collaborative.多中心电子健康记录联盟中青光眼的预测模型:视力结果研究协作组
Ophthalmol Sci. 2023 Dec 6;4(3):100445. doi: 10.1016/j.xops.2023.100445. eCollection 2024 May-Jun.
2
Disparities in Retinal Vein Occlusion Presentation and Initiation of Anti-VEGF Therapy: An Academy IRIS® Registry Analysis.视网膜静脉阻塞的表现差异及抗VEGF治疗的起始情况:一项美国眼科学会IRIS®注册研究分析
Ophthalmol Retina. 2024 Jul;8(7):657-665. doi: 10.1016/j.oret.2024.01.017. Epub 2024 Jan 24.
3
Optic Neuritis and Cranial Neuropathies Diagnosis Rates before Coronavirus Disease 2019, in the Initial Pandemic Phase, and Post-Vaccine Introduction.视神经炎和颅神经病的诊断率在 2019 年冠状病毒病之前、大流行初期和疫苗接种后。
Ophthalmology. 2024 Jan;131(1):78-86. doi: 10.1016/j.ophtha.2023.08.021. Epub 2023 Aug 25.
4
Phenome-Wide Association Studies.全表型组关联研究
JAMA. 2022 Jan 4;327(1):75-76. doi: 10.1001/jama.2021.20356.
5
Development and Evaluation of a Rules-based Algorithm for Primary Open-Angle Glaucoma in the VA Million Veteran Program.基于规则算法在退伍军人事务部百万老兵计划中原发性开角型青光眼的开发与评估。
Ophthalmic Epidemiol. 2022 Dec;29(6):640-648. doi: 10.1080/09286586.2021.1992784. Epub 2021 Nov 25.
6
Reoperation Rates and Disease Costs for Primary Open-Angle Glaucoma Patients in the United States Treated with Incisional Glaucoma Surgery.美国接受切口性青光眼手术治疗的原发性开角型青光眼患者的再次手术率和疾病成本。
Ophthalmol Glaucoma. 2022 May-Jun;5(3):297-305. doi: 10.1016/j.ogla.2021.10.011. Epub 2021 Oct 27.
7
Clinical and economic burden of neovascular age-related macular degeneration by disease status: a US claims-based analysis.基于美国索赔数据分析的湿性年龄相关性黄斑变性疾病状况的临床和经济负担。
J Manag Care Spec Pharm. 2021 Sep;27(9):1260-1272. doi: 10.18553/jmcp.2021.27.9.1260.
8
Validity of Using Billing Codes From Electronic Health Records to Estimate Skin Cancer Counts.利用电子健康记录中的计费代码估算皮肤癌计数的有效性。
JAMA Dermatol. 2021 Sep 1;157(9):1089-1094. doi: 10.1001/jamadermatol.2021.2856.
9
Increasing Incidence and Prevalence of Common Retinal Diseases in Retina Practices Across the United States.美国各地视网膜诊所常见视网膜疾病的发病率和患病率不断增加。
Ophthalmic Surg Lasers Imaging Retina. 2021 Jan 1;52(1):29-36. doi: 10.3928/23258160-20201223-06.
10
Clinical and Economic Burden of Glaucoma by Disease Severity: A United States Claims-Based Analysis.基于美国理赔数据分析青光眼严重程度对临床和经济负担的影响。
Ophthalmol Glaucoma. 2021 Sep-Oct;4(5):490-503. doi: 10.1016/j.ogla.2020.12.007. Epub 2021 Feb 11.