• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用自然语言处理技术在电子健康记录中识别不同的晶状体病变。

Using Natural Language Processing to Identify Different Lens Pathology in Electronic Health Records.

机构信息

From the W.K. Kellogg Eye Center, Department of Ophthalmology and Visual Sciences, University of Michigan, Ann Arbor, Michigan, USA (J.D.S., Y.Z., C.A.A., J.B.); Department of Health Management and Policy, University of Michigan School of Public Health, Ann Arbor, Michigan, USA (J.D.S.).

From the W.K. Kellogg Eye Center, Department of Ophthalmology and Visual Sciences, University of Michigan, Ann Arbor, Michigan, USA (J.D.S., Y.Z., C.A.A., J.B.).

出版信息

Am J Ophthalmol. 2024 Jun;262:153-160. doi: 10.1016/j.ajo.2024.01.030. Epub 2024 Feb 1.

DOI:10.1016/j.ajo.2024.01.030
PMID:38296152
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11098689/
Abstract

PURPOSE

Nearly all published ophthalmology-related Big Data studies rely exclusively on International Classification of Diseases (ICD) billing codes to identify patients with particular ocular conditions. However, inaccurate or nonspecific codes may be used. We assessed whether natural language processing (NLP), as an alternative approach, could more accurately identify lens pathology.

DESIGN

Database study comparing the accuracy of NLP versus ICD billing codes to properly identify lens pathology.

METHODS

We developed an NLP algorithm capable of searching free-text lens exam data in the electronic health record (EHR) to identify the type(s) of cataract present, cataract density, presence of intraocular lenses, and other lens pathology. We applied our algorithm to 17.5 million lens exam records in the Sight Outcomes Research Collaborative (SOURCE) repository. We selected 4314 unique lens-exam entries and asked 11 clinicians to assess whether all pathology present in the entries had been correctly identified in the NLP algorithm output. The algorithm's sensitivity at accurately identifying lens pathology was compared with that of the ICD codes.

RESULTS

The NLP algorithm correctly identified all lens pathology present in 4104 of the 4314 lens-exam entries (95.1%). For less common lens pathology, algorithm findings were corroborated by reviewing clinicians for 100% of mentions of pseudoexfoliation material and 99.7% for phimosis, subluxation, and synechia. Sensitivity at identifying lens pathology was better for NLP (0.98 [0.96-0.99] than for billing codes (0.49 [0.46-0.53]).

CONCLUSIONS

Our NLP algorithm identifies and classifies lens abnormalities routinely documented by eye-care professionals with high accuracy. Such algorithms will help researchers to properly identify and classify ocular pathology, broadening the scope of feasible research using real-world data.

摘要

目的

几乎所有已发表的眼科相关大数据研究都仅依赖国际疾病分类(ICD)计费代码来识别患有特定眼部疾病的患者。然而,计费代码可能存在不准确或不明确的情况。我们评估了自然语言处理(NLP)作为替代方法是否可以更准确地识别晶状体病变。

设计

比较 NLP 与 ICD 计费代码准确性以正确识别晶状体病变的数据库研究。

方法

我们开发了一种 NLP 算法,能够在电子健康记录(EHR)中搜索自由文本晶状体检查数据,以识别存在的白内障类型、白内障密度、人工晶状体的存在以及其他晶状体病变。我们将我们的算法应用于 SOURCE 存储库中的 1750 万份晶状体检查记录。我们选择了 4314 个独特的晶状体检查条目,并要求 11 名临床医生评估条目内的所有病理是否都在 NLP 算法输出中得到正确识别。比较了算法识别晶状体病变的准确性与 ICD 代码的准确性。

结果

NLP 算法正确识别了 4314 个晶状体检查条目中的 4104 个(95.1%)存在的所有晶状体病变。对于不太常见的晶状体病变,对于假剥脱物质的提及,算法结果得到了 100%的临床医生的证实,对于 99.7%的病例,对于膜性外翻、脱位和粘连的提及,算法结果也得到了证实。识别晶状体病变的敏感性方面,NLP(0.98 [0.96-0.99])优于计费代码(0.49 [0.46-0.53])。

结论

我们的 NLP 算法以高精度识别和分类眼科医生常规记录的晶状体异常。此类算法将帮助研究人员正确识别和分类眼部病变,扩大使用真实世界数据进行可行研究的范围。

相似文献

1
Using Natural Language Processing to Identify Different Lens Pathology in Electronic Health Records.利用自然语言处理技术在电子健康记录中识别不同的晶状体病变。
Am J Ophthalmol. 2024 Jun;262:153-160. doi: 10.1016/j.ajo.2024.01.030. Epub 2024 Feb 1.
2
De Novo Natural Language Processing Algorithm Accurately Identifies Myxofibrosarcoma From Pathology Reports.全新自然语言处理算法可从病理报告中准确识别黏液纤维肉瘤。
Clin Orthop Relat Res. 2025 Jan 1;483(1):80-87. doi: 10.1097/CORR.0000000000003270. Epub 2024 Oct 2.
3
Development and Validation of a Rule-Based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults: Retrospective Analysis.用于识别老年人住院记录中跌倒事件的基于规则的自然语言处理算法的开发与验证:回顾性分析
JMIR Aging. 2025 Jul 8;8:e65195. doi: 10.2196/65195.
4
Trifocal versus extended depth of focus (EDOF) intraocular lenses after cataract extraction.白内障摘除术后三焦点与扩展景深(EDOF)人工晶状体的比较。
Cochrane Database Syst Rev. 2024 Jul 10;7(7):CD014891. doi: 10.1002/14651858.CD014891.pub2.
5
Trifocal intraocular lenses versus bifocal intraocular lenses after cataract extraction among participants with presbyopia.多焦点人工晶状体与白内障摘除术后老视患者的双焦点人工晶状体比较。
Cochrane Database Syst Rev. 2023 Jan 27;1(1):CD012648. doi: 10.1002/14651858.CD012648.pub3.
6
Identifying Diabetes Related-Complications in a Real-World Free-Text Electronic Medical Records in Hebrew Using Natural Language Processing Techniques.使用自然语言处理技术在真实世界的希伯来语自由文本电子病历中识别糖尿病相关并发症。
J Diabetes Sci Technol. 2024 Jan 30:19322968241228555. doi: 10.1177/19322968241228555.
7
Performance of Natural Language Processing versus International Classification of Diseases Codes in Building Registries for Patients With Fall Injury: Retrospective Analysis.自然语言处理与国际疾病分类编码在构建跌倒损伤患者登记册中的性能:回顾性分析
JMIR Med Inform. 2025 Jul 14;13:e66973. doi: 10.2196/66973.
8
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
9
Intraocular lens optic edge design for the prevention of posterior capsule opacification after cataract surgery.白内障手术后预防后囊膜混浊的人工晶状体光学边缘设计。
Cochrane Database Syst Rev. 2021 Aug 16;8(8):CD012516. doi: 10.1002/14651858.CD012516.pub2.
10
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.

引用本文的文献

1
Using Natural Language Processing and Machine Learning to classify the status of kidney allograft in Electronic Medical Records written in Spanish.使用自然语言处理和机器学习对西班牙语电子病历中同种异体肾移植的状态进行分类。
PLoS One. 2025 May 8;20(5):e0322587. doi: 10.1371/journal.pone.0322587. eCollection 2025.
2
ChatGPT-Assisted Classification of Postoperative Bleeding Following Microinvasive Glaucoma Surgery Using Electronic Health Record Data.使用电子健康记录数据通过ChatGPT辅助对微创青光眼手术后的出血情况进行分类
Ophthalmol Sci. 2024 Aug 23;5(1):100602. doi: 10.1016/j.xops.2024.100602. eCollection 2025 Jan-Feb.
3

本文引用的文献

1
Validity of Administrative Claims and Electronic Health Registry Data From a Single Practice for Eye Health Surveillance.单一实践的行政索赔和电子健康记录数据在眼部健康监测中的有效性。
JAMA Ophthalmol. 2023 Jun 1;141(6):534-541. doi: 10.1001/jamaophthalmol.2023.1263.
2
Assessing the Quality of Big Data Is Critical as the Stakes Increase.随着风险增加,评估大数据质量至关重要。
JAMA Ophthalmol. 2023 Jun 1;141(6):541-542. doi: 10.1001/jamaophthalmol.2023.1561.
3
Causes of Childhood Blindness in the United States Using the IRIS® Registry (Intelligent Research in Sight).
Using Electronic Health Record Data to Determine the Safety of Aqueous Humor Liquid Biopsies for Molecular Analyses.
利用电子健康记录数据确定用于分子分析的房水液体活检的安全性。
Ophthalmol Sci. 2024 Mar 19;4(5):100517. doi: 10.1016/j.xops.2024.100517. eCollection 2024 Sep-Oct.
美国儿童失明的病因研究——IRIS® 注册研究(智能视觉研究)。
Ophthalmology. 2023 Sep;130(9):907-913. doi: 10.1016/j.ophtha.2023.04.004. Epub 2023 Apr 8.
4
Applications of natural language processing in ophthalmology: present and future.自然语言处理在眼科中的应用:现状与未来。
Front Med (Lausanne). 2022 Aug 8;9:906554. doi: 10.3389/fmed.2022.906554. eCollection 2022.
5
An IRIS Registry-Based Assessment of Primary Open-Angle Glaucoma Practice Patterns in Academic Versus Nonacademic Settings.基于 IRIS 注册的学术环境与非学术环境下原发性开角型青光眼治疗模式评估。
Am J Ophthalmol. 2022 Oct;242:228-242. doi: 10.1016/j.ajo.2022.04.006. Epub 2022 Apr 22.
6
Prevalence of pediatric eye disease in the optumlabs data warehouse.Optumlabs 数据仓库中儿科眼病的患病率。
Ophthalmic Epidemiol. 2022 Oct;29(5):537-544. doi: 10.1080/09286586.2021.1971261. Epub 2021 Aug 29.
7
Endophthalmitis Rate in Immediately Sequential versus Delayed Sequential Bilateral Cataract Surgery within the Intelligent Research in Sight (IRIS®) Registry Data.立即序与延迟序双侧白内障手术在 Intelligent Research in Sight(IRIS®)注册数据中的眼内炎发生率。
Ophthalmology. 2022 Feb;129(2):129-138. doi: 10.1016/j.ophtha.2021.07.008. Epub 2021 Jul 13.
8
Gradient Boosting Decision Tree Algorithm for the Prediction of Postoperative Intraocular Lens Position in Cataract Surgery.用于预测白内障手术中人工晶状体术后位置的梯度提升决策树算法
Transl Vis Sci Technol. 2020 Dec 21;9(13):38. doi: 10.1167/tvst.9.13.38. eCollection 2020 Dec.
9
Text Processing for Detection of Fungal Ocular Involvement in Critical Care Patients: Cross-Sectional Study.文本处理在重症监护患者真菌感染眼部累及检测中的应用:一项横断面研究。
J Med Internet Res. 2020 Aug 14;22(8):e18855. doi: 10.2196/18855.
10
Application of the Sight Outcomes Research Collaborative Ophthalmology Data Repository for Triaging Patients With Glaucoma and Clinic Appointments During Pandemics Such as COVID-19.在 COVID-19 等大流行期间,Sight Outcomes Research Collaborative Ophthalmology Data Repository 在青光眼患者分诊和诊所预约中的应用。
JAMA Ophthalmol. 2020 Sep 1;138(9):974-980. doi: 10.1001/jamaophthalmol.2020.2974.