• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

眼部生物测量光学字符识别:一种利用光学字符识别来提取人工晶状体生物测量值的机器学习算法。

Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements.

作者信息

Salvi Anish, Arnal Leo, Ly Kevin, Ferreira Gabriel, Wang Sophia Y, Langlotz Curtis, Mahajan Vinit, Ludwig Chase A

机构信息

School of Medicine, Stanford University, Palo Alto, CA, United States.

Department of Medicine, Chicago Medical School at Rosalind Franklin University, North Chicago, IL, United States.

出版信息

Front Artif Intell. 2025 Jan 6;7:1428716. doi: 10.3389/frai.2024.1428716. eCollection 2024.

DOI:10.3389/frai.2024.1428716
PMID:39834877
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11743993/
Abstract

Given close relationships between ocular structure and ophthalmic disease, ocular biometry measurements (including axial length, lens thickness, anterior chamber depth, and keratometry values) may be leveraged as features in the prediction of eye diseases. However, ocular biometry measurements are often stored as PDFs rather than as structured data in electronic health records. Thus, time-consuming and laborious manual data entry is required for using biometry data as a disease predictor. Herein, we used two separate models, PaddleOCR and Gemini, to extract eye specific biometric measurements from 2,965 Lenstar, 104 IOL Master 500, and 3,616 IOL Master 700 optical biometry reports. For each patient eye, our text extraction pipeline, referred to as Ocular Biometry OCR, involves 1) cropping the report to the biometric data, 2) extracting the text via the optical character recognition model, 3) post-processing the metrics and values into key value pairs, 4) correcting erroneous angles within the pairs, 5) computing the number of errors or missing values, and 6) selecting the window specific results with fewest errors or missing values. To ensure the models' predictions could be put into a machine learning-ready format, artifacts were removed from categorical text data through manual modification where necessary. Performance was evaluated by scoring PaddleOCR and Gemini results. In the absence of ground truth, higher scoring indicated greater inter-model reliability, assuming an equal value between models indicated an accurate result. The detection scores, measuring the number of valid values (i.e., not missing or erroneous), were Lenstar: 0.990, IOLM 500: 1.000, and IOLM 700: 0.998. The similarity scores, measuring the number of equal values, were Lenstar: 0.995, IOLM 500: 0.999, and IOLM 700: 0.999. The agreement scores, combining detection and similarity scores, were Lenstar: 0.985, IOLM 500: 0.999, and IOLM 700: 0.998. IOLM 500 was annotated for ground truths; in this case, higher scoring indicated greater model-to-annotator accuracy. PaddleOCR-to-Annotator achieved scores of detection: 1.000, similarity: 0.999, and agreement: 0.999. Gemini-to-Annotator achieved scores of detection: 1.000, similarity: 1.000, and agreement: 1.000. Scores range from 0 to 1. While PaddleOCR and Gemini demonstrated high agreement, PaddleOCR offered slightly better performance upon reviewing quantitative and qualitative results.

摘要

鉴于眼部结构与眼科疾病之间存在密切关系,眼部生物测量值(包括眼轴长度、晶状体厚度、前房深度和角膜曲率值)可作为预测眼部疾病的特征。然而,眼部生物测量值在电子健康记录中通常以PDF格式存储,而非结构化数据。因此,要将生物测量数据用作疾病预测指标,需要进行耗时费力的手动数据录入。在此,我们使用了两个独立的模型PaddleOCR和Gemini,从2965份Lenstar、104份IOL Master 500和3616份IOL Master 700光学生物测量报告中提取眼部特定的生物测量值。对于每只患者眼睛,我们的文本提取流程(称为眼部生物测量OCR)包括:1)将报告裁剪至生物测量数据部分;2)通过光学字符识别模型提取文本;3)将指标和值后处理为键值对;4)纠正键值对内的错误角度;5)计算错误或缺失值的数量;6)选择错误或缺失值最少的特定窗口结果。为确保模型的预测结果能够转换为适用于机器学习的格式,必要时通过手动修改从分类文本数据中去除伪像。通过对PaddleOCR和Gemini的结果进行评分来评估性能。在没有地面真值的情况下,得分越高表明模型间的可靠性越高,假设模型间得分相等则表明结果准确。检测得分衡量有效值(即非缺失或错误的值)的数量,Lenstar为0.990,IOLM 500为1.000,IOLM 700为0.998。相似性得分衡量相等值的数量,Lenstar为0.995,IOLM 500为0.999,IOLM 700为0.999。一致性得分结合了检测得分和相似性得分,Lenstar为0.985,IOLM 500为0.999,IOLM 700为0.998。IOLM 500有地面真值注释;在这种情况下,得分越高表明模型与注释者之间的准确性越高。PaddleOCR与注释者之间的检测得分为1.000,相似性得分为0.999,一致性得分为0.999。Gemini与注释者之间的检测得分为1.000,相似性得分为1.000,一致性得分为1.000。得分范围为0至1。虽然PaddleOCR和Gemini表现出高度一致性,但在审查定量和定性结果时,PaddleOCR的性能略优。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/0c395ef5fc68/frai-07-1428716-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/56fa44b822a0/frai-07-1428716-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/b3c1a989741d/frai-07-1428716-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/28b2368148bf/frai-07-1428716-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/39fde6cacca7/frai-07-1428716-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/fbf60ef8ffe0/frai-07-1428716-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/a0454268967f/frai-07-1428716-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/986d9067e6c0/frai-07-1428716-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/13203d35c494/frai-07-1428716-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/5bc68d2cac74/frai-07-1428716-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/a56b048f5f46/frai-07-1428716-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/0c395ef5fc68/frai-07-1428716-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/56fa44b822a0/frai-07-1428716-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/b3c1a989741d/frai-07-1428716-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/28b2368148bf/frai-07-1428716-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/39fde6cacca7/frai-07-1428716-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/fbf60ef8ffe0/frai-07-1428716-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/a0454268967f/frai-07-1428716-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/986d9067e6c0/frai-07-1428716-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/13203d35c494/frai-07-1428716-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/5bc68d2cac74/frai-07-1428716-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/a56b048f5f46/frai-07-1428716-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54ff/11743993/0c395ef5fc68/frai-07-1428716-g011.jpg

相似文献

1
Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements.眼部生物测量光学字符识别:一种利用光学字符识别来提取人工晶状体生物测量值的机器学习算法。
Front Artif Intell. 2025 Jan 6;7:1428716. doi: 10.3389/frai.2024.1428716. eCollection 2024.
2
Comparison of Ocular Biometry and Refractive Outcomes Using IOL Master 500, IOL Master 700, and Lenstar LS900.使用IOL Master 500、IOL Master 700和Lenstar LS900进行眼生物测量和屈光结果的比较。
Korean J Ophthalmol. 2020 Apr;34(2):126-132. doi: 10.3341/kjo.2019.0102.
3
Comparison of the biometric measurements obtained using noncontact optical biometers LenStar LS 900 and IOL Master V.5.使用非接触式光学生物测量仪LenStar LS 900和IOL Master V.5获得的生物测量数据的比较。
Klin Oczna. 2011;113(1-3):47-51.
4
[Comparison of the optical coherence method (Zeiss IOL-Master) with two ultrasonographic biometric methods for the calculation of posterior chamber intraocular lenses after phacoemulsification as part of clinical routine].[作为临床常规操作的一部分,比较光学相干法(蔡司IOL-Master)与两种超声生物测量法在白内障超声乳化术后计算后房型人工晶状体中的应用]
Klin Monbl Augenheilkd. 2004 Oct;221(10):837-42. doi: 10.1055/s-2004-813658.
5
Evaluation of the repeatability of the Lenstar and comparison with two other non-contact biometric devices in myopes.评估Lenstar的可重复性并与另外两种用于近视患者的非接触式生物测量设备进行比较。
Clin Exp Optom. 2013 Jan;96(1):92-9. doi: 10.1111/j.1444-0938.2012.00793.x. Epub 2012 Sep 3.
6
Comparison of intraocular lens power calculation by the IOLMaster in phakic and eyes with hydrophobic acrylic lenses.IOLMaster 在有晶状体眼和植入疏水丙烯酸酯人工晶状体眼中计算人工晶状体屈光度的比较。
Ophthalmology. 2009 Jul;116(7):1336-42. doi: 10.1016/j.ophtha.2009.01.035. Epub 2009 May 8.
7
Comparison of the ultrasonographic method with 2 partial coherence interferometry methods for intraocular lens power calculation.超声检查法与两种部分相干干涉测量法在人工晶状体屈光力计算中的比较。
Optometry. 2011 Mar;82(3):140-7. doi: 10.1016/j.optm.2010.07.025. Epub 2010 Oct 8.
8
[Precision of a new device for biometric measurements in pseudophakic eyes].[用于假晶状体眼生物测量的新型设备的精度]
Ophthalmologe. 2011 Aug;108(8):739-44. doi: 10.1007/s00347-011-2373-2.
9
Lenstar LS900 vs EchoScan US-800: comparison between optical and ultrasound biometry with and without contact lenses and its relationship with other biometric parameters.LS900 与 US-800 超声生物测量仪的光学与超声对比:有无接触镜时的对比及其与其他生物测量参数的关系。
Expert Rev Med Devices. 2023 Jul-Dec;20(8):681-690. doi: 10.1080/17434440.2023.2233410. Epub 2023 Jul 7.
10
Comparison of predictions made by the intraocular lens master and ultrasound biometry.人工晶状体测量仪与超声生物测量法预测结果的比较。
Arch Ophthalmol. 2008 Jul;126(7):929-33. doi: 10.1001/archopht.126.7.929.

本文引用的文献

1
High myopia and its risks.高度近视及其风险。
Community Eye Health. 2019;32(105):5-6.
2
An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition.基于图像的序列识别的端到端可训练神经网络及其在场景文本识别中的应用。
IEEE Trans Pattern Anal Mach Intell. 2017 Nov;39(11):2298-2304. doi: 10.1109/TPAMI.2016.2646371. Epub 2016 Dec 29.
3
The complex interactions of retinal, optical and environmental factors in myopia aetiology.近视病因中视网膜、光学和环境因素的复杂相互作用。
Prog Retin Eye Res. 2012 Nov;31(6):622-60. doi: 10.1016/j.preteyeres.2012.06.004. Epub 2012 Jul 4.
4
Biometry and intraocular lens power calculation.生物测量与人工晶状体屈光度计算
Curr Opin Ophthalmol. 2008 Jan;19(1):13-7. doi: 10.1097/ICU.0b013e3282f1c5ad.