Wang Kesheng, Xu Kunhui, Chen Xiaoyu, He Chunlei, Zhang Jianfeng, Li Fenfen, Xiao Chun, Zhang Yu, Wang Ying, Yang Weihua, Kong Dexing, Huang Shoujun, Dai Qi
College of Mathematical Medicine, Zhejiang Normal University, Jinhua, China.
National Clinical Research Center for Ocular Diseases, Eye Hospital, Wenzhou Medical University, Wenzhou, China.
Quant Imaging Med Surg. 2025 May 1;15(5):4071-4084. doi: 10.21037/qims-24-1948. Epub 2025 Apr 11.
The tear meniscus height (TMH) is an important index for the diagnosis of dry eye. However, special inspection doctors are required to make rapid TMH measurements during outpatient examinations, which often leads to substantial measurement errors. At the same time, the existing artificial intelligence (AI) model of TMH segmentation has poor generalization because it only uses one mode of TMH pictures and does not include inspection of external verification sets. The purpose of this study was to propose an automatic measurement method for TMH based on convolutional neural networks (CNNs) to handle diverse datasets.
This multicenter retrospective study included 3,894 TMH images from five centers across four regions in eastern, southern, and western China. The images were annotated using a gradient information-guided human-computer collaborative method, and an attention-limiting neural network (ALNN) was developed. An internal dataset, consisting of 834 color images and 1,105 infrared images from three centers, was constructed for model development and validation. An external validation set, comprising 996 color images and 959 infrared images from two additional centers, was used to test the model's generalizability. The accuracy of AI segmentation results was compared with the inspection reports of special inspection doctors.
In the test set for the color image modality, the segmentation results showed an average mean intersection over union (MIoU) of 0.9578, a recall rate of 0.9648, a precision of 0.9526, and an F1 score of 0.9576. The TMH results obtained on the test set (r=0.935, P<0.001) and on the external validation set (r=0.957, P<0.001) both showed a high correlation with the ground truth (GT). For the infrared image modality, the test set segmentation results showed an average MIoU of 0.9290, a recall rate of 0.9150, a precision of 0.9388, and an F1 score of 0.9249. The TMH results obtained on the test set (r=0.855, P<0.001) and on the external validation set (r=0.803, P<0.001) both showed a high correlation with the GT.
This algorithm exhibits strong generalization capabilities, accurately segments key areas, and automatically provides quantitative analysis of the TMH. The measurements obtained using this AI algorithm exhibit high consistency with the GT, surpassing the reliability of special inspection doctors. This provides significant support in the diagnosis of dry eye disease (DED).
泪液半月板高度(TMH)是诊断干眼症的重要指标。然而,门诊检查时需要专业检查医生快速测量TMH,这常常导致较大的测量误差。同时,现有的TMH分割人工智能(AI)模型泛化能力较差,因为它仅使用一种TMH图片模式,且未纳入外部验证集的检验。本研究的目的是提出一种基于卷积神经网络(CNN)的TMH自动测量方法,以处理多样的数据集。
这项多中心回顾性研究纳入了来自中国东部、南部和西部四个地区五个中心的3894张TMH图像。采用梯度信息引导的人机协作方法对图像进行标注,并开发了一种注意力限制神经网络(ALNN)。构建了一个内部数据集,由来自三个中心的834张彩色图像和1105张红外图像组成,用于模型开发和验证。一个外部验证集,包含来自另外两个中心的996张彩色图像和959张红外图像,用于测试模型的泛化能力。将AI分割结果的准确性与专业检查医生的检查报告进行比较。
在彩色图像模态的测试集中,分割结果显示平均交并比(MIoU)为0.9578,召回率为0.9648,精确率为0.9526,F1分数为0.9576。在测试集(r = 0.935,P < 0.001)和外部验证集(r = 0.957,P < 0.001)上获得的TMH结果与真实值(GT)均显示出高度相关性。对于红外图像模态,测试集分割结果显示平均MIoU为0.9290,召回率为0.9150,精确率为0.9388,F1分数为0.9249。在测试集(r = 0.855,P < 0.001)和外部验证集(r = 0.803,P < 0.001)上获得的TMH结果与GT均显示出高度相关性。
该算法具有很强的泛化能力,能准确分割关键区域,并自动提供TMH的定量分析。使用该AI算法获得的测量结果与GT具有高度一致性,超过了专业检查医生的可靠性。这为干眼症(DED)的诊断提供了重要支持。