Department of Computer Graphics and Image Processing, University of Debrecen, Faculty of Informatics, Debrecen, Hungary.
BMC Ophthalmol. 2013 Aug 7;13(1):40. doi: 10.1186/1471-2415-13-40.
The aim of the project was to develop a novel method for diabetic retinopathy screening based on the examination of tear fluid biomarker changes. In order to evaluate the usability of protein biomarkers for pre-screening purposes several different approaches were used, including machine learning algorithms.
All persons involved in the study had diabetes. Diabetic retinopathy (DR) was diagnosed by capturing 7-field fundus images, evaluated by two independent ophthalmologists. 165 eyes were examined (from 119 patients), 55 were diagnosed healthy and 110 images showed signs of DR. Tear samples were taken from all eyes and state-of-the-art nano-HPLC coupled ESI-MS/MS mass spectrometry protein identification was performed on all samples. Applicability of protein biomarkers was evaluated by six different optimally parameterized machine learning algorithms: Support Vector Machine, Recursive Partitioning, Random Forest, Naive Bayes, Logistic Regression, K-Nearest Neighbor.
Out of the six investigated machine learning algorithms the result of Recursive Partitioning proved to be the most accurate. The performance of the system realizing the above algorithm reached 74% sensitivity and 48% specificity.
Protein biomarkers selected and classified with machine learning algorithms alone are at present not recommended for screening purposes because of low specificity and sensitivity values. This tool can be potentially used to improve the results of image processing methods as a complementary tool in automatic or semiautomatic systems.
本项目旨在开发一种基于泪液生物标志物变化检测的新型糖尿病视网膜病变筛查方法。为了评估蛋白质生物标志物在预筛查中的可用性,我们使用了包括机器学习算法在内的几种不同方法。
所有参与研究的人都患有糖尿病。糖尿病视网膜病变(DR)通过拍摄 7 个视野的眼底图像进行诊断,由两名独立的眼科医生进行评估。共检查了 165 只眼睛(来自 119 名患者),其中 55 只眼睛被诊断为健康,110 只眼睛显示出 DR 的迹象。从所有眼睛中采集泪样,并对所有样本进行最先进的纳米 HPLC 结合 ESI-MS/MS 质谱蛋白质鉴定。我们使用六种经过最佳参数化的机器学习算法(支持向量机、递归分区、随机森林、朴素贝叶斯、逻辑回归、K 最近邻)评估蛋白质生物标志物的适用性。
在六种被研究的机器学习算法中,递归分区的结果被证明是最准确的。实现上述算法的系统性能达到了 74%的灵敏度和 48%的特异性。
由于特异性和灵敏度值较低,目前不建议单独使用基于机器学习算法选择和分类的蛋白质生物标志物进行筛查。该工具可作为自动或半自动系统中的补充工具,用于改善图像处理方法的结果。