School of Basic Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai, China.
Shanghai Collaborative Innovation Center of Health Service in Traditional Chinese Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai, China.
J Biomed Inform. 2021 Mar;115:103693. doi: 10.1016/j.jbi.2021.103693. Epub 2021 Feb 1.
Diabetics has become a serious public health burden in China. Multiple complications appear with the progression of diabetics pose a serious threat to the quality of human life and health. We can prevent the progression of prediabetics to diabetics and delay the progression to diabetics by early identification of diabetics and prediabetics and timely intervention, which have positive significance for improving public health.
Using machine learning techniques, we establish the noninvasive diabetics risk prediction model based on tongue features fusion and predict the risk of prediabetics and diabetics.
Applying the type TFDA-1 Tongue Diagnosis Instrument, we collect tongue images, extract tongue features including color and texture features using TDAS, and extract the advanced tongue features with ResNet-50, achieve the fusion of the two features with GA_XGBT, finally establish the noninvasive diabetics risk prediction model and evaluate the performance of testing effectiveness.
Cross-validation suggests the best performance of GA_XGBT model with fusion features, whose average CA is 0.821, the average AUROC is 0.924, the average AUPRC is 0.856, the average Precision is 0.834, the average Recall is 0.822, the average F1-score is 0.813. Test set suggests the best testing performance of GA_XGBT model, whose average CA is 0.81, the average AUROC is 0.918, the average AUPRC is 0.839, the average Precision is 0.821, the average Recall is 0.81, the average F1-score is 0.796. When we test prediabetics with GA_XGBT model, we find that the AUROC is 0.914, the Precision is 0.69, the Recall is 0.952, the F1-score is 0.8. When we test diabetics with GA_XGBT model, we find that the AUROC is 0.984, the Precision is 0.929, the Recall is 0.951, the F1-score is 0.94.
Based on tongue features, the study uses classical machine learning algorithm and deep learning algorithm to maximum the respective advantages. We combine the prior knowledge and potential features together, establish the noninvasive diabetics risk prediction model with features fusion algorithm, and detect prediabetics and diabetics noninvasively. Our study presents a feasible method for establishing the association between diabetics and the tongue image information and prove that tongue image information is a potential marker which facilitates effective early diagnosis of prediabetics and diabetics.
糖尿病已成为中国严重的公共卫生负担。糖尿病的多种并发症的出现对人类生活和健康质量构成了严重威胁。通过早期识别糖尿病患者和糖尿病前期患者并及时干预,可以预防糖尿病前期向糖尿病的进展,延缓向糖尿病的进展,这对改善公众健康具有积极意义。
运用机器学习技术,建立基于舌象特征融合的无创糖尿病风险预测模型,预测糖尿病前期和糖尿病的发病风险。
采用 TFDA-1 型中医舌诊仪采集舌象,利用 TDAS 提取舌象的颜色和纹理特征,采用 ResNet-50 提取舌象的高级特征,运用 GA_XGBT 实现两种特征的融合,最终建立无创糖尿病风险预测模型并评价其检测效能。
交叉验证表明,融合特征的 GA_XGBT 模型具有最佳性能,其平均 CA 为 0.821,平均 AUROC 为 0.924,平均 AUPRC 为 0.856,平均 Precision 为 0.834,平均 Recall 为 0.822,平均 F1-score 为 0.813。测试集表明,GA_XGBT 模型的测试性能最佳,其平均 CA 为 0.81,平均 AUROC 为 0.918,平均 AUPRC 为 0.839,平均 Precision 为 0.821,平均 Recall 为 0.81,平均 F1-score 为 0.796。当我们用 GA_XGBT 模型检测糖尿病前期患者时,发现 AUROC 为 0.914,精度为 0.69,召回率为 0.952,F1-score 为 0.8。当我们用 GA_XGBT 模型检测糖尿病患者时,发现 AUROC 为 0.984,精度为 0.929,召回率为 0.951,F1-score 为 0.94。
本研究基于舌象特征,运用经典机器学习算法和深度学习算法,充分发挥各自优势。融合先验知识和潜在特征,建立特征融合算法的无创糖尿病风险预测模型,实现糖尿病前期和糖尿病的无创检测。本研究为建立糖尿病与舌象信息之间的关联提供了一种可行的方法,并证明了舌象信息是一种潜在的标志物,有助于对糖尿病前期和糖尿病进行有效的早期诊断。