Politecnico di Milano, Milan, Italy.
CNR-IEIIT, Turin, Italy.
Stud Health Technol Inform. 2024 Aug 22;316:736-740. doi: 10.3233/SHTI240519.
This study leverages data from a Canadian database of primary care Electronic Medical Records to develop machine learning models predicting type 2 diabetes mellitus (T2D), prediabetes, or normoglycemia. These models are used as a basis for extracting counterfactual explanations and derive personalized changes in biomarkers to prevent T2D onset, particularly in the still reversible prediabetic state. The models achieve satisfactory performance. Furthermore, feature importance analysis underscores the significance of fasting blood sugar and glycated hemoglobin, while counterfactuals explanations emphasize the centrality of keeping body mass index and cholesterol indicators within or close to the clinically desirable ranges. This research highlights the potential of machine learning and counterfactual explanations in guiding preventive interventions that may help slow down the progression from prediabetes to T2D on an individual basis, eventually fostering a recovery from prediabetes to a normoglycemic state.
本研究利用加拿大初级保健电子病历数据库中的数据,开发了用于预测 2 型糖尿病(T2D)、糖尿病前期或血糖正常的机器学习模型。这些模型被用作提取反事实解释和推断预防 T2D 发病的生物标志物个性化变化的基础,特别是在仍然可逆的糖尿病前期状态。该模型取得了令人满意的性能。此外,特征重要性分析强调了空腹血糖和糖化血红蛋白的重要性,而反事实解释则强调了将体重指数和胆固醇指标保持在或接近临床理想范围内的重要性。这项研究强调了机器学习和反事实解释在指导预防干预方面的潜力,这些干预可能有助于减缓从糖尿病前期到 T2D 的进展,最终促进从糖尿病前期恢复到血糖正常状态。