Hoyos William, Hoyos Kenia, Ruiz Rander, Aguilar Jose
Grupo de Investigación ISI, Universidad Cooperativa de Colombia, Montería, Colombia.
Grupo de Investigación en I+D+i en TIC, Universidad EAFIT, Medellín, Colombia.
BMC Med Inform Decis Mak. 2024 Dec 18;24(1):383. doi: 10.1186/s12911-024-02810-x.
Diabetes mellitus (DM) is a chronic disease prevalent worldwide, requiring a multifaceted analytical approach to improve early detection and subsequent mitigation of morbidity and mortality rates. This research aimed to develop an explainable analysis of DM by combining sociodemographic and clinical data with statistical and artificial intelligence (AI) techniques.
Leveraging a small dataset that includes sociodemographic and clinical profiles of diabetic and non-diabetic individuals, we employed a diverse set of statistical and AI models for predictive purposes and assessment of DM risk factors. The statistical tests used were Student's t-test and Chi-square, while the AI techniques were fuzzy cognitive maps (FCM), artificial neural networks (ANN), support vector machines (SVM), and XGBoost.
Our statistical models facilitated an in-depth exploration of variable associations, while the resulting AI models demonstrated exceptional efficacy in DM classification. In particular, the XGBoost model showed superior performance in accuracy, sensitivity and specificity with values of 1 for each of these metrics. On the other hand, the FCM stood out for its explainability capabilities by allowing an analysis of the variables involved in the prediction using scenario-based simulations.
An integrated analysis of DM using a variety of methodologies is critical for timely detection of the disease and informed clinical decision-making.
糖尿病(DM)是一种在全球范围内普遍存在的慢性疾病,需要采用多方面的分析方法来改善早期检测,并随后降低发病率和死亡率。本研究旨在通过将社会人口统计学和临床数据与统计及人工智能(AI)技术相结合,对糖尿病进行可解释的分析。
利用一个包含糖尿病患者和非糖尿病患者社会人口统计学及临床特征的小型数据集,我们采用了多种统计和AI模型进行预测,并评估糖尿病风险因素。使用的统计检验为学生t检验和卡方检验,而AI技术包括模糊认知图(FCM)、人工神经网络(ANN)、支持向量机(SVM)和XGBoost。
我们的统计模型有助于深入探索变量之间的关联,而所得的AI模型在糖尿病分类中表现出卓越的功效。特别是,XGBoost模型在准确性、敏感性和特异性方面表现出色,这些指标的值均为1。另一方面,FCM因其可解释性而脱颖而出,它允许通过基于场景的模拟来分析预测中涉及的变量。
使用多种方法对糖尿病进行综合分析对于及时发现疾病和做出明智的临床决策至关重要。