Computational Molecular Biology Group, Faculty of Medicine, Chulalongkorn University, Bangkok, 10330, Thailand.
National Nanotechnology Center, National Science and Technology Development Agency, Pathum Thani, 12120, Thailand.
Alzheimers Res Ther. 2022 Aug 9;14(1):111. doi: 10.1186/s13195-022-01043-2.
Mild cognitive impairment (MCI) is an early stage of cognitive decline which could develop into dementia. An early detection of MCI is a crucial step for timely prevention and intervention. Recent studies have developed deep learning models to detect MCI and dementia using a bedside task like the classic clock drawing test (CDT). However, it remains a challenge to predict the early stage of the disease using the CDT data alone. Moreover, the state-of-the-art deep learning techniques still face black box challenges, making it questionable to implement them in a clinical setting.
We recruited 918 subjects from King Chulalongkorn Memorial Hospital (651 healthy subjects and 267 MCI patients). We propose a novel deep learning framework that incorporates data from the CDT, cube-copying, and trail-making tests. Soft label and self-attention were applied to improve the model performance and provide a visual explanation. The interpretability of the visualization of our model and the Grad-CAM approach were rated by experienced medical personnel and quantitatively evaluated using intersection over union (IoU) between the models' heat maps and the regions of interest.
Rather than using a single CDT image in the baseline VGG16 model, using multiple drawing tasks as inputs into our proposed model with soft label significantly improves the classification performance between the healthy aging controls and the MCI patients. In particular, the classification accuracy increases from 0.75 (baseline model) to 0.81. The F1-score increases from 0.36 to 0.65, and the area under the receiver operating characteristic curve (AUC) increases from 0.74 to 0.84. Compared to the multi-input model that also offers interpretable visualization, i.e., Grad-CAM, our model receives higher interpretability scores given by experienced medical experts and higher IoUs.
Our model achieves better classification performance at detecting MCI compared to the baseline model. In addition, the model provides visual explanations that are superior to those of the baseline model as quantitatively evaluated by experienced medical personnel. Thus, our work offers an interpretable machine learning model with high classification performance, both of which are crucial aspects of artificial intelligence in medical diagnosis.
轻度认知障碍(MCI)是认知能力下降的早期阶段,可能发展为痴呆症。早期发现 MCI 是及时预防和干预的关键步骤。最近的研究已经开发了深度学习模型,通过床边任务(如经典的时钟绘制测试(CDT))来检测 MCI 和痴呆症。然而,仅使用 CDT 数据预测疾病的早期阶段仍然是一个挑战。此外,最先进的深度学习技术仍然面临黑盒挑战,因此在临床环境中实施这些技术是值得怀疑的。
我们从朱拉隆功国王纪念医院招募了 918 名受试者(651 名健康受试者和 267 名 MCI 患者)。我们提出了一种新的深度学习框架,该框架结合了 CDT、立方体复制和连线测试的数据。应用软标签和自注意力来提高模型性能并提供可视化解释。我们的模型可视化的可解释性和 Grad-CAM 方法由经验丰富的医务人员进行评分,并使用模型热图和感兴趣区域之间的交并比(IoU)进行定量评估。
与基线 VGG16 模型中仅使用单个 CDT 图像不同,使用多个绘图任务作为输入到我们的软标签模型中,可显著提高健康衰老对照和 MCI 患者之间的分类性能。特别是,分类精度从 0.75(基线模型)提高到 0.81。F1 评分从 0.36 提高到 0.65,接收者操作特征曲线(ROC)下的面积(AUC)从 0.74 提高到 0.84。与提供可解释可视化的多输入模型(即 Grad-CAM)相比,我们的模型获得了经验丰富的医学专家更高的可解释性评分和更高的 IoU。
与基线模型相比,我们的模型在检测 MCI 方面实现了更好的分类性能。此外,该模型提供了视觉解释,与基线模型相比,这些解释在经验丰富的医务人员的定量评估中表现更好。因此,我们的工作提供了一种具有高分类性能和可解释性的机器学习模型,这两者都是人工智能在医学诊断中的关键方面。