Wachinger Christian, Reuter Martin
Department of Child and Adolescent Psychiatry, Psychosomatic and Psychotherapy, Ludwig-Maximilian-University, Munich, Germany; Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.
Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.
Neuroimage. 2016 Oct 1;139:470-479. doi: 10.1016/j.neuroimage.2016.05.053. Epub 2016 Jun 2.
With the increasing prevalence of Alzheimer's disease, research focuses on the early computer-aided diagnosis of dementia with the goal to understand the disease process, determine risk and preserving factors, and explore preventive therapies. By now, large amounts of data from multi-site studies have been made available for developing, training, and evaluating automated classifiers. Yet, their translation to the clinic remains challenging, in part due to their limited generalizability across different datasets. In this work, we describe a compact classification approach that mitigates overfitting by regularizing the multinomial regression with the mixed ℓ1/ℓ2 norm. We combine volume, thickness, and anatomical shape features from MRI scans to characterize neuroanatomy for the three-class classification of Alzheimer's disease, mild cognitive impairment and healthy controls. We demonstrate high classification accuracy via independent evaluation within the scope of the CADDementia challenge. We, furthermore, demonstrate that variations between source and target datasets can substantially influence classification accuracy. The main contribution of this work addresses this problem by proposing an approach for supervised domain adaptation based on instance weighting. Integration of this method into our classifier allows us to assess different strategies for domain adaptation. Our results demonstrate (i) that training on only the target training set yields better results than the naïve combination (union) of source and target training sets, and (ii) that domain adaptation with instance weighting yields the best classification results, especially if only a small training component of the target dataset is available. These insights imply that successful deployment of systems for computer-aided diagnostics to the clinic depends not only on accurate classifiers that avoid overfitting, but also on a dedicated domain adaptation strategy.
随着阿尔茨海默病患病率的不断上升,研究聚焦于痴呆症的早期计算机辅助诊断,目标是了解疾病进程、确定风险和保护因素,并探索预防性治疗方法。目前,来自多中心研究的大量数据已可用于开发、训练和评估自动分类器。然而,将它们应用于临床仍具有挑战性,部分原因是它们在不同数据集之间的泛化能力有限。在这项工作中,我们描述了一种紧凑的分类方法,该方法通过使用混合ℓ1/ℓ2范数对多项回归进行正则化来减轻过拟合。我们结合来自MRI扫描的体积、厚度和解剖形状特征来表征神经解剖结构,以对阿尔茨海默病、轻度认知障碍和健康对照进行三类分类。我们通过在CADDementia挑战赛范围内的独立评估证明了高分类准确率。此外,我们证明了源数据集和目标数据集之间的差异会显著影响分类准确率。这项工作的主要贡献在于提出了一种基于实例加权的监督域适应方法来解决这个问题。将此方法集成到我们的分类器中使我们能够评估不同的域适应策略。我们的结果表明:(i)仅在目标训练集上进行训练比简单地将源训练集和目标训练集合并能产生更好的结果;(ii)使用实例加权的域适应能产生最佳分类结果,特别是在目标数据集只有少量训练数据可用的情况下。这些见解意味着,要成功地将计算机辅助诊断系统应用于临床,不仅取决于能避免过拟合的准确分类器,还取决于专门的域适应策略。