Department of Radiology and Nuclear Medicine, Amsterdam University Medical Centers, Location VUmc, Amsterdam, The Netherlands; Queen Square MS Centre, University College London, London, United Kingdom.
Queen Square MS Centre, University College London, London, United Kingdom; National Institute of Health Research (NIHR), University College London Hospitals, Biomedical Research Centre, London, United Kingdom.
Neuroimage Clin. 2019;24:102011. doi: 10.1016/j.nicl.2019.102011. Epub 2019 Oct 22.
Machine learning classification is an attractive approach to automatically differentiate patients from healthy subjects, and to predict future disease outcomes. A clinically isolated syndrome (CIS) is often the first presentation of multiple sclerosis (MS), but it is difficult at onset to predict who will have a second relapse and hence convert to clinically definite MS. In this study, we thus aimed to distinguish CIS converters from non-converters at onset of a CIS, using recursive feature elimination and weight averaging with support vector machines. We also sought to assess the influence of cohort size and cross-validation methods on the accuracy estimate of the classification. We retrospectively collected 400 patients with CIS from six European MAGNIMS MS centres. Patients underwent brain MRI at onset of a CIS according to local standard-of-care protocols. The diagnosis of clinically definite MS at one-year follow-up was the standard against which the accuracy of the model was tested. For each patient, we derived MRI-based features, such as grey matter probability, white matter lesion load, cortical thickness, and volume of specific cortical and white matter regions. Features with little contribution to the classification model were removed iteratively through an interleaved sample bootstrapping and feature averaging approach. Classification of CIS outcome at one-year follow-up was performed with 2-fold, 5-fold, 10-fold and leave-one-out cross-validation for each centre cohort independently and in all patients together. The estimated classification accuracy across centres ranged from 64.9% to 88.1% using 2-fold cross-validation and from 73% to 92.9% using leave-one-out cross-validation. The classification accuracy estimate was higher in single-centre, smaller data sets than in combinations of data sets, being the lowest when all patients were merged together. Regional MRI features such as WM lesions, grey matter probability in the thalamus and the precuneus or cortical thickness in the cuneus and inferior temporal gyrus predicted the occurrence of a second relapse in patients at onset of a CIS using support vector machines. The increased accuracy estimate of the classification achieved with smaller and single-centre samples may indicate a model bias (overfitting) when data points were limited, but also more homogeneous. We provide an overview of classifier performance from a range of cross-validation schemes to give insight into the variability across schemes. The proposed recursive feature elimination approach with weight averaging can be used both in single- and multi-centre data sets in order to bridge the gap between group-level comparisons and making predictions for individual patients.
机器学习分类是一种有吸引力的方法,可以自动区分患者和健康受试者,并预测未来的疾病结果。临床孤立综合征(CIS)通常是多发性硬化症(MS)的首次表现,但在发病时很难预测谁会有第二次复发,从而转化为临床明确的 MS。在这项研究中,我们旨在使用递归特征消除和支持向量机的加权平均来区分 CIS 发病时的转化者和非转化者。我们还试图评估队列大小和交叉验证方法对分类准确性估计的影响。我们回顾性地收集了来自六个欧洲 MAGNIMS MS 中心的 400 名 CIS 患者。根据当地的标准护理方案,患者在 CIS 发病时进行脑 MRI 检查。在一年的随访中,临床明确的 MS 诊断是测试模型准确性的标准。对于每个患者,我们推导出基于 MRI 的特征,例如灰质概率、白质病变负荷、皮质厚度以及特定皮质和白质区域的体积。通过迭代的样本引导和特征平均方法,逐步去除对分类模型贡献不大的特征。使用 2 倍、5 倍、10 倍和留一法交叉验证,分别对每个中心队列进行独立和所有患者的分类,以预测一年随访时的 CIS 结果。使用 2 倍交叉验证时,中心间的估计分类准确性范围为 64.9%至 88.1%,使用留一法交叉验证时为 73%至 92.9%。在单中心、较小的数据集,分类准确性估计值高于数据集组合,当所有患者合并在一起时,分类准确性估计值最低。使用支持向量机,WM 病变、丘脑灰质概率、楔前叶或楔叶和颞下回皮质厚度等区域 MRI 特征可预测 CIS 发病患者第二次复发的发生。使用较小的和单中心样本获得的分类准确性估计值的增加可能表明数据点有限时模型存在偏差(过拟合),但也更同质。我们提供了一系列交叉验证方案的分类器性能概述,以深入了解方案之间的可变性。提出的递归特征消除方法与加权平均可以在单中心和多中心数据集上使用,以弥合组水平比较和为个体患者做出预测之间的差距。