Redolfi Alberto, De Francesco Silvia, Palesi Fulvia, Galluzzi Samantha, Muscio Cristina, Castellazzi Gloria, Tiraboschi Pietro, Savini Giovanni, Nigri Anna, Bottini Gabriella, Bruzzone Maria Grazia, Ramusino Matteo Cotta, Ferraro Stefania, Gandini Wheeler-Kingshott Claudia A M, Tagliavini Fabrizio, Frisoni Giovanni B, Ryvlin Philippe, Demonet Jean-François, Kherif Ferath, Cappa Stefano F, D'Angelo Egidio
Laboratory of Neuroinformatics, IRCCS Istituto Centro San Giovanni di Dio Fatebenefratelli, Brescia, Italy.
Laboratory of Alzheimer's Neuroimaging and Epidemiology - LANE, IRCCS Istituto Centro San Giovanni di Dio Fatebenefratelli, Brescia, Italy.
Front Neurol. 2020 Sep 23;11:1021. doi: 10.3389/fneur.2020.01021. eCollection 2020.
With the shift of research focus to personalized medicine in Alzheimer's Dementia (AD), there is an urgent need for tools that are capable of quantifying a patient's risk using diagnostic biomarkers. The Medical Informatics Platform (MIP) is a distributed e-infrastructure federating large amounts of data coupled with machine-learning (ML) algorithms and statistical models to define the biological signature of the disease. The present study assessed (i) the accuracy of two ML algorithms, i.e., supervised Gradient Boosting (GB) and semi-unsupervised 3C strategy (Categorize, Cluster, Classify-CCC) implemented in the MIP and (ii) their contribution over the standard diagnostic workup. We examined individuals coming from the MIP installed across 3 Italian memory clinics, including subjects with Normal Cognition (CN, = 432), Mild Cognitive Impairment (MCI, = 456), and AD ( = 451). The GB classifier was applied to best discriminate the three diagnostic classes in 1,339 subjects, and the CCC strategy was used to refine the classical disease categories. Four dementia experts provided their diagnostic confidence (DC) of MCI conversion on an independent cohort of 38 patients. DC was based on clinical, neuropsychological, CSF, and structural MRI information and again with addition of the outcome from the MIP tools. The GB algorithm provided a classification accuracy of 85% in a nested 10-fold cross-validation for CN vs. MCI vs. AD discrimination. Accuracy increased to 95% in the holdout validation, with the omission of each Italian clinical cohort out in turn. CCC identified five homogeneous clusters of subjects and 36 biomarkers that represented the disease fingerprint. In the DC assessment, CCC defined six clusters in the MCI population used to train the algorithm and 29 biomarkers to improve patients staging. GB and CCC showed a significant impact, evaluated as +5.99% of increment on physicians' DC. The influence of MIP on DC was rated from "slight" to "significant" in 80% of the cases. GB provided fair results in classification of CN, MCI, and AD. CCC identified homogeneous and promising classes of subjects via its semi-unsupervised approach. We measured the effect of the MIP on the physician's DC. Our results pave the way for the establishment of a new paradigm for ML discrimination of patients who will or will not convert to AD, a clinical priority for neurology.
随着阿尔茨海默病(AD)研究重点向个性化医疗的转变,迫切需要能够使用诊断生物标志物来量化患者风险的工具。医学信息平台(MIP)是一个分布式电子基础设施,它将大量数据与机器学习(ML)算法及统计模型相结合,以定义该疾病的生物学特征。本研究评估了:(i)在MIP中实施的两种ML算法的准确性,即监督式梯度提升(GB)和半无监督3C策略(分类、聚类、分类-CCC);(ii)它们相对于标准诊断检查的贡献。我们研究了来自意大利3家记忆诊所安装的MIP的个体,包括认知正常(CN,n = 432)、轻度认知障碍(MCI,n = 456)和AD(n = 451)的受试者。GB分类器用于最佳区分1339名受试者中的三个诊断类别,CCC策略用于细化经典疾病类别。四位痴呆症专家对38名患者的独立队列提供了他们对MCI转化的诊断信心(DC)。DC基于临床、神经心理学、脑脊液和结构MRI信息,并再次加上MIP工具的结果。GB算法在用于区分CN与MCI与AD的嵌套10折交叉验证中提供了85%的分类准确率。在留出验证中,准确率提高到95%,依次排除每个意大利临床队列。CCC识别出五个同质的受试者集群和36个代表疾病指纹的生物标志物。在DC评估中,CCC在用于训练算法的MCI人群中定义了六个集群和29个生物标志物以改善患者分期。GB和CCC显示出显著影响,评估为医生DC增加了5.99%。在80%的病例中,MIP对DC的影响被评为从“轻微”到“显著”。GB在CN、MCI和AD的分类中提供了不错结果。CCC通过其半无监督方法识别出同质且有前景的受试者类别。我们测量了MIP对医生DC的影响。我们的结果为建立一种新的范式铺平了道路,该范式用于对将转化或不会转化为AD的患者进行ML鉴别,这是神经病学的一个临床优先事项。