Iddi Samuel, Li Dan, Aisen Paul S, Rafii Michael S, Thompson Wesley K, Donohue Michael C
Alzheimer's Therapeutic Research Institute, Keck School of Medicine, University of Southern California, San Diego, USA.
Department of Statistics and Actuarial Science, University of Ghana, Legon-Accra, Ghana.
Brain Inform. 2019 Jun 28;6(1):6. doi: 10.1186/s40708-019-0099-0.
Alzheimer's disease is the most common neurodegenerative disease and is characterized by the accumulation of amyloid-beta peptides leading to the formation of plaques and tau protein tangles in brain. These neuropathological features precede cognitive impairment and Alzheimer's dementia by many years. To better understand and predict the course of disease from early-stage asymptomatic to late-stage dementia, it is critical to study the patterns of progression of multiple markers. In particular, we aim to predict the likely future course of progression for individuals given only a single observation of their markers. Improved individual-level prediction may lead to improved clinical care and clinical trials. We propose a two-stage approach to modeling and predicting measures of cognition, function, brain imaging, fluid biomarkers, and diagnosis of individuals using multiple domains simultaneously. In the first stage, joint (or multivariate) mixed-effects models are used to simultaneously model multiple markers over time. In the second stage, random forests are used to predict categorical diagnoses (cognitively normal, mild cognitive impairment, or dementia) from predictions of continuous markers based on the first-stage model. The combination of the two models allows one to leverage their key strengths in order to obtain improved accuracy. We characterize the predictive accuracy of this two-stage approach using data from the Alzheimer's Disease Neuroimaging Initiative. The two-stage approach using a single joint mixed-effects model for all continuous outcomes yields better diagnostic classification accuracy compared to using separate univariate mixed-effects models for each of the continuous outcomes. Overall prediction accuracy above 80% was achieved over a period of 2.5 years. The results further indicate that overall accuracy is improved when markers from multiple assessment domains, such as cognition, function, and brain imaging, are used in the prediction algorithm as compared to the use of markers from a single domain only.
阿尔茨海默病是最常见的神经退行性疾病,其特征是β淀粉样肽的积累导致大脑中形成斑块和tau蛋白缠结。这些神经病理学特征在认知障碍和阿尔茨海默病痴呆症出现前许多年就已存在。为了更好地理解和预测疾病从早期无症状阶段到晚期痴呆症的病程,研究多个标志物的进展模式至关重要。特别是,我们旨在仅根据个体标志物的单次观察来预测其未来可能的进展过程。改善个体水平的预测可能会带来更好的临床护理和临床试验。我们提出一种两阶段方法,用于同时使用多个领域对个体的认知、功能、脑成像、血液生物标志物和诊断指标进行建模和预测。在第一阶段,使用联合(或多变量)混合效应模型对多个标志物随时间进行同时建模。在第二阶段,基于第一阶段模型的连续标志物预测结果,使用随机森林来预测分类诊断(认知正常、轻度认知障碍或痴呆)。这两个模型的结合使人们能够利用它们的关键优势,从而提高准确性。我们使用来自阿尔茨海默病神经影像倡议组织的数据来描述这种两阶段方法的预测准确性。与针对每个连续结果使用单独的单变量混合效应模型相比,对所有连续结果使用单个联合混合效应模型的两阶段方法产生了更好的诊断分类准确性。在2.5年的时间里,总体预测准确率达到了80%以上。结果还进一步表明,与仅使用单个领域的标志物相比,在预测算法中使用来自多个评估领域(如认知、功能和脑成像)的标志物时,总体准确性得到了提高。