Grollemund Vincent, Pradat Pierre-François, Querin Giorgia, Delbot François, Le Chat Gaétan, Pradat-Peyre Jean-François, Bede Peter
Laboratoire d'Informatique de Paris 6, Sorbonne University, Paris, France.
FRS Consulting, Paris, France.
Front Neurosci. 2019 Feb 28;13:135. doi: 10.3389/fnins.2019.00135. eCollection 2019.
Amyotrophic Lateral Sclerosis (ALS) is a relentlessly progressive neurodegenerative condition with limited therapeutic options at present. Survival from symptom onset ranges from 3 to 5 years depending on genetic, demographic, and phenotypic factors. Despite tireless research efforts, the core etiology of the disease remains elusive and drug development efforts are confounded by the lack of accurate monitoring markers. Disease heterogeneity, late-stage recruitment into pharmaceutical trials, and inclusion of phenotypically admixed patient cohorts are some of the key barriers to successful clinical trials. Machine Learning (ML) models and large international data sets offer unprecedented opportunities to appraise candidate diagnostic, monitoring, and prognostic markers. Accurate patient stratification into well-defined prognostic categories is another aspiration of emerging classification and staging systems. The objective of this paper is the comprehensive, systematic, and critical review of ML initiatives in ALS to date and their potential in research, clinical, and pharmacological applications. The focus of this review is to provide a dual, clinical-mathematical perspective on recent advances and future directions of the field. Another objective of the paper is the frank discussion of the pitfalls and drawbacks of specific models, highlighting the shortcomings of existing studies and to provide methodological recommendations for future study designs. Despite considerable sample size limitations, ML techniques have already been successfully applied to ALS data sets and a number of promising diagnosis models have been proposed. Prognostic models have been tested using core clinical variables, biological, and neuroimaging data. These models also offer patient stratification opportunities for future clinical trials. Despite the enormous potential of ML in ALS research, statistical assumptions are often violated, the choice of specific statistical models is seldom justified, and the constraints of ML models are rarely enunciated. From a mathematical perspective, the main barrier to the development of validated diagnostic, prognostic, and monitoring indicators stem from limited sample sizes. The combination of multiple clinical, biofluid, and imaging biomarkers is likely to increase the accuracy of mathematical modeling and contribute to optimized clinical trial designs.
肌萎缩侧索硬化症(ALS)是一种无情进展的神经退行性疾病,目前治疗选择有限。从症状出现开始计算的生存期为3至5年,具体取决于遗传、人口统计学和表型因素。尽管进行了不懈的研究努力,但该疾病的核心病因仍然不明,且由于缺乏准确的监测标志物,药物研发工作受到困扰。疾病异质性、药物试验晚期招募以及表型混合的患者队列纳入是成功开展临床试验的一些关键障碍。机器学习(ML)模型和大型国际数据集为评估候选诊断、监测和预后标志物提供了前所未有的机会。将患者准确分层到明确的预后类别是新兴分类和分期系统的另一个目标。本文的目的是对迄今为止ALS领域的ML举措及其在研究、临床和药理学应用中的潜力进行全面、系统和批判性的综述。本综述的重点是从临床 - 数学双重角度阐述该领域的最新进展和未来方向。本文的另一个目标是坦率讨论特定模型的缺陷和不足,突出现有研究的缺点,并为未来的研究设计提供方法学建议。尽管存在相当大的样本量限制,但ML技术已成功应用于ALS数据集,并提出了一些有前景的诊断模型。已使用核心临床变量、生物学和神经影像学数据对预后模型进行了测试。这些模型还为未来的临床试验提供了患者分层机会。尽管ML在ALS研究中具有巨大潜力,但统计假设经常被违反,特定统计模型的选择很少有合理依据,且ML模型的局限性很少被阐明。从数学角度来看,开发经过验证的诊断、预后和监测指标的主要障碍源于样本量有限。多种临床、生物流体和成像生物标志物的组合可能会提高数学建模的准确性,并有助于优化临床试验设计。