Department of Psychology and Behavioral Sciences, Zhejiang University, Hangzhou, China; Department of Psychiatry, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, Zhejiang, China; Institute of Neuroscience and Medicine, Brain & Behaviour (INM-7), Research Centre Jülich, Jülich, Germany.
Institute of Neuroscience and Medicine, Brain & Behaviour (INM-7), Research Centre Jülich, Jülich, Germany; Institute of Systems Neuroscience, Medical Faculty, Heinrich-Heine-universität Düsseldorf, Düsseldorf, Germany.
Biol Psychiatry. 2023 Jan 1;93(1):18-28. doi: 10.1016/j.biopsych.2022.07.025. Epub 2022 Aug 6.
Much attention is currently devoted to developing diagnostic classifiers for mental disorders. Complementing these efforts, we highlight the potential of machine learning to gain biological insights into the psychopathology and nosology of mental disorders. Studies to this end have mainly used brain imaging data, which can be obtained noninvasively from large cohorts and have repeatedly been argued to reveal potentially intermediate phenotypes. This may become particularly relevant in light of recent efforts to identify magnetic resonance imaging-derived biomarkers that yield insight into pathophysiological processes as well as to refine the taxonomy of mental illness. In particular, the accuracy of machine learning models may be used as dependent variables to identify features relevant to pathophysiology. Moreover, such approaches may help disentangle the dimensional (within diagnosis) and often overlapping (across diagnoses) symptomatology of psychiatric illness. We also point out a multiview perspective that combines data from different sources, bridging molecular and system-level information. Finally, we summarize recent efforts toward a data-driven definition of subtypes or disease entities through unsupervised and semisupervised approaches. The latter, blending unsupervised and supervised concepts, may represent a particularly promising avenue toward dissecting heterogeneous categories. Finally, we raise several technical and conceptual aspects related to the reviewed approaches. In particular, we discuss common pitfalls pertaining to flawed input data or analytic procedures that would likely lead to unreliable outputs.
目前,人们非常关注开发精神障碍的诊断分类器。作为这些努力的补充,我们强调了机器学习在获得精神障碍的病理和分类学生物学见解方面的潜力。为此目的进行的研究主要使用了脑成像数据,这些数据可以从大的队列中无创地获得,并多次被认为可以揭示潜在的中间表型。鉴于最近在识别磁共振成像衍生的生物标志物方面所做的努力,这一点可能变得尤为重要,这些生物标志物可以深入了解生理病理过程,并对精神疾病的分类进行细化。特别是,机器学习模型的准确性可以用作因变量,以识别与病理生理学相关的特征。此外,这些方法可以帮助厘清精神疾病的维度(在诊断内)和通常重叠(跨诊断)的症状。我们还指出了一种多视图视角,该视角结合了来自不同来源的数据,弥合了分子和系统水平的信息。最后,我们总结了通过无监督和半监督方法实现基于数据的亚类或疾病实体定义的最新进展。后者结合了无监督和监督的概念,可能是剖析异质类别的一个特别有前途的途径。最后,我们提出了与所审查方法相关的几个技术和概念方面的问题。特别是,我们讨论了与有缺陷的输入数据或分析程序相关的常见陷阱,这些陷阱可能会导致不可靠的输出。