Ambroise Corentin, Grigis Antoine, Houenou Josselin, Frouin Vincent
University Paris-Saclay, CEA, CNRS, Neurospin, Baobab UMR 9027, Gif-sur-Yvette, 91191, France.
Pôle de Psychiatrie, AP-HP, Faculté de Médecine de Créteil, DHU PePsy, Hôpitaux Universitaires Mondor, Créteil, 94000, France.
Sci Rep. 2025 Jan 17;15(1):2312. doi: 10.1038/s41598-024-85032-5.
Recent advances highlight the limitations of classification strategies in machine learning that rely on a single data source for understanding, diagnosing and predicting psychiatric syndromes. Moreover, approaches based solely on clinician labels often fail to capture the complexity and variability of these conditions. Recent research underlines the importance of considering multiple dimensions that span across different psychiatric syndromes. These developments have led to more comprehensive approaches to studying psychiatric conditions that incorporate diverse data sources such as imaging, genetics, and symptom reports. Multi-view unsupervised learning frameworks, particularly deep learning models, present promising solutions for integrating and analysing complex datasets. Such models contain generative capabilities which facilitate the exploration of relationships between different data views. In this study, we propose a robust framework for interpreting these models that combines digital avatars with stability selection to assess these relationships. We apply this framework to the Healthy Brain Network cohort which includes clinical behavioural scores and brain imaging features, uncovering a consistent set of brain-behaviour interactions. These associations link cortical measurements obtained from structural MRI with clinical reports evaluating psychiatric symptoms. Our framework effectively identifies relevant and stable associations, even with incomplete datasets, while isolating variability of interest from confounding factors.
近期的进展凸显了机器学习中分类策略的局限性,这些策略依赖单一数据源来理解、诊断和预测精神综合征。此外,仅基于临床医生标签的方法往往无法捕捉这些病症的复杂性和变异性。近期研究强调了考虑跨越不同精神综合征的多个维度的重要性。这些进展催生了更全面的研究精神病症的方法,这些方法整合了多种数据源,如图像、遗传学和症状报告。多视图无监督学习框架,特别是深度学习模型,为整合和分析复杂数据集提供了有前景的解决方案。此类模型具有生成能力,有助于探索不同数据视图之间的关系。在本研究中,我们提出了一个用于解释这些模型的强大框架,该框架将数字化身与稳定性选择相结合以评估这些关系。我们将此框架应用于健康脑网络队列,该队列包括临床行为评分和脑成像特征,揭示了一组一致的脑-行为相互作用。这些关联将从结构磁共振成像获得的皮层测量结果与评估精神症状的临床报告联系起来。我们的框架即使在数据集不完整的情况下也能有效识别相关且稳定的关联,同时将感兴趣的变异性与混杂因素隔离开来。