Knight Foundation School of Computing and Information Sciences, Florida International University, 11200 SW 8th Street CASE 354, Miami, Florida, 33199, USA.
SUPMICROTECH, CNRS, institut FEMTO-ST, 24 rue Alain Savary, Besançon, F-25000, France.
Neuroinformatics. 2023 Oct;21(4):651-668. doi: 10.1007/s12021-023-09639-1. Epub 2023 Aug 15.
Resting-state functional magnetic resonance imaging (rs-fMRI) is a non-invasive imaging technique widely used in neuroscience to understand the functional connectivity of the human brain. While rs-fMRI multi-site data can help to understand the inner working of the brain, the data acquisition and processing of this data has many challenges. One of the challenges is the variability of the data associated with different acquisitions sites, and different MRI machines vendors. Other factors such as population heterogeneity among different sites, with variables such as age and gender of the subjects, must also be considered. Given that most of the machine-learning models are developed using these rs-fMRI multi-site data sets, the intrinsic confounding effects can adversely affect the generalizability and reliability of these computational methods, as well as the imposition of upper limits on the classification scores. This work aims to identify the phenotypic and imaging variables producing the confounding effects, as well as to control these effects. Our goal is to maximize the classification scores obtained from the machine learning analysis of the Autism Brain Imaging Data Exchange (ABIDE) rs-fMRI multi-site data. To achieve this goal, we propose novel methods of stratification to produce homogeneous sub-samples of the 17 ABIDE sites, as well as the generation of new features from the static functional connectivity values, using multiple linear regression models, ComBat harmonization models, and normalization methods. The main results obtained with our statistical models and methods are an accuracy of 76.4%, sensitivity of 82.9%, and specificity of 77.0%, which are 8.8%, 20.5%, and 7.5% above the baseline classification scores obtained from the machine learning analysis of the static functional connectivity computed from the ABIDE rs-fMRI multi-site data.
静息态功能磁共振成像(rs-fMRI)是一种广泛应用于神经科学的非侵入性成像技术,用于了解人脑的功能连接。虽然 rs-fMRI 多站点数据有助于了解大脑的内部工作原理,但该数据的采集和处理存在许多挑战。其中一个挑战是与不同采集站点和不同 MRI 机器供应商相关的数据可变性。还必须考虑其他因素,例如不同站点之间的人群异质性,以及受试者的年龄和性别等变量。鉴于大多数机器学习模型都是使用这些 rs-fMRI 多站点数据集开发的,因此内在混杂效应可能会对这些计算方法的通用性和可靠性产生不利影响,并对分类评分施加上限。这项工作旨在确定产生混杂效应的表型和成像变量,并控制这些效应。我们的目标是最大化从自闭症脑成像数据交换(ABIDE)rs-fMRI 多站点数据的机器学习分析中获得的分类评分。为了实现这一目标,我们提出了新的分层方法,以产生 17 个 ABIDE 站点的同质子样本,并使用多元线性回归模型、ComBat 协调模型和归一化方法从静态功能连接值生成新特征。我们的统计模型和方法的主要结果是准确性为 76.4%,敏感性为 82.9%,特异性为 77.0%,比从 ABIDE rs-fMRI 多站点数据的静态功能连接计算的机器学习分析中获得的基线分类评分分别高出 8.8%、20.5%和 7.5%。