Quantitative Sciences Unit, Stanford School of Medicine, Palo Alto, California, United States of America.
Department of Biomedical Informatics, University of Colorado School of Medicine, Aurora, Colorado, United States of America.
PLoS Comput Biol. 2024 Nov 11;20(11):e1012524. doi: 10.1371/journal.pcbi.1012524. eCollection 2024 Nov.
Modeling the network topology of the human brain within the mesoscale has become an increasing focus within the neuroscientific community due to its variation across diverse cognitive processes, in the presence of neuropsychiatric disease or injury, and over the lifespan. Much research has been done on the creation of algorithms to detect these mesoscopic structures, called communities or modules, but less has been done to conduct inference on these structures. The literature on analysis of these community detection algorithms has focused on comparing them within the same subject. These approaches, however, either do not accomodate a more general association between community structure and an outcome or cannot accommodate additional covariates that may confound the association of interest. We propose a semiparametric kernel machine regression model for either a continuous or binary outcome, where covariate effects are modeled parametrically and brain connectivity measures are measured nonparametrically. By incorporating notions of similarity between network community structures into a kernel distance function, the high-dimensional feature space of brain networks, defined on input pairs, can be generalized to non-linear spaces, allowing for a wider class of distance-based algorithms. We evaluate our proposed methodology on both simulated and real datasets.
由于人类大脑的网络拓扑结构在不同认知过程、神经精神疾病或损伤以及整个生命周期中存在变化,因此在介观尺度上对其进行建模已成为神经科学界日益关注的焦点。已经有很多关于创建算法以检测这些介观结构(称为社区或模块)的研究,但对这些结构进行推断的研究却较少。关于分析这些社区检测算法的文献主要集中在比较同一主体内的算法。然而,这些方法要么不适应社区结构与结果之间更一般的关联,要么不能适应可能混淆感兴趣关联的其他协变量。我们提出了一种用于连续或二进制结果的半参数核机器回归模型,其中协变量效应以参数方式建模,大脑连接性测量以非参数方式测量。通过将网络社区结构之间的相似性概念纳入核距离函数中,可以将输入对定义的大脑网络的高维特征空间推广到非线性空间,从而允许更广泛的基于距离的算法。我们在模拟数据集和真实数据集上评估了我们提出的方法。