合并或集成：多个神经影像学研究的综合分析。

Merging or ensembling: integrative analysis in multiple neuroimaging studies.

机构信息

Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States.

Department of Statistics, Florida State University, Tallahassee, FL 32306, United States.

出版信息

Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujae003.

DOI:10.1093/biomtc/ujae003

PMID:38465984

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10926268/

Abstract

The aim of this paper is to systematically investigate merging and ensembling methods for spatially varying coefficient mixed effects models (SVCMEM) in order to carry out integrative learning of neuroimaging data obtained from multiple biomedical studies. The "merged" approach involves training a single learning model using a comprehensive dataset that encompasses information from all the studies. Conversely, the "ensemble" approach involves creating a weighted average of distinct learning models, each developed from an individual study. We systematically investigate the prediction accuracy of the merged and ensemble learners under the presence of different degrees of interstudy heterogeneity. Additionally, we establish asymptotic guidelines for making strategic decisions about when to employ either of these models in different scenarios, along with deriving optimal weights for the ensemble learner. To validate our theoretical results, we perform extensive simulation studies. The proposed methodology is also applied to 3 large-scale neuroimaging studies.

摘要

本文旨在系统地研究空间变系数混合效应模型（SVCMEM）的合并和集成方法，以便对来自多个生物医学研究的神经影像学数据进行综合学习。“合并”方法涉及使用包含所有研究信息的综合数据集来训练单个学习模型。相反，“集成”方法涉及创建来自各个研究的不同学习模型的加权平均值。我们系统地研究了在不同程度的研究间异质性存在下，合并和集成学习者的预测准确性。此外，我们还为在不同情况下何时使用这些模型中的任意一个提供了策略性决策的渐近指南，并为集成学习者推导出了最优权重。为了验证我们的理论结果，我们进行了广泛的模拟研究。所提出的方法也应用于 3 个大规模的神经影像学研究。

相似文献

Merging or ensembling: integrative analysis in multiple neuroimaging studies.合并或集成：多个神经影像学研究的综合分析。

Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujae003.

Super Learner for Survival Data Prediction.用于生存数据预测的超级学习器。

Int J Biostat. 2020 Feb 22. doi: 10.1515/ijb-2019-0065.

Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型，对于使用可穿戴设备进行压力预测具有良好的泛化能力。

J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.

Hyperspectral inversion of soil organic matter based on improved ensemble learning method.

Spectrochim Acta A Mol Biomol Spectrosc. 2025 Oct 15;339:126302. doi: 10.1016/j.saa.2025.126302. Epub 2025 Apr 27.

Optimal ensemble construction for multistudy prediction with applications to mortality estimation.多研究预测的最优集成构建及其在死亡率估计中的应用。

Stat Med. 2024 Apr 30;43(9):1774-1789. doi: 10.1002/sim.10006. Epub 2024 Feb 23.

Tree-Weighting for Multi-Study Ensemble Learners.多研究集成学习者的树加权。

Pac Symp Biocomput. 2020;25:451-462.

Improving Individual Brain Age Prediction Using an Ensemble Deep Learning Framework.使用集成深度学习框架改进个体脑龄预测

Front Psychiatry. 2021 Mar 23;12:626677. doi: 10.3389/fpsyt.2021.626677. eCollection 2021.

Ensemble of Deep Neural Networks based on Condorcet's Jury Theorem for screening Covid-19 and Pneumonia from radiograph images.基于 Condorcet 陪审团定理的深度学习神经网络集成，用于从 X 光图像中筛选新冠病毒和肺炎。

Comput Biol Med. 2022 Oct;149:105979. doi: 10.1016/j.compbiomed.2022.105979. Epub 2022 Aug 25.

Ensembling local learners through multimodal perturbation.通过多模态扰动整合局部学习者。

IEEE Trans Syst Man Cybern B Cybern. 2005 Aug;35(4):725-35. doi: 10.1109/tsmcb.2005.845396.

Single-index varying coefficient model for functional responses.用于函数响应的单指标变系数模型。

Biometrics. 2016 Dec;72(4):1275-1284. doi: 10.1111/biom.12526. Epub 2016 Apr 8.

本文引用的文献

Statistical Learning Methods for Neuroimaging Data Analysis with Applications.统计学习方法在神经影像学数据分析中的应用。

Annu Rev Biomed Data Sci. 2023 Aug 10;6:73-104. doi: 10.1146/annurev-biodatasci-020722-100353. Epub 2023 Apr 26.

Image harmonization: A review of statistical and deep learning methods for removing batch effects and evaluation metrics for effective harmonization.图像调和：去除批次效应的统计和深度学习方法综述，以及有效调和的评价指标。

Neuroimage. 2023 Jul 1;274:120125. doi: 10.1016/j.neuroimage.2023.120125. Epub 2023 Apr 20.

Functional hybrid factor regression model for handling heterogeneity in imaging studies.用于处理影像研究中异质性的功能混合因子回归模型。

Biometrika. 2022 Dec;109(4):1133-1148. doi: 10.1093/biomet/asac007.

Privacy-preserving harmonization via distributed ComBat.通过分布式 ComBat 进行隐私保护的协调。

Neuroimage. 2022 Mar;248:118822. doi: 10.1016/j.neuroimage.2021.118822. Epub 2021 Dec 25.

Common genetic variation influencing human white matter microstructure.常见遗传变异影响人类白质微观结构。

Science. 2021 Jun 18;372(6548). doi: 10.1126/science.abf3736.

Longitudinal ComBat: A method for harmonizing longitudinal multi-scanner imaging data.纵向 ComBat：一种协调纵向多扫描仪成像数据的方法。

Neuroimage. 2020 Oct 15;220:117129. doi: 10.1016/j.neuroimage.2020.117129. Epub 2020 Jul 5.

Confound modelling in UK Biobank brain imaging.英国生物银行大脑成像中的混杂建模。

Neuroimage. 2021 Jan 1;224:117002. doi: 10.1016/j.neuroimage.2020.117002. Epub 2020 Jun 2.

FMEM: Functional Mixed Effects Models for Longitudinal Functional Responses.FMEM：用于纵向功能反应的功能混合效应模型

Stat Sin. 2019;29(4):2007-2033. doi: 10.5705/ss.202017.0505.

Genome-wide association analysis of 19,629 individuals identifies variants influencing regional brain volumes and refines their genetic co-architecture with cognitive and mental health traits.对 19629 个人进行全基因组关联分析，确定了影响区域脑容量的变异，并与认知和精神健康特征一起细化了它们的遗传共构。

Nat Genet. 2019 Nov;51(11):1637-1644. doi: 10.1038/s41588-019-0516-6. Epub 2019 Nov 1.

CONFOUNDER ADJUSTMENT IN MULTIPLE HYPOTHESIS TESTING.多重假设检验中的混杂因素调整

Ann Stat. 2017 Oct;45(5):1863-1894. doi: 10.1214/16-AOS1511. Epub 2017 Oct 31.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验