• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多视图分析的协同学习。

Cooperative learning for multiview analysis.

机构信息

Department of Biomedical Data Science, Stanford University, Stanford, CA 94305.

Department of Statistics, Stanford University, Stanford, CA 94305.

出版信息

Proc Natl Acad Sci U S A. 2022 Sep 20;119(38):e2202113119. doi: 10.1073/pnas.2202113119. Epub 2022 Sep 12.

DOI:10.1073/pnas.2202113119
PMID:36095183
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9499553/
Abstract

We propose a method for supervised learning with multiple sets of features ("views"). The multiview problem is especially important in biology and medicine, where "-omics" data, such as genomics, proteomics, and radiomics, are measured on a common set of samples. "Cooperative learning" combines the usual squared-error loss of predictions with an "agreement" penalty to encourage the predictions from different data views to agree. By varying the weight of the agreement penalty, we get a continuum of solutions that include the well-known early and late fusion approaches. Cooperative learning chooses the degree of agreement (or fusion) in an adaptive manner, using a validation set or cross-validation to estimate test set prediction error. One version of our fitting procedure is modular, where one can choose different fitting mechanisms (e.g., lasso, random forests, boosting, or neural networks) appropriate for different data views. In the setting of cooperative regularized linear regression, the method combines the lasso penalty with the agreement penalty, yielding feature sparsity. The method can be especially powerful when the different data views share some underlying relationship in their signals that can be exploited to boost the signals. We show that cooperative learning achieves higher predictive accuracy on simulated data and real multiomics examples of labor-onset prediction. By leveraging aligned signals and allowing flexible fitting mechanisms for different modalities, cooperative learning offers a powerful approach to multiomics data fusion.

摘要

我们提出了一种用于多组特征(“视图”)的监督学习方法。在生物学和医学中,多视图问题尤为重要,因为基因组学、蛋白质组学和放射组学等“组学”数据是在一组共同的样本上测量的。“协同学习”将预测的常用平方误差损失与“一致性”惩罚相结合,以鼓励来自不同数据视图的预测一致。通过改变一致性惩罚的权重,我们得到了一个连续的解决方案,其中包括众所周知的早期和晚期融合方法。协同学习以自适应的方式选择一致性(或融合)的程度,使用验证集或交叉验证来估计测试集的预测误差。我们的拟合过程的一个版本是模块化的,其中可以为不同的数据视图选择不同的拟合机制(例如,lasso、随机森林、boosting 或神经网络)。在协同正则化线性回归的设置中,该方法将 lasso 惩罚与一致性惩罚相结合,产生特征稀疏性。当不同的数据视图在其信号中共享一些可以利用来增强信号的潜在关系时,该方法尤其强大。我们表明,协同学习在模拟数据和实际的分娩预测多组学示例上实现了更高的预测准确性。通过利用对齐的信号并为不同的模式提供灵活的拟合机制,协同学习为多组学数据融合提供了一种强大的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80d/9499553/1d0692360124/pnas.2202113119fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80d/9499553/96a68411c006/pnas.2202113119fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80d/9499553/ec15c164eb41/pnas.2202113119fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80d/9499553/e85082395c29/pnas.2202113119fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80d/9499553/146c396ecab9/pnas.2202113119fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80d/9499553/1d0692360124/pnas.2202113119fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80d/9499553/96a68411c006/pnas.2202113119fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80d/9499553/ec15c164eb41/pnas.2202113119fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80d/9499553/e85082395c29/pnas.2202113119fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80d/9499553/146c396ecab9/pnas.2202113119fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d80d/9499553/1d0692360124/pnas.2202113119fig05.jpg

相似文献

1
Cooperative learning for multiview analysis.多视图分析的协同学习。
Proc Natl Acad Sci U S A. 2022 Sep 20;119(38):e2202113119. doi: 10.1073/pnas.2202113119. Epub 2022 Sep 12.
2
Multiview learning for understanding functional multiomics.多视图学习理解功能多组学。
PLoS Comput Biol. 2020 Apr 2;16(4):e1007677. doi: 10.1371/journal.pcbi.1007677. eCollection 2020 Apr.
3
Interpretable deep learning methods for multiview learning.多视图学习的可解释深度学习方法。
BMC Bioinformatics. 2024 Feb 14;25(1):69. doi: 10.1186/s12859-024-05679-9.
4
A fair experimental comparison of neural network architectures for latent representations of multi-omics for drug response prediction.神经网络架构在多组学药物反应预测中潜在表示的公平实验比较。
BMC Bioinformatics. 2023 Feb 14;24(1):45. doi: 10.1186/s12859-023-05166-7.
5
Uncertainty-guided cross learning via CNN and transformer for semi-supervised honeycomb lung lesion segmentation.基于 CNN 和 Transformer 的不确定性引导交叉学习在半监督蜂窝肺病变分割中的应用。
Phys Med Biol. 2023 Dec 11;68(24). doi: 10.1088/1361-6560/ad0eb2.
6
Supervised learning via the "hubNet" procedure.通过“hubNet”程序进行监督学习。
Stat Sin. 2018 Jul;28(3):1225-1243. doi: 10.5705/ss.202016.0482.
7
MGLNN: Semi-supervised learning via Multiple Graph Cooperative Learning Neural Networks.MGLNN:基于多图协同学习神经网络的半监督学习。
Neural Netw. 2022 Sep;153:204-214. doi: 10.1016/j.neunet.2022.05.024. Epub 2022 Jun 3.
8
Deep learning: new computational modelling techniques for genomics.深度学习:基因组学的新计算建模技术。
Nat Rev Genet. 2019 Jul;20(7):389-403. doi: 10.1038/s41576-019-0122-6.
9
Sparse Inverse Covariance Estimation with L0 Penalty for Network Construction with Omics Data.用于组学数据网络构建的具有L0惩罚的稀疏逆协方差估计
J Comput Biol. 2016 Mar;23(3):192-202. doi: 10.1089/cmb.2015.0102. Epub 2016 Feb 1.
10
Overall Survival Prognostic Modelling of Non-small Cell Lung Cancer Patients Using Positron Emission Tomography/Computed Tomography Harmonised Radiomics Features: The Quest for the Optimal Machine Learning Algorithm.正电子发射断层扫描/计算机断层扫描调和放射组学特征预测非小细胞肺癌患者总生存期:最优机器学习算法的探索。
Clin Oncol (R Coll Radiol). 2022 Feb;34(2):114-127. doi: 10.1016/j.clon.2021.11.014. Epub 2021 Dec 3.

引用本文的文献

1
Improving predictability, reliability, and generalizability of brain-wide associations for cognitive abilities via multimodal stacking.通过多模态堆叠提高全脑认知能力关联的可预测性、可靠性和普遍性。
PNAS Nexus. 2025 Jun 24;4(6):pgaf175. doi: 10.1093/pnasnexus/pgaf175. eCollection 2025 Jun.
2
Omic Risk Scores are Associated with COPD-related Traits Across Three Cohorts.全基因组关联研究风险评分与三个队列中慢性阻塞性肺疾病相关特征有关。
medRxiv. 2025 Jun 2:2025.06.01.25328699. doi: 10.1101/2025.06.01.25328699.
3
A machine learning approach to leveraging electronic health records for enhanced omics analysis.

本文引用的文献

1
Integrated analysis of multimodal single-cell data.多模态单细胞数据的综合分析。
Cell. 2021 Jun 24;184(13):3573-3587.e29. doi: 10.1016/j.cell.2021.04.048. Epub 2021 May 31.
2
Integrated trajectories of the maternal metabolome, proteome, and immunome predict labor onset.母体内分泌代谢组学、蛋白质组学和免疫组学的综合轨迹可预测分娩开始。
Sci Transl Med. 2021 May 5;13(592). doi: 10.1126/scitranslmed.abd9898.
3
Sparse linear discriminant analysis for multiview structured data.多视角结构数据的稀疏线性判别分析。
一种利用电子健康记录进行增强型组学分析的机器学习方法。
Nat Mach Intell. 2025;7(2):293-306. doi: 10.1038/s42256-024-00974-9. Epub 2025 Jan 16.
4
NMFProfiler: a multi-omics integration method for samples stratified in groups.NMFProfiler:一种用于对分组样本进行多组学整合的方法。
Bioinformatics. 2025 Feb 4;41(2). doi: 10.1093/bioinformatics/btaf066.
5
The need for a cancer exposome atlas: a scoping review.癌症暴露组图谱的必要性:一项范围综述
JNCI Cancer Spectr. 2025 Jan 3;9(1). doi: 10.1093/jncics/pkae122.
6
Assessing polyomic risk to predict Alzheimer's disease using a machine learning model.使用机器学习模型评估多组学风险以预测阿尔茨海默病。
Alzheimers Dement. 2024 Dec;20(12):8700-8714. doi: 10.1002/alz.14319. Epub 2024 Nov 7.
7
Joint modeling of an outcome variable and integrated omics datasets using GLM-PO2PLS.使用GLM-PO2PLS对结果变量和综合组学数据集进行联合建模。
J Appl Stat. 2024 Feb 21;51(13):2627-2651. doi: 10.1080/02664763.2024.2313458. eCollection 2024.
8
Enhancing portability of trans-ancestral polygenic risk scores through tissue-specific functional genomic data integration.通过组织特异性功能基因组数据整合提高跨祖先多基因风险评分的可移植性。
PLoS Genet. 2024 Aug 7;20(8):e1011356. doi: 10.1371/journal.pgen.1011356. eCollection 2024 Aug.
9
Alzheimer's disease detection using data fusion with a deep supervised encoder.基于深度监督编码器的数据融合用于阿尔茨海默病检测
Front Dement. 2024;3. doi: 10.3389/frdem.2024.1332928. Epub 2024 Feb 12.
10
Bayesian Simultaneous Factorization and Prediction Using Multi-Omic Data.使用多组学数据的贝叶斯同时分解与预测
Comput Stat Data Anal. 2024 Sep;197. doi: 10.1016/j.csda.2024.107974. Epub 2024 Apr 30.
Biometrics. 2022 Jun;78(2):612-623. doi: 10.1111/biom.13458. Epub 2021 Mar 30.
4
Pathomic Fusion: An Integrated Framework for Fusing Histopathology and Genomic Features for Cancer Diagnosis and Prognosis.病理融合:融合组织病理学和基因组特征用于癌症诊断和预后的综合框架。
IEEE Trans Med Imaging. 2022 Apr;41(4):757-770. doi: 10.1109/TMI.2020.3021387. Epub 2022 Apr 1.
5
Essential Role of Complement in Pregnancy: From Implantation to Parturition and Beyond.补体在妊娠中的重要作用:从着床到分娩及以后。
Front Immunol. 2020 Jul 31;11:1681. doi: 10.3389/fimmu.2020.01681. eCollection 2020.
6
Integrative Methods and Practical Challenges for Single-Cell Multi-omics.单细胞多组学的综合方法和实际挑战。
Trends Biotechnol. 2020 Sep;38(9):1007-1022. doi: 10.1016/j.tibtech.2020.02.013. Epub 2020 Mar 26.
7
An integrative multi-omics analysis to identify candidate DNA methylation biomarkers related to prostate cancer risk.一种综合的多组学分析方法,用于鉴定与前列腺癌风险相关的候选 DNA 甲基化生物标志物。
Nat Commun. 2020 Aug 6;11(1):3905. doi: 10.1038/s41467-020-17673-9.
8
Integrating genomic features for non-invasive early lung cancer detection.整合基因组特征进行非侵入性早期肺癌检测。
Nature. 2020 Apr;580(7802):245-251. doi: 10.1038/s41586-020-2140-0. Epub 2020 Mar 25.
9
Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction.从电子健康记录和遗传数据中学习以改善心血管事件预测。
Sci Rep. 2019 Jan 24;9(1):717. doi: 10.1038/s41598-018-36745-x.
10
Precision medicine screening using whole-genome sequencing and advanced imaging to identify disease risk in adults.采用全基因组测序和先进成像技术进行精准医学筛查,以识别成年人的疾病风险。
Proc Natl Acad Sci U S A. 2018 Apr 3;115(14):3686-3691. doi: 10.1073/pnas.1706096114. Epub 2018 Mar 19.