• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

神经网络架构在多组学药物反应预测中潜在表示的公平实验比较。

A fair experimental comparison of neural network architectures for latent representations of multi-omics for drug response prediction.

机构信息

Department of Computer Science, Johannes Gutenberg University Mainz, Mainz, Germany.

出版信息

BMC Bioinformatics. 2023 Feb 14;24(1):45. doi: 10.1186/s12859-023-05166-7.

DOI:10.1186/s12859-023-05166-7
PMID:36788531
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9926634/
Abstract

BACKGROUND

Recent years have seen a surge of novel neural network architectures for the integration of multi-omics data for prediction. Most of the architectures include either encoders alone or encoders and decoders, i.e., autoencoders of various sorts, to transform multi-omics data into latent representations. One important parameter is the depth of integration: the point at which the latent representations are computed or merged, which can be either early, intermediate, or late. The literature on integration methods is growing steadily, however, close to nothing is known about the relative performance of these methods under fair experimental conditions and under consideration of different use cases.

RESULTS

We developed a comparison framework that trains and optimizes multi-omics integration methods under equal conditions. We incorporated early integration, PCA and four recently published deep learning methods: MOLI, Super.FELT, OmiEmbed, and MOMA. Further, we devised a novel method, Omics Stacking, that combines the advantages of intermediate and late integration. Experiments were conducted on a public drug response data set with multiple omics data (somatic point mutations, somatic copy number profiles and gene expression profiles) that was obtained from cell lines, patient-derived xenografts, and patient samples. Our experiments confirmed that early integration has the lowest predictive performance. Overall, architectures that integrate triplet loss achieved the best results. Statistical differences can, overall, rarely be observed, however, in terms of the average ranks of methods, Super.FELT is consistently performing best in a cross-validation setting and Omics Stacking best in an external test set setting.

CONCLUSIONS

We recommend researchers to follow fair comparison protocols, as suggested in the paper. When faced with a new data set, Super.FELT is a good option in the cross-validation setting as well as Omics Stacking in the external test set setting. Statistical significances are hardly observable, despite trends in the algorithms' rankings. Future work on refined methods for transfer learning tailored for this domain may improve the situation for external test sets. The source code of all experiments is available under https://github.com/kramerlab/Multi-Omics_analysis.

摘要

背景

近年来,出现了许多用于整合多组学数据进行预测的新型神经网络架构。大多数架构都包含编码器,或者编码器和解码器,即各种类型的自动编码器,将多组学数据转换为潜在表示。一个重要的参数是整合的深度:计算或合并潜在表示的点,可以是早期、中期或晚期。关于整合方法的文献正在稳步增加,然而,在公平的实验条件下,以及在考虑不同用例的情况下,几乎没有人知道这些方法的相对性能。

结果

我们开发了一个比较框架,可以在同等条件下训练和优化多组学整合方法。我们纳入了早期整合、PCA 以及最近发表的四种深度学习方法:MOLI、Super.FELT、OmiEmbed 和 MOMA。此外,我们设计了一种新的方法,Omics Stacking,它结合了中期和晚期整合的优势。实验是在一个具有多个组学数据(体细胞点突变、体细胞拷贝数谱和基因表达谱)的公共药物反应数据集上进行的,这些数据是从细胞系、患者来源的异种移植物和患者样本中获得的。我们的实验证实早期整合的预测性能最低。总体而言,整合三重损失的架构取得了最佳结果。然而,在统计上,总体上很少观察到差异,但是在方法的平均排名方面,Super.FELT 在交叉验证设置中表现最好,而 Omics Stacking 在外部测试集设置中表现最好。

结论

我们建议研究人员遵循本文中建议的公平比较协议。当面对新数据集时,Super.FELT 在交叉验证设置中是一个不错的选择,而 Omics Stacking 在外部测试集设置中也是如此。尽管算法排名存在趋势,但统计显著性几乎难以观察到。针对该领域的改进的迁移学习方法的未来工作可能会改善外部测试集的情况。所有实验的源代码可在 https://github.com/kramerlab/Multi-Omics_analysis 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/99167dac7b9e/12859_2023_5166_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/dbc6273c10fc/12859_2023_5166_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/3e85d7c1141d/12859_2023_5166_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/f0a277e6b58f/12859_2023_5166_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/2e3256020607/12859_2023_5166_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/1f4cf279dc81/12859_2023_5166_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/67c3bb0b9e68/12859_2023_5166_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/a514241ca172/12859_2023_5166_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/e626af8df7bc/12859_2023_5166_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/b61ba38fde07/12859_2023_5166_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/aabcf5641d5a/12859_2023_5166_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/99167dac7b9e/12859_2023_5166_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/dbc6273c10fc/12859_2023_5166_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/3e85d7c1141d/12859_2023_5166_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/f0a277e6b58f/12859_2023_5166_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/2e3256020607/12859_2023_5166_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/1f4cf279dc81/12859_2023_5166_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/67c3bb0b9e68/12859_2023_5166_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/a514241ca172/12859_2023_5166_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/e626af8df7bc/12859_2023_5166_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/b61ba38fde07/12859_2023_5166_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/aabcf5641d5a/12859_2023_5166_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/680d/9926634/99167dac7b9e/12859_2023_5166_Fig11_HTML.jpg

相似文献

1
A fair experimental comparison of neural network architectures for latent representations of multi-omics for drug response prediction.神经网络架构在多组学药物反应预测中潜在表示的公平实验比较。
BMC Bioinformatics. 2023 Feb 14;24(1):45. doi: 10.1186/s12859-023-05166-7.
2
Super.FELT: supervised feature extraction learning using triplet loss for drug response prediction with multi-omics data.Super.FELT:基于三重损失的监督特征提取学习在多组学数据药物反应预测中的应用。
BMC Bioinformatics. 2021 May 25;22(1):269. doi: 10.1186/s12859-021-04146-z.
3
MOLI: multi-omics late integration with deep neural networks for drug response prediction.MOLI:基于深度神经网络的多组学晚期整合进行药物反应预测。
Bioinformatics. 2019 Jul 15;35(14):i501-i509. doi: 10.1093/bioinformatics/btz318.
4
moBRCA-net: a breast cancer subtype classification framework based on multi-omics attention neural networks.moBRCA-net:一种基于多组学注意力神经网络的乳腺癌亚型分类框架。
BMC Bioinformatics. 2023 Apr 26;24(1):169. doi: 10.1186/s12859-023-05273-5.
5
Interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment.基于可解释元学习的多组学生存分析和通路富集
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad113.
6
FGCNSurv: dually fused graph convolutional network for multi-omics survival prediction.FGCNSurv:用于多组学生存预测的双重融合图卷积网络。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad472.
7
AVBAE-MODFR: A novel deep learning framework of embedding and feature selection on multi-omics data for pan-cancer classification.AVBAE-MODFR:一种基于多组学数据的嵌入和特征选择的深度学习框架,用于泛癌分类。
Comput Biol Med. 2024 Jul;177:108614. doi: 10.1016/j.compbiomed.2024.108614. Epub 2024 May 14.
8
MOMA: a multi-task attention learning algorithm for multi-omics data interpretation and classification.MOMA:一种用于多组学数据解释和分类的多任务注意力学习算法。
Bioinformatics. 2022 Apr 12;38(8):2287-2296. doi: 10.1093/bioinformatics/btac080.
9
Deeply integrating latent consistent representations in high-noise multi-omics data for cancer subtyping.在高噪声多组学数据中深度集成潜在一致表示以进行癌症亚型分类。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae061.
10
Capturing the latent space of an Autoencoder for multi-omics integration and cancer subtyping.捕获自动编码器的潜在空间,用于多组学整合和癌症亚型分类。
Comput Biol Med. 2022 Sep;148:105832. doi: 10.1016/j.compbiomed.2022.105832. Epub 2022 Jul 5.

引用本文的文献

1
Visible neural networks for multi-omics integration: a critical review.用于多组学整合的可视化神经网络:批判性综述
Front Artif Intell. 2025 Jul 17;8:1595291. doi: 10.3389/frai.2025.1595291. eCollection 2025.

本文引用的文献

1
MOMA: a multi-task attention learning algorithm for multi-omics data interpretation and classification.MOMA:一种用于多组学数据解释和分类的多任务注意力学习算法。
Bioinformatics. 2022 Apr 12;38(8):2287-2296. doi: 10.1093/bioinformatics/btac080.
2
XOmiVAE: an interpretable deep learning model for cancer classification using high-dimensional omics data.XOmiVAE:一种使用高维组学数据进行癌症分类的可解释深度学习模型。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab315.
3
OmiEmbed: A Unified Multi-Task Deep Learning Framework for Multi-Omics Data.
OmiEmbed:一个用于多组学数据的统一多任务深度学习框架。
Cancers (Basel). 2021 Jun 18;13(12):3047. doi: 10.3390/cancers13123047.
4
Super.FELT: supervised feature extraction learning using triplet loss for drug response prediction with multi-omics data.Super.FELT:基于三重损失的监督特征提取学习在多组学数据药物反应预测中的应用。
BMC Bioinformatics. 2021 May 25;22(1):269. doi: 10.1186/s12859-021-04146-z.
5
MOLI: multi-omics late integration with deep neural networks for drug response prediction.MOLI:基于深度神经网络的多组学晚期整合进行药物反应预测。
Bioinformatics. 2019 Jul 15;35(14):i501-i509. doi: 10.1093/bioinformatics/btz318.
6
PLATYPUS: A Multiple-View Learning Predictive Framework for Cancer Drug Sensitivity Prediction.鸭嘴兽:一种用于癌症药物敏感性预测的多视图学习预测框架。
Pac Symp Biocomput. 2019;24:136-147.
7
Pharmacogenomic landscape of patient-derived tumor cells informs precision oncology therapy.患者来源肿瘤细胞的药物基因组学全景分析为精准肿瘤治疗提供信息。
Nat Genet. 2018 Oct;50(10):1399-1411. doi: 10.1038/s41588-018-0209-6. Epub 2018 Sep 27.
8
Deep Learning-Based Multi-Omics Integration Robustly Predicts Survival in Liver Cancer.基于深度学习的多组学整合可稳健预测肝癌患者的生存情况。
Clin Cancer Res. 2018 Mar 15;24(6):1248-1259. doi: 10.1158/1078-0432.CCR-17-0853. Epub 2017 Oct 5.
9
Discovering novel pharmacogenomic biomarkers by imputing drug response in cancer patients from large genomics studies.通过从大型基因组学研究中推断癌症患者的药物反应来发现新的药物基因组生物标志物。
Genome Res. 2017 Oct;27(10):1743-1751. doi: 10.1101/gr.221077.117. Epub 2017 Aug 28.
10
A review on machine learning principles for multi-view biological data integration.机器学习原理在多视图生物数据集成中的研究综述。
Brief Bioinform. 2018 Mar 1;19(2):325-340. doi: 10.1093/bib/bbw113.