• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用神经网络和XGBoost通过学习曲线对多种药物反应预测模式进行比较。

Comparison of multiple modalities for drug response prediction with learning curves using neural networks and XGBoost.

作者信息

Branson Nikhil, Cutillas Pedro R, Bessant Conrad

机构信息

School of Biological and Behavioural Sciences, Queen Mary University of London, London E1 4NS, United Kingdom.

Digital Environment Research Institute, Queen Mary University of London, London E1 1HH, United Kingdom.

出版信息

Bioinform Adv. 2023 Dec 23;4(1):vbad190. doi: 10.1093/bioadv/vbad190. eCollection 2024.

DOI:10.1093/bioadv/vbad190
PMID:38282976
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10812874/
Abstract

MOTIVATION

Anti-cancer drug response prediction is a central problem within stratified medicine. Transcriptomic profiles of cancer cell lines are typically used for drug response prediction, but we hypothesize that proteomics or phosphoproteomics might be more suitable as they give a more direct insight into cellular processes. However, there has not yet been a systematic comparison between all three of these datatypes using consistent evaluation criteria.

RESULTS

Due to the limited number of cell lines with phosphoproteomics profiles we use learning curves, a plot of predictive performance as a function of dataset size, to compare the current performance and predict the future performance of the three omics datasets with more data. We use neural networks and XGBoost and compare them against a simple rule-based benchmark. We show that phosphoproteomics slightly outperforms RNA-seq and proteomics using the 38 cell lines with profiles of all three omics data types. Furthermore, using the 877 cell lines with proteomics and RNA-seq profiles, we show that RNA-seq slightly outperforms proteomics. With the learning curves we predict that the mean squared error using the phosphoproteomics dataset would decrease by if a dataset of the same size as the proteomics/transcriptomics was collected. For the cell lines with proteomics and RNA-seq profiles the learning curves reveal that for smaller dataset sizes neural networks outperform XGBoost and for larger datasets. Furthermore, the trajectory of the XGBoost curve suggests that it will improve faster than the neural networks as more data are collected.

AVAILABILITY AND IMPLEMENTATION

See https://github.com/Nik-BB/Learning-curves-for-DRP for the code used.

摘要

动机

抗癌药物反应预测是分层医学中的核心问题。癌细胞系的转录组谱通常用于药物反应预测,但我们推测蛋白质组学或磷酸化蛋白质组学可能更合适,因为它们能更直接地洞察细胞过程。然而,尚未使用一致的评估标准对这三种数据类型进行系统比较。

结果

由于具有磷酸化蛋白质组学谱的细胞系数量有限,我们使用学习曲线(一种将预测性能绘制成数据集大小函数的图)来比较当前性能,并预测随着数据增多这三种组学数据集的未来性能。我们使用神经网络和XGBoost,并将它们与基于简单规则的基准进行比较。我们表明,使用具有所有三种组学数据类型谱的38个细胞系时,磷酸化蛋白质组学略优于RNA测序和蛋白质组学。此外,使用具有蛋白质组学和RNA测序谱的877个细胞系,我们表明RNA测序略优于蛋白质组学。通过学习曲线,我们预测如果收集与蛋白质组学/转录组学大小相同的数据集,使用磷酸化蛋白质组学数据集的均方误差将降低 。对于具有蛋白质组学和RNA测序谱的细胞系,学习曲线表明对于较小的数据集大小,神经网络优于XGBoost,而对于较大的数据集则相反。此外,XGBoost曲线的轨迹表明,随着更多数据的收集,它将比神经网络改善得更快。

可用性和实现方式

有关所用代码,请参阅https://github.com/Nik-BB/Learning-curves-for-DRP 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9879/10812874/4878f6a35569/vbad190f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9879/10812874/80fcbd8e2b41/vbad190f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9879/10812874/4878f6a35569/vbad190f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9879/10812874/80fcbd8e2b41/vbad190f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9879/10812874/4878f6a35569/vbad190f2.jpg

相似文献

1
Comparison of multiple modalities for drug response prediction with learning curves using neural networks and XGBoost.使用神经网络和XGBoost通过学习曲线对多种药物反应预测模式进行比较。
Bioinform Adv. 2023 Dec 23;4(1):vbad190. doi: 10.1093/bioadv/vbad190. eCollection 2024.
2
MLG2Net: Molecular Global Graph Network for Drug Response Prediction in Lung Cancer Cell Lines.MLG2Net:用于肺癌细胞系药物反应预测的分子全局图网络
J Med Syst. 2025 Apr 10;49(1):47. doi: 10.1007/s10916-025-02182-3.
3
Learning curves for drug response prediction in cancer cell lines.肿瘤细胞系药物反应预测的学习曲线。
BMC Bioinformatics. 2021 May 17;22(1):252. doi: 10.1186/s12859-021-04163-y.
4
TGSA: protein-protein association-based twin graph neural networks for drug response prediction with similarity augmentation.TGSA:基于蛋白质-蛋白质关联的孪生图神经网络,用于通过相似性增强进行药物反应预测。
Bioinformatics. 2022 Jan 3;38(2):461-468. doi: 10.1093/bioinformatics/btab650.
5
MMCL-CDR: enhancing cancer drug response prediction with multi-omics and morphology images contrastive representation learning.MMCL-CDR:基于多组学和形态图像对比表示学习增强癌症药物反应预测
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad734.
6
DEGnext: classification of differentially expressed genes from RNA-seq data using a convolutional neural network with transfer learning.DEGnext:使用具有迁移学习的卷积神经网络对 RNA-seq 数据进行差异表达基因分类。
BMC Bioinformatics. 2022 Jan 6;23(1):17. doi: 10.1186/s12859-021-04527-4.
7
Off-target predictions in CRISPR-Cas9 gene editing using deep learning.使用深度学习进行 CRISPR-Cas9 基因编辑中的脱靶预测。
Bioinformatics. 2018 Sep 1;34(17):i656-i663. doi: 10.1093/bioinformatics/bty554.
8
Phenotype prediction from single-cell RNA-seq data using attention-based neural networks.基于注意力机制神经网络的单细胞 RNA-seq 数据表型预测。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae067.
9
Prediction of Gene Regulatory Connections with Joint Single-Cell Foundation Models and Graph-Based Learning.基于联合单细胞基础模型和基于图的学习预测基因调控连接
bioRxiv. 2025 Jan 29:2024.12.16.628715. doi: 10.1101/2024.12.16.628715.
10
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.

引用本文的文献

1
Understanding the sources of performance in deep drug response models reveals insights and improvements.了解深度药物反应模型中的性能来源可揭示见解并带来改进。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i142-i149. doi: 10.1093/bioinformatics/btaf255.
2
The specification game: rethinking the evaluation of drug response prediction for precision oncology.规范博弈:重新思考精准肿瘤学中药物反应预测的评估以提高精准度
J Cheminform. 2025 Mar 14;17(1):33. doi: 10.1186/s13321-025-00972-y.
3
A method for miRNA diffusion association prediction using machine learning decoding of multi-level heterogeneous graph Transformer encoded representations.

本文引用的文献

1
Deep Neural Networks and Tabular Data: A Survey.深度神经网络与表格数据:一项综述。
IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):7499-7519. doi: 10.1109/TNNLS.2022.3229161. Epub 2024 Jun 3.
2
Deep learning methods for drug response prediction in cancer: Predominant and emerging trends.用于癌症药物反应预测的深度学习方法:主流与新趋势
Front Med (Lausanne). 2023 Feb 15;10:1086097. doi: 10.3389/fmed.2023.1086097. eCollection 2023.
3
The Shape of Learning Curves: A Review.学习曲线的形态:综述
基于多层异质图 Transformer 编码表示的机器学习解码预测 miRNA 扩散关联的方法。
Sci Rep. 2024 Sep 3;14(1):20490. doi: 10.1038/s41598-024-68897-4.
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7799-7819. doi: 10.1109/TPAMI.2022.3220744. Epub 2023 May 5.
4
Opportunities for pharmacoproteomics in biomarker discovery.药物蛋白质组学在生物标志物发现中的机遇。
Proteomics. 2023 Apr;23(7-8):e2200031. doi: 10.1002/pmic.202200031. Epub 2022 Sep 22.
5
Pan-cancer proteomic map of 949 human cell lines.949 个人类细胞系的泛癌症蛋白质组图谱。
Cancer Cell. 2022 Aug 8;40(8):835-849.e8. doi: 10.1016/j.ccell.2022.06.010. Epub 2022 Jul 14.
6
GraphCDR: a graph neural network method with contrastive learning for cancer drug response prediction.GraphCDR:一种基于对比学习的图神经网络方法,用于癌症药物反应预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab457.
7
Drug sensitivity prediction from cell line-based pharmacogenomics data: guidelines for developing machine learning models.从基于细胞系的药物基因组学数据预测药物敏感性:开发机器学习模型的指南。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab294.
8
Converting tabular data into images for deep learning with convolutional neural networks.将表格数据转换为卷积神经网络的深度学习图像。
Sci Rep. 2021 May 31;11(1):11325. doi: 10.1038/s41598-021-90923-y.
9
Learning curves for drug response prediction in cancer cell lines.肿瘤细胞系药物反应预测的学习曲线。
BMC Bioinformatics. 2021 May 17;22(1):252. doi: 10.1186/s12859-021-04163-y.
10
Drug ranking using machine learning systematically predicts the efficacy of anti-cancer drugs.利用机器学习进行药物排名系统地预测了抗癌药物的疗效。
Nat Commun. 2021 Mar 25;12(1):1850. doi: 10.1038/s41467-021-22170-8.