转移学习在癌症药物敏感性预测中的应用。

Application of transfer learning for cancer drug sensitivity prediction.

机构信息

Department of Electrical and Computer Engineering, Texas Tech University, 1012 Boston Ave, Lubbock, 79409, TX, USA.

Department of Mathematics and Statistics, Texas Tech University, 1108 Memorial Circle, Lubbock, 79409, TX, USA.

出版信息

BMC Bioinformatics. 2018 Dec 28;19(Suppl 17):497. doi: 10.1186/s12859-018-2465-y.

DOI:10.1186/s12859-018-2465-y

PMID:30591023

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6309077/

Abstract

BACKGROUND

In precision medicine, scarcity of suitable biological data often hinders the design of an appropriate predictive model. In this regard, large scale pharmacogenomics studies, like CCLE and GDSC hold the promise to mitigate the issue. However, one cannot directly employ data from multiple sources together due to the existing distribution shift in data. One way to solve this problem is to utilize the transfer learning methodologies tailored to fit in this specific context.

RESULTS

In this paper, we present two novel approaches for incorporating information from a secondary database for improving the prediction in a target database. The first approach is based on latent variable cost optimization and the second approach considers polynomial mapping between the two databases. Utilizing CCLE and GDSC databases, we illustrate that the proposed approaches accomplish a better prediction of drug sensitivities for different scenarios as compared to the existing approaches.

CONCLUSION

We have compared the performance of the proposed predictive models with database-specific individual models as well as existing transfer learning approaches. We note that our proposed approaches exhibit superior performance compared to the abovementioned alternative techniques for predicting sensitivity for different anti-cancer compounds, particularly the nonlinear mapping model shows the best overall performance.

摘要

背景

在精准医学中，合适的生物数据的稀缺常常阻碍了合适的预测模型的设计。在这方面，大规模的药物基因组学研究，如 CCLE 和 GDSC，有望缓解这一问题。然而，由于数据中存在分布偏移，不能直接将来自多个来源的数据一起使用。解决这个问题的一种方法是利用专门针对这种特定情况的迁移学习方法。

结果

在本文中，我们提出了两种新的方法，用于结合来自辅助数据库的信息以提高目标数据库中的预测。第一种方法基于潜在变量成本优化，第二种方法考虑两个数据库之间的多项式映射。利用 CCLE 和 GDSC 数据库，我们表明，与现有方法相比，所提出的方法在不同情况下对药物敏感性的预测更准确。

结论

我们将所提出的预测模型的性能与数据库特定的个体模型以及现有的迁移学习方法进行了比较。我们注意到，与上述替代技术相比，我们提出的方法在预测不同抗癌化合物的敏感性方面表现出更好的性能，特别是非线性映射模型表现出最佳的整体性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c86d/6309077/a147b2d1a671/12859_2018_2465_Fig1_HTML.jpg

相似文献

Application of transfer learning for cancer drug sensitivity prediction.

BMC Bioinformatics. 2018 Dec 28;19(Suppl 17):497. doi: 10.1186/s12859-018-2465-y.

A transfer learning approach via procrustes analysis and mean shift for cancer drug sensitivity prediction.

J Bioinform Comput Biol. 2018 Jun;16(3):1840014. doi: 10.1142/S0219720018400140.

Improved anticancer drug response prediction in cell lines using matrix factorization with similarity regularization.

BMC Cancer. 2017 Aug 2;17(1):513. doi: 10.1186/s12885-017-3500-5.

Evaluating the consistency of large-scale pharmacogenomic studies.

Brief Bioinform. 2019 Sep 27;20(5):1734-1753. doi: 10.1093/bib/bby046.

A link prediction approach to cancer drug sensitivity prediction.

BMC Syst Biol. 2017 Oct 3;11(Suppl 5):94. doi: 10.1186/s12918-017-0463-8.

Ensembled machine learning framework for drug sensitivity prediction.

IET Syst Biol. 2020 Feb;14(1):39-46. doi: 10.1049/iet-syb.2018.5094.

Integrating heterogeneous drug sensitivity data from cancer pharmacogenomic studies.

Oncotarget. 2016 Aug 9;7(32):51619-51625. doi: 10.18632/oncotarget.10010.

Heterogeneity Aware Random Forest for Drug Sensitivity Prediction.

Sci Rep. 2017 Sep 12;7(1):11347. doi: 10.1038/s41598-017-11665-4.

Deep-Resp-Forest: A deep forest model to predict anti-cancer drug response.

Methods. 2019 Aug 15;166:91-102. doi: 10.1016/j.ymeth.2019.02.009. Epub 2019 Feb 14.

Functional random forest with applications in dose-response predictions.

Sci Rep. 2019 Feb 7;9(1):1628. doi: 10.1038/s41598-018-38231-w.

引用本文的文献

Bridging Data Gaps in Healthcare: A Scoping Review of Transfer Learning in Structured Data Analysis.

Health Data Sci. 2025 Sep 3;5:0321. doi: 10.34133/hds.0321. eCollection 2025.

Cancer Drug Sensitivity Prediction Based on Deep Transfer Learning.

Int J Mol Sci. 2025 Mar 10;26(6):2468. doi: 10.3390/ijms26062468.

A Knowledge-Guided Graph Learning Approach Bridging Phenotype- and Target-Based Drug Discovery.

Adv Sci (Weinh). 2025 Apr;12(16):e2412402. doi: 10.1002/advs.202412402. Epub 2025 Mar 6.

Gene Signatures and Oncology Treatment Implications.

Hematol Oncol Clin North Am. 2025 Apr;39(2):295-307. doi: 10.1016/j.hoc.2024.11.003. Epub 2024 Dec 17.

Transfer learning for genotype-phenotype prediction using deep learning models.

BMC Bioinformatics. 2022 Nov 29;23(1):511. doi: 10.1186/s12859-022-05036-8.

Deep transfer learning of cancer drug responses by integrating bulk and single-cell RNA-seq data.

Nat Commun. 2022 Oct 30;13(1):6494. doi: 10.1038/s41467-022-34277-7.

Identification of phenocopies improves prediction of targeted therapy response over DNA mutations alone.

NPJ Genom Med. 2022 Oct 17;7(1):58. doi: 10.1038/s41525-022-00328-7.

Molecular pathways enhance drug response prediction using transfer learning from cell lines to tumors and patient-derived xenografts.

Sci Rep. 2022 Sep 27;12(1):16109. doi: 10.1038/s41598-022-20646-1.

A feature transferring workflow between data-poor compounds in various tasks.

PLoS One. 2022 Mar 30;17(3):e0266088. doi: 10.1371/journal.pone.0266088. eCollection 2022.

Predicting cancer drug TARGETS - TreAtment Response Generalized Elastic-neT Signatures.

NPJ Genom Med. 2021 Sep 21;6(1):76. doi: 10.1038/s41525-021-00239-z.

本文引用的文献

IntegratedMRF: random forest-based framework for integrating prediction from different data types.

Bioinformatics. 2017 May 1;33(9):1407-1410. doi: 10.1093/bioinformatics/btw765.

Extracting a low-dimensional description of multiple gene expression datasets reveals a potential driver for tumor-associated stroma in ovarian cancer.

Genome Med. 2016 Jun 10;8(1):66. doi: 10.1186/s13073-016-0319-7.

Design of Probabilistic Random Forests with Applications to Anticancer Drug Sensitivity Prediction.

Cancer Inform. 2016 Mar 31;14(Suppl 5):57-73. doi: 10.4137/CIN.S30794. eCollection 2015.

Pharmacogenomic agreement between two cancer cell line data sets.

Nature. 2015 Dec 3;528(7580):84-7. doi: 10.1038/nature15736. Epub 2015 Nov 16.

Inconsistency in large pharmacogenomic studies.

Nature. 2013 Dec 19;504(7480):389-93. doi: 10.1038/nature12831. Epub 2013 Nov 27.

Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells.

Nucleic Acids Res. 2013 Jan;41(Database issue):D955-61. doi: 10.1093/nar/gks1111. Epub 2012 Nov 23.

The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity.

Nature. 2012 Mar 28;483(7391):603-7. doi: 10.1038/nature11003.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

转移学习在癌症药物敏感性预测中的应用。

Application of transfer learning for cancer drug sensitivity prediction.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献