利用 TCGA 基因表达数据构建癌症药物反应的预测模型。

Leveraging TCGA gene expression data to build predictive models for cancer drug response.

机构信息

Integrated Cancer Research Center, School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA.

School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA, USA.

出版信息

BMC Bioinformatics. 2020 Sep 30;21(Suppl 14):364. doi: 10.1186/s12859-020-03690-4.

DOI:10.1186/s12859-020-03690-4

PMID:32998700

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7526215/

Abstract

BACKGROUND

Machine learning has been utilized to predict cancer drug response from multi-omics data generated from sensitivities of cancer cell lines to different therapeutic compounds. Here, we build machine learning models using gene expression data from patients' primary tumor tissues to predict whether a patient will respond positively or negatively to two chemotherapeutics: 5-Fluorouracil and Gemcitabine.

RESULTS

We focused on 5-Fluorouracil and Gemcitabine because based on our exclusion criteria, they provide the largest numbers of patients within TCGA. Normalized gene expression data were clustered and used as the input features for the study. We used matching clinical trial data to ascertain the response of these patients via multiple classification methods. Multiple clustering and classification methods were compared for prediction accuracy of drug response. Clara and random forest were found to be the best clustering and classification methods, respectively. The results show our models predict with up to 86% accuracy; despite the study's limitation of sample size. We also found the genes most informative for predicting drug response were enriched in well-known cancer signaling pathways and highlighted their potential significance in chemotherapy prognosis.

CONCLUSIONS

Primary tumor gene expression is a good predictor of cancer drug response. Investment in larger datasets containing both patient gene expression and drug response is needed to support future work of machine learning models. Ultimately, such predictive models may aid oncologists with making critical treatment decisions.

摘要

背景

机器学习已被用于从癌细胞系对不同治疗化合物的敏感性产生的多组学数据中预测癌症药物反应。在这里，我们使用来自患者原发肿瘤组织的基因表达数据构建机器学习模型，以预测患者对两种化疗药物：5-氟尿嘧啶和吉西他滨的反应是阳性还是阴性。

结果

我们专注于 5-氟尿嘧啶和吉西他滨，因为根据我们的排除标准，它们在 TCGA 中提供了最多的患者数量。归一化的基因表达数据被聚类并用作该研究的输入特征。我们使用匹配的临床试验数据通过多种分类方法确定这些患者的反应。比较了多种聚类和分类方法以预测药物反应的准确性。Clara 和随机森林分别被发现是最好的聚类和分类方法。结果表明，我们的模型预测的准确性高达 86％；尽管研究的样本量有限。我们还发现，预测药物反应最有用的基因在已知的癌症信号通路中富集，并强调了它们在化疗预后中的潜在意义。

结论

原发肿瘤基因表达是癌症药物反应的良好预测指标。需要投资更大的数据集，其中包含患者的基因表达和药物反应，以支持未来的机器学习模型工作。最终，这种预测模型可能有助于肿瘤学家做出关键的治疗决策。

相似文献

Leveraging TCGA gene expression data to build predictive models for cancer drug response.

BMC Bioinformatics. 2020 Sep 30;21(Suppl 14):364. doi: 10.1186/s12859-020-03690-4.

Genomic signatures for paclitaxel and gemcitabine resistance in breast cancer derived by machine learning.

Mol Oncol. 2016 Jan;10(1):85-100. doi: 10.1016/j.molonc.2015.07.006. Epub 2015 Aug 22.

A method of gene expression data transfer from cell lines to cancer patients for machine-learning prediction of drug efficiency.

Cell Cycle. 2018;17(4):486-491. doi: 10.1080/15384101.2017.1417706. Epub 2018 Jan 17.

Prediction of Chemosensitivity in Multiple Primary Cancer Patients Using Machine Learning.

Anticancer Res. 2021 May;41(5):2419-2429. doi: 10.21873/anticanres.15017.

Machine learning predicts individual cancer patient responses to therapeutic drugs with high accuracy.

Sci Rep. 2018 Nov 6;8(1):16444. doi: 10.1038/s41598-018-34753-5.

Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?

Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.

Beyond multidrug resistance: Leveraging rare variants with machine and statistical learning models in Mycobacterium tuberculosis resistance prediction.

EBioMedicine. 2019 May;43:356-369. doi: 10.1016/j.ebiom.2019.04.016. Epub 2019 Apr 29.

MCT4 as a potential therapeutic target to augment gemcitabine chemosensitivity in resected pancreatic cancer.

Cell Oncol (Dordr). 2021 Dec;44(6):1363-1371. doi: 10.1007/s13402-021-00643-8. Epub 2021 Nov 18.

Expression of DNA translesion synthesis polymerase η in head and neck squamous cell cancer predicts resistance to gemcitabine and cisplatin-based chemotherapy.

PLoS One. 2013 Dec 20;8(12):e83978. doi: 10.1371/journal.pone.0083978. eCollection 2013.

The Cisplatin, 5-fluorouracil, Irinotecan, and Gemcitabine Treatment in Resistant 2D and 3D Model Triple Negative Breast Cancer Cell Line: ABCG2 Expression Data.

Anticancer Agents Med Chem. 2022;22(2):371-377. doi: 10.2174/1871520621666210727105431.

引用本文的文献

Bridging technology and medicine: artificial intelligence in targeted anticancer drug delivery.

RSC Adv. 2025 Aug 4;15(34):27795-27815. doi: 10.1039/d5ra03747f. eCollection 2025 Aug 1.

A disentangled generative model for improved drug response prediction in patients via sample synthesis.

J Pharm Anal. 2025 Jun;15(6):101128. doi: 10.1016/j.jpha.2024.101128. Epub 2024 Oct 24.

GEPIA3: Enhanced drug sensitivity and interaction network analysis for cancer research.

Nucleic Acids Res. 2025 Jul 7;53(W1):W283-W290. doi: 10.1093/nar/gkaf423.

Identification of Cuproptosis-Associated Prognostic Gene Expression Signatures from 20 Tumor Types.

Biology (Basel). 2024 Oct 3;13(10):793. doi: 10.3390/biology13100793.

Integrating Omics Data and AI for Cancer Diagnosis and Prognosis.

Cancers (Basel). 2024 Jul 3;16(13):2448. doi: 10.3390/cancers16132448.

Survival Rate and Chronic Diseases of TCGA Cancer and KoGES Normal Samples by Clustering for DNA Methylation.

Life (Basel). 2024 Jun 17;14(6):768. doi: 10.3390/life14060768.

A context-aware deconfounding autoencoder for robust prediction of personalized clinical drug response from cell-line compound screening.

Nat Mach Intell. 2022 Oct;4(10):879-892. doi: 10.1038/s42256-022-00541-0. Epub 2022 Oct 17.

Large-Scale Machine Learning Analysis Reveals DNA Methylation and Gene Expression Response Signatures for Gemcitabine-Treated Pancreatic Cancer.

Health Data Sci. 2024 Jan 8;4:0108. doi: 10.34133/hds.0108. eCollection 2024.

Uncovering the subtype-specific disease module and the development of drug response prediction models for glioma.

Heliyon. 2024 Mar 1;10(5):e27190. doi: 10.1016/j.heliyon.2024.e27190. eCollection 2024 Mar 15.

Recent Advancements in Subcellular Proteomics: Growing Impact of Organellar Protein Niches on the Understanding of Cell Biology.

J Proteome Res. 2024 Aug 2;23(8):2700-2722. doi: 10.1021/acs.jproteome.3c00839. Epub 2024 Mar 7.

本文引用的文献

Anticancer Drug Response Prediction in Cell Lines Using Weighted Graph Regularized Matrix Factorization.

Mol Ther Nucleic Acids. 2019 Sep 6;17:164-174. doi: 10.1016/j.omtn.2019.05.017. Epub 2019 Jun 4.

Predicting drug response of tumors from integrated genomic profiles by deep neural networks.

BMC Med Genomics. 2019 Jan 31;12(Suppl 1):18. doi: 10.1186/s12920-018-0460-9.

Comprehensive anticancer drug response prediction based on a simple cell line-drug complex network model.

BMC Bioinformatics. 2019 Jan 22;20(1):44. doi: 10.1186/s12859-019-2608-9.

Integrin trafficking in cells and tissues.

Nat Cell Biol. 2019 Feb;21(2):122-132. doi: 10.1038/s41556-018-0223-z. Epub 2019 Jan 2.

Anti-cancer Drug Response Prediction Using Neighbor-Based Collaborative Filtering with Global Effect Removal.

Mol Ther Nucleic Acids. 2018 Dec 7;13:303-311. doi: 10.1016/j.omtn.2018.09.011. Epub 2018 Sep 22.

A novel approach for drug response prediction in cancer cell lines via network representation learning.

Bioinformatics. 2019 May 1;35(9):1527-1535. doi: 10.1093/bioinformatics/bty848.

Cancer Drug Response Profile scan (CDRscan): A Deep Learning Model That Predicts Drug Effectiveness from Cancer Genomic Signature.

Sci Rep. 2018 Jun 11;8(1):8857. doi: 10.1038/s41598-018-27214-6.

Predicting Cancer Drug Response using a Recommender System.

Bioinformatics. 2018 Nov 15;34(22):3907-3914. doi: 10.1093/bioinformatics/bty452.

Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer.

Cell. 2018 Apr 5;173(2):291-304.e6. doi: 10.1016/j.cell.2018.03.022.

Using human brain activity to guide machine learning.

Sci Rep. 2018 Mar 29;8(1):5397. doi: 10.1038/s41598-018-23618-6.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用 TCGA 基因表达数据构建癌症药物反应的预测模型。

Leveraging TCGA gene expression data to build predictive models for cancer drug response.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献