• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 TCGA 基因表达数据构建癌症药物反应的预测模型。

Leveraging TCGA gene expression data to build predictive models for cancer drug response.

机构信息

Integrated Cancer Research Center, School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA.

School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA, USA.

出版信息

BMC Bioinformatics. 2020 Sep 30;21(Suppl 14):364. doi: 10.1186/s12859-020-03690-4.

DOI:10.1186/s12859-020-03690-4
PMID:32998700
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7526215/
Abstract

BACKGROUND

Machine learning has been utilized to predict cancer drug response from multi-omics data generated from sensitivities of cancer cell lines to different therapeutic compounds. Here, we build machine learning models using gene expression data from patients' primary tumor tissues to predict whether a patient will respond positively or negatively to two chemotherapeutics: 5-Fluorouracil and Gemcitabine.

RESULTS

We focused on 5-Fluorouracil and Gemcitabine because based on our exclusion criteria, they provide the largest numbers of patients within TCGA. Normalized gene expression data were clustered and used as the input features for the study. We used matching clinical trial data to ascertain the response of these patients via multiple classification methods. Multiple clustering and classification methods were compared for prediction accuracy of drug response. Clara and random forest were found to be the best clustering and classification methods, respectively. The results show our models predict with up to 86% accuracy; despite the study's limitation of sample size. We also found the genes most informative for predicting drug response were enriched in well-known cancer signaling pathways and highlighted their potential significance in chemotherapy prognosis.

CONCLUSIONS

Primary tumor gene expression is a good predictor of cancer drug response. Investment in larger datasets containing both patient gene expression and drug response is needed to support future work of machine learning models. Ultimately, such predictive models may aid oncologists with making critical treatment decisions.

摘要

背景

机器学习已被用于从癌细胞系对不同治疗化合物的敏感性产生的多组学数据中预测癌症药物反应。在这里,我们使用来自患者原发肿瘤组织的基因表达数据构建机器学习模型,以预测患者对两种化疗药物:5-氟尿嘧啶和吉西他滨的反应是阳性还是阴性。

结果

我们专注于 5-氟尿嘧啶和吉西他滨,因为根据我们的排除标准,它们在 TCGA 中提供了最多的患者数量。归一化的基因表达数据被聚类并用作该研究的输入特征。我们使用匹配的临床试验数据通过多种分类方法确定这些患者的反应。比较了多种聚类和分类方法以预测药物反应的准确性。Clara 和随机森林分别被发现是最好的聚类和分类方法。结果表明,我们的模型预测的准确性高达 86%;尽管研究的样本量有限。我们还发现,预测药物反应最有用的基因在已知的癌症信号通路中富集,并强调了它们在化疗预后中的潜在意义。

结论

原发肿瘤基因表达是癌症药物反应的良好预测指标。需要投资更大的数据集,其中包含患者的基因表达和药物反应,以支持未来的机器学习模型工作。最终,这种预测模型可能有助于肿瘤学家做出关键的治疗决策。

相似文献

1
Leveraging TCGA gene expression data to build predictive models for cancer drug response.利用 TCGA 基因表达数据构建癌症药物反应的预测模型。
BMC Bioinformatics. 2020 Sep 30;21(Suppl 14):364. doi: 10.1186/s12859-020-03690-4.
2
Genomic signatures for paclitaxel and gemcitabine resistance in breast cancer derived by machine learning.通过机器学习得出的乳腺癌中对紫杉醇和吉西他滨耐药的基因组特征
Mol Oncol. 2016 Jan;10(1):85-100. doi: 10.1016/j.molonc.2015.07.006. Epub 2015 Aug 22.
3
A method of gene expression data transfer from cell lines to cancer patients for machine-learning prediction of drug efficiency.一种将基因表达数据从细胞系转移到癌症患者的方法,用于机器学习预测药物疗效。
Cell Cycle. 2018;17(4):486-491. doi: 10.1080/15384101.2017.1417706. Epub 2018 Jan 17.
4
Prediction of Chemosensitivity in Multiple Primary Cancer Patients Using Machine Learning.基于机器学习的多原发癌患者化疗敏感性预测。
Anticancer Res. 2021 May;41(5):2419-2429. doi: 10.21873/anticanres.15017.
5
Machine learning predicts individual cancer patient responses to therapeutic drugs with high accuracy.机器学习可以高精度地预测个体癌症患者对治疗药物的反应。
Sci Rep. 2018 Nov 6;8(1):16444. doi: 10.1038/s41598-018-34753-5.
6
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
7
Beyond multidrug resistance: Leveraging rare variants with machine and statistical learning models in Mycobacterium tuberculosis resistance prediction.超越多药耐药性:利用机器和统计学习模型在结核分枝杆菌耐药性预测中的罕见变异。
EBioMedicine. 2019 May;43:356-369. doi: 10.1016/j.ebiom.2019.04.016. Epub 2019 Apr 29.
8
MCT4 as a potential therapeutic target to augment gemcitabine chemosensitivity in resected pancreatic cancer.MCT4 作为一种潜在的治疗靶点,可增强可切除胰腺癌对吉西他滨化疗的敏感性。
Cell Oncol (Dordr). 2021 Dec;44(6):1363-1371. doi: 10.1007/s13402-021-00643-8. Epub 2021 Nov 18.
9
Expression of DNA translesion synthesis polymerase η in head and neck squamous cell cancer predicts resistance to gemcitabine and cisplatin-based chemotherapy.DNA跨损伤合成聚合酶η在头颈部鳞状细胞癌中的表达预示着对吉西他滨和顺铂化疗的耐药性。
PLoS One. 2013 Dec 20;8(12):e83978. doi: 10.1371/journal.pone.0083978. eCollection 2013.
10
The Cisplatin, 5-fluorouracil, Irinotecan, and Gemcitabine Treatment in Resistant 2D and 3D Model Triple Negative Breast Cancer Cell Line: ABCG2 Expression Data.顺铂、5-氟尿嘧啶、伊立替康和吉西他滨治疗耐药 2D 和 3D 模型三阴性乳腺癌细胞系:ABCG2 表达数据。
Anticancer Agents Med Chem. 2022;22(2):371-377. doi: 10.2174/1871520621666210727105431.

引用本文的文献

1
Bridging technology and medicine: artificial intelligence in targeted anticancer drug delivery.连接技术与医学:人工智能在靶向抗癌药物递送中的应用
RSC Adv. 2025 Aug 4;15(34):27795-27815. doi: 10.1039/d5ra03747f. eCollection 2025 Aug 1.
2
A disentangled generative model for improved drug response prediction in patients via sample synthesis.一种通过样本合成改进患者药物反应预测的解缠生成模型。
J Pharm Anal. 2025 Jun;15(6):101128. doi: 10.1016/j.jpha.2024.101128. Epub 2024 Oct 24.
3
GEPIA3: Enhanced drug sensitivity and interaction network analysis for cancer research.

本文引用的文献

1
Anticancer Drug Response Prediction in Cell Lines Using Weighted Graph Regularized Matrix Factorization.基于加权图正则化矩阵分解的细胞系抗癌药物反应预测
Mol Ther Nucleic Acids. 2019 Sep 6;17:164-174. doi: 10.1016/j.omtn.2019.05.017. Epub 2019 Jun 4.
2
Predicting drug response of tumors from integrated genomic profiles by deep neural networks.基于深度神经网络的整合基因组图谱预测肿瘤药物反应
BMC Med Genomics. 2019 Jan 31;12(Suppl 1):18. doi: 10.1186/s12920-018-0460-9.
3
Comprehensive anticancer drug response prediction based on a simple cell line-drug complex network model.
GEPIA3:用于癌症研究的增强型药物敏感性和相互作用网络分析
Nucleic Acids Res. 2025 Jul 7;53(W1):W283-W290. doi: 10.1093/nar/gkaf423.
4
Identification of Cuproptosis-Associated Prognostic Gene Expression Signatures from 20 Tumor Types.从20种肿瘤类型中鉴定铜死亡相关的预后基因表达特征
Biology (Basel). 2024 Oct 3;13(10):793. doi: 10.3390/biology13100793.
5
Integrating Omics Data and AI for Cancer Diagnosis and Prognosis.整合组学数据与人工智能用于癌症诊断和预后评估
Cancers (Basel). 2024 Jul 3;16(13):2448. doi: 10.3390/cancers16132448.
6
Survival Rate and Chronic Diseases of TCGA Cancer and KoGES Normal Samples by Clustering for DNA Methylation.通过对DNA甲基化进行聚类分析TCGA癌症样本和KoGES正常样本的生存率及慢性病情况
Life (Basel). 2024 Jun 17;14(6):768. doi: 10.3390/life14060768.
7
A context-aware deconfounding autoencoder for robust prediction of personalized clinical drug response from cell-line compound screening.一种用于从细胞系化合物筛选中稳健预测个性化临床药物反应的上下文感知去混杂自动编码器。
Nat Mach Intell. 2022 Oct;4(10):879-892. doi: 10.1038/s42256-022-00541-0. Epub 2022 Oct 17.
8
Large-Scale Machine Learning Analysis Reveals DNA Methylation and Gene Expression Response Signatures for Gemcitabine-Treated Pancreatic Cancer.大规模机器学习分析揭示吉西他滨治疗胰腺癌的DNA甲基化和基因表达反应特征
Health Data Sci. 2024 Jan 8;4:0108. doi: 10.34133/hds.0108. eCollection 2024.
9
Uncovering the subtype-specific disease module and the development of drug response prediction models for glioma.揭示胶质瘤的亚型特异性疾病模块及药物反应预测模型的开发。
Heliyon. 2024 Mar 1;10(5):e27190. doi: 10.1016/j.heliyon.2024.e27190. eCollection 2024 Mar 15.
10
Recent Advancements in Subcellular Proteomics: Growing Impact of Organellar Protein Niches on the Understanding of Cell Biology.亚细胞蛋白质组学的最新进展:细胞器蛋白质龛对细胞生物学理解的影响日益增大。
J Proteome Res. 2024 Aug 2;23(8):2700-2722. doi: 10.1021/acs.jproteome.3c00839. Epub 2024 Mar 7.
基于简单的细胞系-药物复合物网络模型的全面抗癌药物反应预测。
BMC Bioinformatics. 2019 Jan 22;20(1):44. doi: 10.1186/s12859-019-2608-9.
4
Integrin trafficking in cells and tissues.细胞和组织中的整合素运输。
Nat Cell Biol. 2019 Feb;21(2):122-132. doi: 10.1038/s41556-018-0223-z. Epub 2019 Jan 2.
5
Anti-cancer Drug Response Prediction Using Neighbor-Based Collaborative Filtering with Global Effect Removal.使用基于邻居的协同过滤并去除全局效应进行抗癌药物反应预测。
Mol Ther Nucleic Acids. 2018 Dec 7;13:303-311. doi: 10.1016/j.omtn.2018.09.011. Epub 2018 Sep 22.
6
A novel approach for drug response prediction in cancer cell lines via network representation learning.通过网络表示学习预测癌症细胞系中的药物反应的新方法。
Bioinformatics. 2019 May 1;35(9):1527-1535. doi: 10.1093/bioinformatics/bty848.
7
Cancer Drug Response Profile scan (CDRscan): A Deep Learning Model That Predicts Drug Effectiveness from Cancer Genomic Signature.癌症药物反应特征扫描(CDRscan):一种从癌症基因组特征预测药物疗效的深度学习模型。
Sci Rep. 2018 Jun 11;8(1):8857. doi: 10.1038/s41598-018-27214-6.
8
Predicting Cancer Drug Response using a Recommender System.利用推荐系统预测癌症药物反应。
Bioinformatics. 2018 Nov 15;34(22):3907-3914. doi: 10.1093/bioinformatics/bty452.
9
Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer.起源细胞模式主导了 33 种癌症类型的 10000 个肿瘤的分子分类。
Cell. 2018 Apr 5;173(2):291-304.e6. doi: 10.1016/j.cell.2018.03.022.
10
Using human brain activity to guide machine learning.利用人类大脑活动来指导机器学习。
Sci Rep. 2018 Mar 29;8(1):5397. doi: 10.1038/s41598-018-23618-6.