• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于最小基因组信息和化学性质的随机森林模型预测抗癌药物活性。

Predicting drug activity against cancer cells by random forest models based on minimal genomic information and chemical properties.

机构信息

Physical Sciences Division, University of Washington Bothell, Bothell, Washington, United States of America.

出版信息

PLoS One. 2019 Jul 11;14(7):e0219774. doi: 10.1371/journal.pone.0219774. eCollection 2019.

DOI:10.1371/journal.pone.0219774
PMID:31295321
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6622537/
Abstract

A key goal of precision medicine is predicting the best drug therapy for a specific patient from genomic information. In oncology, cancers that appear similar pathologically can vary greatly in how they respond to the same drug. Fortunately, data from high-throughput screening programs often reveal important relationships between genomic variability of cancer cells and their response to drugs. Nevertheless, many current computational methods to predict compound activity against cancer cells require large quantities of genomic, epigenomic, and additional cellular data to develop and to apply. Here we integrate recent screening data and machine learning to train classification models that predict the activity/inactivity of compounds against cancer cells based on the mutational status of only 145 oncogenes and a set of compound structural descriptors. Using IC50 values of 1 μM as activity cutoffs, our predictive models have sensitivities of 87%, specificities of 87%, and yield an area under the receiver operating characteristic curve equal to 0.94. We also develop regression models to predict log(IC50) values of compounds for cancer cells; the models achieve a Pearson correlation coefficient of 0.86 for cross-validation and up to 0.65-0.73 against blind test sets. Predictive performance remains strong when as few as 50 oncogenes are included. Finally, even when 40% of experimental IC50 values are missing from screening data, they can be imputed with sufficient reliability that classification accuracy is not diminished. The presented models are fast to generate and may serve as easily implemented screening tools for personalized oncology medicine, drug repurposing, and drug discovery.

摘要

精准医学的一个主要目标是根据基因组信息预测特定患者的最佳药物治疗方案。在肿瘤学中,病理上表现相似的癌症在对同一药物的反应上可能存在很大差异。幸运的是,高通量筛选计划的数据通常揭示了癌细胞的基因组变异与其对药物的反应之间的重要关系。然而,许多当前用于预测化合物对癌细胞活性的计算方法需要大量的基因组、表观基因组和其他细胞数据来开发和应用。在这里,我们整合了最近的筛选数据和机器学习来训练分类模型,这些模型仅基于 145 个癌基因的突变状态和一组化合物结构描述符来预测化合物对癌细胞的活性/非活性。使用 1 μM 的 IC50 值作为活性截止值,我们的预测模型具有 87%的灵敏度、87%的特异性和 0.94 的接收器工作特征曲线下面积。我们还开发了用于预测化合物对癌细胞的 log(IC50)值的回归模型;这些模型在交叉验证中达到了 0.86 的皮尔逊相关系数,在盲测试集上达到了 0.65-0.73。当包含的癌基因数量少至 50 个时,预测性能仍然很强。最后,即使筛选数据中缺失了 40%的实验 IC50 值,也可以通过足够的可靠性进行推断,而不会降低分类准确性。所提出的模型生成速度快,可作为个性化肿瘤学、药物再利用和药物发现的易于实现的筛选工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9b/6622537/e775f89d4a0b/pone.0219774.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9b/6622537/b0178d56a23c/pone.0219774.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9b/6622537/b502b3799154/pone.0219774.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9b/6622537/6524a8f3ac97/pone.0219774.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9b/6622537/eae6a622f9d4/pone.0219774.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9b/6622537/e775f89d4a0b/pone.0219774.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9b/6622537/b0178d56a23c/pone.0219774.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9b/6622537/b502b3799154/pone.0219774.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9b/6622537/6524a8f3ac97/pone.0219774.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9b/6622537/eae6a622f9d4/pone.0219774.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9b/6622537/e775f89d4a0b/pone.0219774.g005.jpg

相似文献

1
Predicting drug activity against cancer cells by random forest models based on minimal genomic information and chemical properties.基于最小基因组信息和化学性质的随机森林模型预测抗癌药物活性。
PLoS One. 2019 Jul 11;14(7):e0219774. doi: 10.1371/journal.pone.0219774. eCollection 2019.
2
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
3
Machine learning prediction of cancer cell sensitivity to drugs based on genomic and chemical properties.基于基因组和化学特性的机器学习预测癌细胞对药物的敏感性。
PLoS One. 2013 Apr 30;8(4):e61318. doi: 10.1371/journal.pone.0061318. Print 2013.
4
Machine learning random forest for predicting oncosomatic variant NGS analysis.机器学习随机森林预测肿瘤体细胞变异 NGS 分析。
Sci Rep. 2021 Nov 8;11(1):21820. doi: 10.1038/s41598-021-01253-y.
5
Comparison of the predictive outcomes for anti-tuberculosis drug-induced hepatotoxicity by different machine learning techniques.比较不同机器学习技术对抗结核药物性肝损伤的预测结果。
Comput Methods Programs Biomed. 2020 May;188:105307. doi: 10.1016/j.cmpb.2019.105307. Epub 2019 Dec 27.
6
Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction.比较多种机器学习算法和指标进行雌激素受体结合预测。
Mol Pharm. 2018 Oct 1;15(10):4361-4370. doi: 10.1021/acs.molpharmaceut.8b00546. Epub 2018 Aug 28.
7
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
8
Personal Health Information Inference Using Machine Learning on RNA Expression Data from Patients With Cancer: Algorithm Validation Study.利用癌症患者 RNA 表达数据进行机器学习的个人健康信息推断:算法验证研究。
J Med Internet Res. 2020 Aug 10;22(8):e18387. doi: 10.2196/18387.
9
Beyond multidrug resistance: Leveraging rare variants with machine and statistical learning models in Mycobacterium tuberculosis resistance prediction.超越多药耐药性:利用机器和统计学习模型在结核分枝杆菌耐药性预测中的罕见变异。
EBioMedicine. 2019 May;43:356-369. doi: 10.1016/j.ebiom.2019.04.016. Epub 2019 Apr 29.
10
Drug Intervention Response Predictions with PARADIGM (DIRPP) identifies drug resistant cancer cell lines and pathway mechanisms of resistance.使用PARADIGM进行药物干预反应预测(DIRPP)可识别耐药癌细胞系及耐药的通路机制。
Pac Symp Biocomput. 2014:125-35.

引用本文的文献

1
Exploring the role of artificial intelligence in chemotherapy development, cancer diagnosis, and treatment: present achievements and future outlook.探索人工智能在化疗研发、癌症诊断和治疗中的作用:当前成就与未来展望。
Front Oncol. 2025 Feb 4;15:1475893. doi: 10.3389/fonc.2025.1475893. eCollection 2025.
2
Empowering cancer prevention with AI: unlocking new frontiers in prediction, diagnosis, and intervention.利用人工智能助力癌症预防:开启预测、诊断和干预的新前沿。
Cancer Causes Control. 2025 Apr;36(4):353-367. doi: 10.1007/s10552-024-01942-9. Epub 2024 Dec 13.
3
Artificial intelligence in predicting pathogenic microorganisms' antimicrobial resistance: challenges, progress, and prospects.

本文引用的文献

1
Machine learning and feature selection for drug response prediction in precision oncology applications.精准肿瘤学应用中用于药物反应预测的机器学习与特征选择
Biophys Rev. 2019 Feb;11(1):31-39. doi: 10.1007/s12551-018-0446-z. Epub 2018 Aug 10.
2
Cancer Drug Response Profile scan (CDRscan): A Deep Learning Model That Predicts Drug Effectiveness from Cancer Genomic Signature.癌症药物反应特征扫描(CDRscan):一种从癌症基因组特征预测药物疗效的深度学习模型。
Sci Rep. 2018 Jun 11;8(1):8857. doi: 10.1038/s41598-018-27214-6.
3
Next-Generation Machine Learning for Biological Networks.
人工智能在预测病原微生物的抗菌药物耐药性方面的应用:挑战、进展和展望。
Front Cell Infect Microbiol. 2024 Nov 1;14:1482186. doi: 10.3389/fcimb.2024.1482186. eCollection 2024.
4
Predicting Calcein Release from Ultrasound-Targeted Liposomes: A Comparative Analysis of Random Forest and Support Vector Machine.超声靶向脂质体中 calcein 释放的预测:随机森林和支持向量机的比较分析。
Technol Cancer Res Treat. 2024 Jan-Dec;23:15330338241296725. doi: 10.1177/15330338241296725.
5
Contemporary Update on Clinical and Experimental Prostate Cancer Biomarkers: A Multi-Omics-Focused Approach to Detection and Risk Stratification.前列腺癌临床与实验生物标志物的当代进展:一种聚焦多组学的检测与风险分层方法
Biology (Basel). 2024 Sep 25;13(10):762. doi: 10.3390/biology13100762.
6
Identification of Potential Tryptase Inhibitors from FDA-Approved Drugs Using Machine Learning, Molecular Docking, and Experimental Validation.利用机器学习、分子对接和实验验证从美国食品药品监督管理局(FDA)批准的药物中鉴定潜在的类胰蛋白酶抑制剂
ACS Omega. 2024 Sep 4;9(37):38820-38831. doi: 10.1021/acsomega.4c04886. eCollection 2024 Sep 17.
7
The Artificial Intelligence-Powered New Era in Pharmaceutical Research and Development: A Review.人工智能驱动的药物研发新时代:综述。
AAPS PharmSciTech. 2024 Aug 15;25(6):188. doi: 10.1208/s12249-024-02901-y.
8
Application of peripheral blood routine parameters in the diagnosis of influenza and Mycoplasma pneumoniae.外周血常规参数在流感和肺炎支原体诊断中的应用。
Virol J. 2024 Jul 23;21(1):162. doi: 10.1186/s12985-024-02429-4.
9
Virtual Screening of Small Molecules Targeting BCL2 with Machine Learning, Molecular Docking, and MD Simulation.基于机器学习、分子对接和 MD 模拟的靶向 BCL2 小分子的虚拟筛选。
Biomolecules. 2024 May 1;14(5):544. doi: 10.3390/biom14050544.
10
A gray box framework that optimizes a white box logical model using a black box optimizer for simulating cellular responses to perturbations.使用黑盒优化器优化白盒逻辑模型的灰色框框架,用于模拟细胞对扰动的反应。
Cell Rep Methods. 2024 May 20;4(5):100773. doi: 10.1016/j.crmeth.2024.100773. Epub 2024 May 13.
下一代生物网络机器学习。
Cell. 2018 Jun 14;173(7):1581-1592. doi: 10.1016/j.cell.2018.05.015. Epub 2018 Jun 7.
4
A machine learning approach to integrate big data for precision medicine in acute myeloid leukemia.一种用于整合大数据以实现急性髓系白血病精准医疗的机器学习方法。
Nat Commun. 2018 Jan 3;9(1):42. doi: 10.1038/s41467-017-02465-5.
5
Precision and recall oncology: combining multiple gene mutations for improved identification of drug-sensitive tumours.精准与召回肿瘤学:结合多种基因突变以改进对药物敏感肿瘤的识别
Oncotarget. 2017 Sep 15;8(57):97025-97040. doi: 10.18632/oncotarget.20923. eCollection 2017 Nov 14.
6
The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching.化学开发工具包(CDK)v2.0:原子类型标注、描绘、分子式及子结构搜索。
J Cheminform. 2017 Jun 6;9(1):33. doi: 10.1186/s13321-017-0220-4.
7
Computational-experimental approach to drug-target interaction mapping: A case study on kinase inhibitors.药物-靶点相互作用图谱绘制的计算实验方法:以激酶抑制剂为例的研究。
PLoS Comput Biol. 2017 Aug 7;13(8):e1005678. doi: 10.1371/journal.pcbi.1005678. eCollection 2017 Aug.
8
Tumoral expression of drug and xenobiotic metabolizing enzymes in breast cancer patients of different ethnicities with implications to personalized medicine.不同种族乳腺癌患者中药物和外源性物质代谢酶的肿瘤表达及其对个体化医学的意义。
Sci Rep. 2017 Jul 6;7(1):4747. doi: 10.1038/s41598-017-04250-2.
9
Current Trends in Drug Sensitivity Prediction.药物敏感性预测的当前趋势
Curr Pharm Des. 2016;22(46):6918-6927. doi: 10.2174/1381612822666161026154430.
10
Using quantile regression to create baseline norms for neuropsychological tests.使用分位数回归为神经心理学测试创建基线常模。
Alzheimers Dement (Amst). 2015 Dec 19;2:12-8. doi: 10.1016/j.dadm.2015.11.005. eCollection 2016.