机器学习和数据挖掘框架在癌症药物反应预测中的应用：综述及基于关联规则挖掘的新型计算机筛选流程

Machine learning and data mining frameworks for predicting drug response in cancer: An overview and a novel in silico screening process based on association rule mining.

机构信息

Biomedical Research Foundation of the Academy of Athens, 4 Soranou Ephessiou Str., Athens GR-11527, Greece; Molecular Carcinogenesis Group, Department of Histology and Embryology, School of Medicine, National and Kapodistrian University of Athens, 75 Mikras Asias Str, Athens GR-11527, Greece.

Department of Pathology, NYU School of Medicine, New York, NY 10016, USA; Laura and Isaac Perlmutter Cancer Center, NYU School of Medicine, New York, NY 10016, USA.

出版信息

Pharmacol Ther. 2019 Nov;203:107395. doi: 10.1016/j.pharmthera.2019.107395. Epub 2019 Jul 30.

DOI:10.1016/j.pharmthera.2019.107395

PMID:31374225

Abstract

A major challenge in cancer treatment is predicting the clinical response to anti-cancer drugs on a personalized basis. The success of such a task largely depends on the ability to develop computational resources that integrate big "omic" data into effective drug-response models. Machine learning is both an expanding and an evolving computational field that holds promise to cover such needs. Here we provide a focused overview of: 1) the various supervised and unsupervised algorithms used specifically in drug response prediction applications, 2) the strategies employed to develop these algorithms into applicable models, 3) data resources that are fed into these frameworks and 4) pitfalls and challenges to maximize model performance. In this context we also describe a novel in silico screening process, based on Association Rule Mining, for identifying genes as candidate drivers of drug response and compare it with relevant data mining frameworks, for which we generated a web application freely available at: https://compbio.nyumc.org/drugs/. This pipeline explores with high efficiency large sample-spaces, while is able to detect low frequency events and evaluate statistical significance even in the multidimensional space, presenting the results in the form of easily interpretable rules. We conclude with future prospects and challenges of applying machine learning based drug response prediction in precision medicine.

摘要

癌症治疗的一个主要挑战是在个体化基础上预测抗癌药物的临床反应。这项任务的成功在很大程度上取决于开发能够将大型“组学”数据整合到有效药物反应模型中的计算资源的能力。机器学习是一个不断扩展和发展的计算领域，有望满足这些需求。在这里，我们重点介绍：1）专门用于药物反应预测应用的各种有监督和无监督算法，2）将这些算法开发成适用模型所采用的策略，3）输入这些框架的数据资源，以及 4）最大限度提高模型性能的陷阱和挑战。在这种情况下，我们还描述了一种基于关联规则挖掘的新的计算机筛选过程，用于识别候选药物反应驱动基因，并将其与相关的数据挖掘框架进行比较，我们为此生成了一个免费的网络应用程序，可在：https://compbio.nyumc.org/drugs/。该流水线高效地探索了大型样本空间，同时能够在多维空间中检测低频事件并评估统计显著性，以易于解释的规则形式呈现结果。我们最后对基于机器学习的药物反应预测在精准医学中的应用的未来前景和挑战进行了总结。

相似文献

Machine learning and data mining frameworks for predicting drug response in cancer: An overview and a novel in silico screening process based on association rule mining.

Pharmacol Ther. 2019 Nov;203:107395. doi: 10.1016/j.pharmthera.2019.107395. Epub 2019 Jul 30.

A systematic review of data mining and machine learning for air pollution epidemiology.

BMC Public Health. 2017 Nov 28;17(1):907. doi: 10.1186/s12889-017-4914-3.

Unsupervised Tensor Mining for Big Data Practitioners.

Big Data. 2016 Sep;4(3):179-91. doi: 10.1089/big.2016.0026.

R.ROSETTA: an interpretable machine learning framework.

BMC Bioinformatics. 2021 Mar 6;22(1):110. doi: 10.1186/s12859-021-04049-z.

Comparing different supervised machine learning algorithms for disease prediction.

BMC Med Inform Decis Mak. 2019 Dec 21;19(1):281. doi: 10.1186/s12911-019-1004-8.

J Biomed Inform. 2019 Feb;90:103103. doi: 10.1016/j.jbi.2019.103103. Epub 2019 Jan 9.

eDoctor: machine learning and the future of medicine.

J Intern Med. 2018 Dec;284(6):603-619. doi: 10.1111/joim.12822. Epub 2018 Sep 3.

Intelligently Applying Artificial Intelligence in Chemoinformatics.

Curr Top Med Chem. 2018;18(20):1804-1826. doi: 10.2174/1568026619666181120150938.

LEMRG: Decision Rule Generation Algorithm for Mining MicroRNA Expression Data.

Adv Exp Med Biol. 2017;1028:105-137. doi: 10.1007/978-981-10-6041-0_7.

Machine learning and big data analytics in bipolar disorder: A position paper from the International Society for Bipolar Disorders Big Data Task Force.

Bipolar Disord. 2019 Nov;21(7):582-594. doi: 10.1111/bdi.12828. Epub 2019 Sep 18.

引用本文的文献

Comprehensive analysis of single-cell and bulk RNA sequencing data reveals an EGFR signature for predicting immunotherapy response and prognosis in pan-cancer.

Front Immunol. 2025 Jun 12;16:1604394. doi: 10.3389/fimmu.2025.1604394. eCollection 2025.

Use of Computational Intelligence in Customizing Drug Release from 3D-Printed Products: A Comprehensive Review.

Pharmaceutics. 2025 Apr 23;17(5):551. doi: 10.3390/pharmaceutics17050551.

Comparative LC-MS Proteomics of Quinoa Grains: Evaluation of Bioactivity and Health Benefits by Combining In Silico Techniques With In Vitro Assays on Colorectal Adenocarcinoma Cells.

Mol Nutr Food Res. 2025 Jul;69(14):e70125. doi: 10.1002/mnfr.70125. Epub 2025 May 23.

The rules in co-infection of multiple viruses across diverse lineages in a fungal host.

mBio. 2025 Jun 11;16(6):e0026225. doi: 10.1128/mbio.00262-25. Epub 2025 May 20.

Anemia Risk Prediction Model for Osteosarcoma Patients Post-Chemotherapy Using Artificial Intelligence.

Cancer Med. 2024 Dec;13(23):e70427. doi: 10.1002/cam4.70427.

Unraveling the Mysteries of Alzheimer's Disease Using Artificial Intelligence.

Rev Recent Clin Trials. 2025;20(2):124-141. doi: 10.2174/0115748871330861241030143321.

Predicting Calcein Release from Ultrasound-Targeted Liposomes: A Comparative Analysis of Random Forest and Support Vector Machine.

Technol Cancer Res Treat. 2024 Jan-Dec;23:15330338241296725. doi: 10.1177/15330338241296725.

Mechanisms of Senescence and Anti-Senescence Strategies in the Skin.

Biology (Basel). 2024 Aug 23;13(9):647. doi: 10.3390/biology13090647.

Integrative analysis of pan-cancer single-cell data reveals a tumor ecosystem subtype predicting immunotherapy response.

NPJ Precis Oncol. 2024 Sep 15;8(1):205. doi: 10.1038/s41698-024-00703-w.

Mime: A flexible machine-learning framework to construct and visualize models for clinical characteristics prediction and feature selection.

Comput Struct Biotechnol J. 2024 Jun 29;23:2798-2810. doi: 10.1016/j.csbj.2024.06.035. eCollection 2024 Dec.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

机器学习和数据挖掘框架在癌症药物反应预测中的应用：综述及基于关联规则挖掘的新型计算机筛选流程

Machine learning and data mining frameworks for predicting drug response in cancer: An overview and a novel in silico screening process based on association rule mining.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献