Suppr
超能文献

元定量构效关系（Meta-QSAR）：元学习在药物设计与发现中的大规模应用。

Meta-QSAR: a large-scale application of meta-learning to drug design and discovery.

作者信息

Olier Ivan, Sadawi Noureddin, Bickerton G Richard, Vanschoren Joaquin, Grosan Crina, Soldatova Larisa, King Ross D

机构信息

1Manchester Metropolitan University, Manchester, UK.

2University of Manchester, Manchester, UK.

出版信息

Mach Learn. 2018;107(1):285-311. doi: 10.1007/s10994-017-5685-x. Epub 2017 Dec 22.

DOI:10.1007/s10994-017-5685-x

PMID:31997851

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6956898/

Abstract

We investigate the learning of quantitative structure activity relationships (QSARs) as a case-study of meta-learning. This application area is of the highest societal importance, as it is a key step in the development of new medicines. The standard QSAR learning problem is: given a target (usually a protein) and a set of chemical compounds (small molecules) with associated bioactivities (e.g. inhibition of the target), learn a predictive mapping from molecular representation to activity. Although almost every type of machine learning method has been applied to QSAR learning there is no agreed single best way of learning QSARs, and therefore the problem area is well-suited to meta-learning. We first carried out the most comprehensive ever comparison of machine learning methods for QSAR learning: 18 regression methods, 3 molecular representations, applied to more than 2700 QSAR problems. (These results have been made publicly available on OpenML and represent a valuable resource for testing novel meta-learning methods.) We then investigated the utility of algorithm selection for QSAR problems. We found that this meta-learning approach outperformed the best individual QSAR learning method (random forests using a molecular fingerprint representation) by up to 13%, on average. We conclude that meta-learning outperforms base-learning methods for QSAR learning, and as this investigation is one of the most extensive ever comparisons of base and meta-learning methods ever made, it provides evidence for the general effectiveness of meta-learning over base-learning.

摘要

我们研究定量构效关系（QSAR）的学习，以此作为元学习的一个案例研究。这个应用领域具有极其重要的社会意义，因为它是新药研发的关键一步。标准的QSAR学习问题是：给定一个靶点（通常是一种蛋白质）和一组具有相关生物活性（例如对靶点的抑制作用）的化合物（小分子），学习从分子表征到活性的预测映射。尽管几乎每种机器学习方法都已应用于QSAR学习，但对于学习QSAR并没有一种公认的最佳单一方法，因此这个问题领域非常适合元学习。我们首先对用于QSAR学习的机器学习方法进行了有史以来最全面的比较：18种回归方法、3种分子表征，应用于2700多个QSAR问题。（这些结果已在OpenML上公开，是测试新型元学习方法的宝贵资源。）然后我们研究了算法选择对QSAR问题的效用。我们发现，这种元学习方法平均比最佳的单个QSAR学习方法（使用分子指纹表征的随机森林）性能高出13%。我们得出结论，对于QSAR学习，元学习优于基础学习方法，并且由于这项研究是有史以来对基础学习和元学习方法进行的最广泛比较之一，它为元学习相对于基础学习的总体有效性提供了证据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5d3/6956898/171e31d1fd42/10994_2017_5685_Fig1_HTML.jpg

相似文献

Meta-QSAR: a large-scale application of meta-learning to drug design and discovery.

Mach Learn. 2018;107(1):285-311. doi: 10.1007/s10994-017-5685-x. Epub 2017 Dec 22.

Comprehensive ensemble in QSAR prediction for drug discovery.

BMC Bioinformatics. 2019 Oct 26;20(1):521. doi: 10.1186/s12859-019-3135-4.

An automated framework for QSAR model building.

J Cheminform. 2018 Jan 16;10(1):1. doi: 10.1186/s13321-017-0256-5.

An Analysis of QSAR Research Based on Machine Learning Concepts.

Curr Drug Discov Technol. 2021;18(1):17-30. doi: 10.2174/1570163817666200316104404.

A novel automated lazy learning QSAR (ALL-QSAR) approach: method development, applications, and virtual screening of chemical databases using validated ALL-QSAR models.

J Chem Inf Model. 2006 Sep-Oct;46(5):1984-95. doi: 10.1021/ci060132x.

Quantitative structure-activity relationship methods: perspectives on drug discovery and toxicology.

Environ Toxicol Chem. 2003 Aug;22(8):1666-79. doi: 10.1897/01-171.

QSAR-Based Virtual Screening: Advances and Applications in Drug Discovery.

Front Pharmacol. 2018 Nov 13;9:1275. doi: 10.3389/fphar.2018.01275. eCollection 2018.

Optimal Piecewise Linear Regression Algorithm for QSAR Modelling.

Mol Inform. 2019 Mar;38(3):e1800028. doi: 10.1002/minf.201800028. Epub 2018 Sep 24.

Profile-QSAR: a novel meta-QSAR method that combines activities across the kinase family to accurately predict affinity, selectivity, and cellular activity.

J Chem Inf Model. 2011 Aug 22;51(8):1942-56. doi: 10.1021/ci1005004. Epub 2011 Jul 19.

Targeting HIV/HCV Coinfection Using a Machine Learning-Based Multiple Quantitative Structure-Activity Relationships (Multiple QSAR) Method.

Int J Mol Sci. 2019 Jul 22;20(14):3572. doi: 10.3390/ijms20143572.

引用本文的文献

OpenML: Insights from 10 years and more than a thousand papers.

Patterns (N Y). 2025 Jul 3;6(7):101317. doi: 10.1016/j.patter.2025.101317. eCollection 2025 Jul 11.

Digital Alchemy: The Rise of Machine and Deep Learning in Small-Molecule Drug Discovery.

Int J Mol Sci. 2025 Jul 16;26(14):6807. doi: 10.3390/ijms26146807.

Improved Machine Learning Predictions of EC50s Using Uncertainty Estimation from Dose-Response Data.

J Chem Inf Model. 2025 Jun 9;65(11):5623-5634. doi: 10.1021/acs.jcim.5c00249. Epub 2025 May 19.

Evaluation of Machine Learning Based QSAR Models for the Classification of Lung Surfactant Inhibitors.

Environ Health (Wash). 2024 Sep 20;2(12):912-917. doi: 10.1021/envhealth.4c00118. eCollection 2024 Dec 20.

Machine learning models to identify lead compound and substitution optimization to have derived energetics and conformational stability through docking and MD simulations for sphingosine kinase 1.

Mol Divers. 2025 Aug;29(4):2945-2977. doi: 10.1007/s11030-024-10997-4. Epub 2024 Oct 17.

Cheminformatics and artificial intelligence for accelerating agrochemical discovery.

Front Chem. 2023 Nov 29;11:1292027. doi: 10.3389/fchem.2023.1292027. eCollection 2023.

Batched Bayesian Optimization for Drug Design in Noisy Environments.

J Chem Inf Model. 2022 Sep 12;62(17):3970-3981. doi: 10.1021/acs.jcim.2c00602. Epub 2022 Aug 31.

Deep generative models for peptide design.

Digit Discov. 2022 Mar 31;1(3):195-208. doi: 10.1039/d1dd00024a. eCollection 2022 Jun 13.

Limits of Prediction for Machine Learning in Drug Discovery.

Front Pharmacol. 2022 Mar 10;13:832120. doi: 10.3389/fphar.2022.832120. eCollection 2022.

Transformational machine learning: Learning how to learn from many related scientific problems.

Proc Natl Acad Sci U S A. 2021 Dec 7;118(49). doi: 10.1073/pnas.2108013118.

本文引用的文献

The cost of drug development.

N Engl J Med. 2015 May 14;372(20):1972. doi: 10.1056/NEJMc1504317.

protr/ProtrWeb: R package and web server for generating various numerical representation schemes of protein sequences.

Bioinformatics. 2015 Jun 1;31(11):1857-9. doi: 10.1093/bioinformatics/btv042. Epub 2015 Jan 24.

QSAR modeling: where have you been? Where are you going to?

J Med Chem. 2014 Jun 26;57(12):4977-5010. doi: 10.1021/jm4004285. Epub 2014 Jan 6.

Chemical predictive modelling to improve compound quality.

Nat Rev Drug Discov. 2013 Dec;12(12):948-62. doi: 10.1038/nrd4128.

QSAR workbench: automating QSAR modeling to drive compound design.

J Comput Aided Mol Des. 2013 Apr;27(4):321-36. doi: 10.1007/s10822-013-9648-4. Epub 2013 Apr 25.

Comparison of different approaches to define the applicability domain of QSAR models.

Molecules. 2012 Apr 25;17(5):4791-810. doi: 10.3390/molecules17054791.

Towards a gold standard: regarding quality in public domain chemistry databases and approaches to improving the situation.

Drug Discov Today. 2012 Jul;17(13-14):685-701. doi: 10.1016/j.drudis.2012.02.013. Epub 2012 Mar 8.

The inevitable QSAR renaissance.

J Comput Aided Mol Des. 2012 Jan;26(1):35-8. doi: 10.1007/s10822-011-9495-0. Epub 2011 Nov 30.

The importance of delirium: economic and societal costs.

J Am Geriatr Soc. 2011 Nov;59 Suppl 2(Suppl 2):S241-3. doi: 10.1111/j.1532-5415.2011.03671.x.

The chemical information ontology: provenance and disambiguation for chemical data on the biological semantic web.

PLoS One. 2011;6(10):e25513. doi: 10.1371/journal.pone.0025513. Epub 2011 Oct 3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

元定量构效关系（Meta-QSAR）：元学习在药物设计与发现中的大规模应用。

Meta-QSAR: a large-scale application of meta-learning to drug design and discovery.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译