Ranking chemical structures for drug discovery: a new machine learning approach.

Affiliation

Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.

Publication information

J Chem Inf Model. 2010 May 24;50(5):716-31. doi: 10.1021/ci9003865.

DOI:10.1021/ci9003865

PMID:20387860

Abstract

With chemical libraries increasingly containing millions of compounds or more, there is a fast-growing need for computational methods that can rank or prioritize compounds for screening. Machine learning methods have shown considerable promise for this task. Classification methods such as support vector machines (SVMs) and their variants have been used in virtual screening to distinguish active compounds from inactive ones, while regression methods such as partial least-squares (PLS) and support vector regression (SVR) have been used in quantitative structure-activity relationship (QSAR) analysis to predict the biological activities of compounds. Recently, a new class of machine learning methods, namely ranking methods, which are designed to directly optimize ranking performance, has been developed for ranking tasks such as web search that arise in information retrieval (IR) and other applications. Here we report the application of these ranking methods to the task of ranking chemical structures. Our experiments show that the new ranking methods give better ranking performance than both the classification-based methods used in virtual screening and the regression methods used in QSAR analysis. We also make some interesting connections between the ranking performance measures used in cheminformatics and those used in IR studies.
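As a rough illustration of what "directly optimizing ranking performance" can mean, the sketch below fits a linear scoring function with a pairwise hinge loss, one common formulation of learning to rank. It is not the authors' implementation: the abstract does not say which ranking algorithms or descriptors were used, and the function names, hyperparameters, and toy data here are all hypothetical.

```python
import random


def train_pairwise_ranker(X, y, epochs=200, lr=0.1, reg=0.01, seed=0):
    """Fit a linear scoring vector w by SGD on a pairwise hinge loss.

    For every pair (i, j) with y[i] > y[j] the loss is
    max(0, 1 - w.(X[i] - X[j])), so training penalizes mis-ordered pairs
    rather than errors in the predicted activity values themselves.
    """
    rng = random.Random(seed)
    dim = len(X[0])
    w = [0.0] * dim
    pairs = [(i, j) for i in range(len(y)) for j in range(len(y)) if y[i] > y[j]]
    for _ in range(epochs):
        rng.shuffle(pairs)
        for i, j in pairs:
            diff = [a - b for a, b in zip(X[i], X[j])]
            margin = sum(wk * dk for wk, dk in zip(w, diff))
            for k in range(dim):
                # Subgradient: L2 shrinkage, plus a push along diff
                # only when the pair violates the margin.
                grad = reg * w[k] - (diff[k] if margin < 1.0 else 0.0)
                w[k] -= lr * grad
    return w


def score(w, x):
    """Linear score; higher means ranked earlier."""
    return sum(wk * xk for wk, xk in zip(w, x))


def pairwise_accuracy(w, X, y):
    """Fraction of pairs with different activities that w orders correctly."""
    pairs = [(i, j) for i in range(len(y)) for j in range(len(y)) if y[i] > y[j]]
    correct = sum(score(w, X[i]) > score(w, X[j]) for i, j in pairs)
    return correct / len(pairs)


# Toy data: hypothetical 2-D descriptors and activities (not from the paper).
X = [[0.1, 1.0], [0.4, 0.8], [0.5, 0.3], [0.9, 0.2]]
y = [1.0, 2.0, 3.0, 4.0]

w = train_pairwise_ranker(X, y)
ranking = sorted(range(len(X)), key=lambda i: score(w, X[i]), reverse=True)
```

The contrast with regression is that the loss depends only on the relative order of compounds, not on how closely each score matches the measured activity. A classifier, by comparison, would be trained only to separate actives from inactives and would ignore the ordering within each group.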

Similar articles

StructRank: a new approach for ligand-based virtual screening.

J Chem Inf Model. 2011 Jan 24;51(1):83-92. doi: 10.1021/ci100308f. Epub 2010 Dec 17.

Application of support vector machine-based ranking strategies to search for target-selective compounds.

Methods Mol Biol. 2011;672:517-30. doi: 10.1007/978-1-60761-839-3_21.

Improvement of multivariate image analysis applied to quantitative structure-activity relationship (QSAR) analysis by using wavelet-principal component analysis ranking variable selection and least-squares support vector machine regression: QSAR study of checkpoint kinase WEE1 inhibitors.

Chem Biol Drug Des. 2009 Feb;73(2):244-52. doi: 10.1111/j.1747-0285.2008.00764.x.

Prediction of antibacterial compounds by machine learning approaches.

J Comput Chem. 2009 Jun;30(8):1202-11. doi: 10.1002/jcc.21148.

Large-scale learning of structure-activity relationships using a linear support vector machine and problem-specific metrics.

J Chem Inf Model. 2011 Feb 28;51(2):203-13. doi: 10.1021/ci100073w. Epub 2011 Jan 5.

Molecule kernels: a descriptor- and alignment-free quantitative structure-activity relationship approach.

J Chem Inf Model. 2008 Sep;48(9):1868-81. doi: 10.1021/ci800144y. Epub 2008 Sep 4.

Ligand-based virtual screening and in silico design of new antimalarial compounds using nonstochastic and stochastic total and atom-type quadratic maps.

J Chem Inf Model. 2005 Jul-Aug;45(4):1082-100. doi: 10.1021/ci050085t.

Computational study of CCR5 antagonist with support vector machines and three dimensional quantitative structure activity relationship methods.

Chem Biol Drug Des. 2010 Mar;75(3):295-309. doi: 10.1111/j.1747-0285.2009.00935.x.

A support vector machine using the lazy learning approach for multi-class classification.

J Med Eng Technol. 2006 Mar-Apr;30(2):73-7. doi: 10.1080/03091900500095729.

Cited by

Novel target identification towards drug repurposing based on biological activity profiles.

PLoS One. 2025 May 6;20(5):e0319865. doi: 10.1371/journal.pone.0319865. eCollection 2025.

Label Transfer for Drug Disease Association in Three Meta-Paths.

Evol Bioinform Online. 2024 Sep 13;20:11769343241272414. doi: 10.1177/11769343241272414. eCollection 2024.

Leveraging bounded datapoints to classify molecular potency improvements.

RSC Med Chem. 2024 May 31;15(7):2474-2482. doi: 10.1039/d4md00325j. eCollection 2024 Jul 17.

The Histone Deacetylase Family: Structural Features and Application of Combined Computational Methods.

Pharmaceuticals (Basel). 2024 May 10;17(5):620. doi: 10.3390/ph17050620.

Artificial Intelligence Technologies for COVID-19 De Novo Drug Design.

Int J Mol Sci. 2022 Mar 17;23(6):3261. doi: 10.3390/ijms23063261.

Artificial Intelligence for Autonomous Molecular Design: A Perspective.

Molecules. 2021 Nov 9;26(22):6761. doi: 10.3390/molecules26226761.

Ranking-Oriented Quantitative Structure-Activity Relationship Modeling Combined with Assay-Wise Data Integration.

ACS Omega. 2021 Apr 28;6(18):11964-11973. doi: 10.1021/acsomega.1c00463. eCollection 2021 May 11.

Discovery of SARS-CoV-2 main protease inhibitors using a synthesis-directed design model.

Chem Commun (Camb). 2021 Jun 15;57(48):5909-5912. doi: 10.1039/d1cc00050k.

Drug-Target Interaction Prediction Based on Adversarial Bayesian Personalized Ranking.

Biomed Res Int. 2021 Feb 10;2021:6690154. doi: 10.1155/2021/6690154. eCollection 2021.

Cognitive biomarker prioritization in Alzheimer's Disease using brain morphometric data.

BMC Med Inform Decis Mak. 2020 Dec 2;20(1):319. doi: 10.1186/s12911-020-01339-z.
