Combining machine learning and pharmacophore-based interaction fingerprint for in silico screening.

Affiliation

Department of Biophysics and Biochemistry, Graduate School of Science, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan.

Publication information

J Chem Inf Model. 2010 Jan;50(1):170-85. doi: 10.1021/ci900382e.

DOI:10.1021/ci900382e

PMID:20038188

Abstract

In this study, we developed a new pharmacophore-based interaction fingerprint (Pharm-IF) and examined its usefulness for in silico screening using machine learning techniques such as support vector machine (SVM) and random forest (RF) instead of similarity-based ranking. Using the docking results of PKA, SRC, cathepsin K, carbonic anhydrase II, and HIV-1 protease, the screening efficiencies of the Pharm-IF models were compared to GLIDE score and the residue-based IF (PLIF) models. The combination of SVM and Pharm-IF demonstrated a higher enrichment factor at 10% (5.7 on average) than those of GLIDE score (4.2) and PLIF (4.3). In terms of the size of the training sets, learning more than five crystal structures enabled the machine learning models to stably achieve better efficiencies than GLIDE score. We also employed the docking poses of known active compounds, in addition to the crystal structures, as positive samples of training sets. The enrichment factors of the RF models at 10% using the docking poses for SRC and cathepsin K showed significantly higher values (6.5 and 6.3) than those using only the crystal structures (3.9 and 3.2), respectively.
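The abstract reports enrichment factors at 10% but does not spell out the formula. The sketch below uses the commonly cited definition (hit rate among the top-ranked fraction divided by the hit rate in the whole library), which is assumed here rather than taken from the paper. The function name `enrichment_factor` and its parameters are hypothetical, and higher scores are assumed to rank first, so a docking score where lower is better (such as GLIDE score) would need to be negated before calling it.

```python
from typing import Sequence


def enrichment_factor(
    scores: Sequence[float],
    is_active: Sequence[bool],
    fraction: float = 0.10,
) -> float:
    """Enrichment factor for the top `fraction` of a ranked library.

    Assumed definition (not taken from the paper):
        EF = (actives in top / size of top) / (all actives / library size)
    Higher scores are assumed to mean "more likely active".
    """
    if len(scores) != len(is_active):
        raise ValueError("scores and is_active must have the same length")
    n_total = len(scores)
    n_actives = sum(bool(a) for a in is_active)
    if n_total == 0 or n_actives == 0:
        raise ValueError("need a non-empty library with at least one active")
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")

    # Size of the top-ranked subset, at least one compound.
    n_top = max(1, round(n_total * fraction))

    # Rank compounds from best (highest score) to worst.
    ranked = sorted(zip(scores, is_active), key=lambda pair: pair[0], reverse=True)
    hits_in_top = sum(bool(active) for _, active in ranked[:n_top])

    return (hits_in_top / n_top) / (n_actives / n_total)


# Toy library: 20 compounds scored 20 (best) down to 1, four of them active.
toy_scores = [float(s) for s in range(20, 0, -1)]
toy_labels = [i in (0, 4, 9, 15) for i in range(20)]
toy_ef = enrichment_factor(toy_scores, toy_labels, fraction=0.10)
```

Under this definition the enrichment factor at 10% cannot exceed 10, which gives a scale for the averages quoted in the abstract (5.7 for SVM with Pharm-IF versus 4.2 for GLIDE score).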

Similar articles

Combining machine learning and pharmacophore-based interaction fingerprint for in silico screening.

J Chem Inf Model. 2010 Jan;50(1):170-85. doi: 10.1021/ci900382e.

Novel method for generating structure-based pharmacophores using energetic analysis.

J Chem Inf Model. 2009 Oct;49(10):2356-68. doi: 10.1021/ci900212v.

Structure-based approach to pharmacophore identification, in silico screening, and three-dimensional quantitative structure-activity relationship studies for inhibitors of Trypanosoma cruzi dihydrofolate reductase function.

Proteins. 2008 Dec;73(4):889-901. doi: 10.1002/prot.22115.

Virtual screening of cathepsin k inhibitors using docking and pharmacophore models.

Chem Biol Drug Des. 2008 Jul;72(1):79-90. doi: 10.1111/j.1747-0285.2008.00667.x. Epub 2008 May 21.

Large-scale learning of structure-activity relationships using a linear support vector machine and problem-specific metrics.

J Chem Inf Model. 2011 Feb 28;51(2):203-13. doi: 10.1021/ci100073w. Epub 2011 Jan 5.

Improving VEGFR-2 docking-based screening by pharmacophore postfiltering and similarity search postprocessing.

J Chem Inf Model. 2011 Apr 25;51(4):777-87. doi: 10.1021/ci1002763. Epub 2011 Mar 18.

Ligand prediction from protein sequence and small molecule information using support vector machines and fingerprint descriptors.

J Chem Inf Model. 2009 Apr;49(4):767-79. doi: 10.1021/ci900004a.

Fuzzy pharmacophore models from molecular alignments for correlation-vector-based virtual screening.

J Med Chem. 2004 Sep 9;47(19):4653-64. doi: 10.1021/jm031139y.

Classification of cytochrome P450 1A2 inhibitors and noninhibitors by machine learning techniques.

Drug Metab Dispos. 2009 Mar;37(3):658-64. doi: 10.1124/dmd.108.023507. Epub 2008 Dec 4.

Ensemble docking of multiple protein structures: considering protein structural variations in molecular docking.

Proteins. 2007 Feb 1;66(2):399-421. doi: 10.1002/prot.21214.

Cited by

InertDB as a generative AI-expanded resource of biologically inactive small molecules from PubChem.

J Cheminform. 2025 Apr 10;17(1):49. doi: 10.1186/s13321-025-00999-1.

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning.

BMC Biol. 2024 Dec 31;22(1):301. doi: 10.1186/s12915-024-02096-5.

Bridging Structure- and Ligand-Based Virtual Screening through Fragmented Interaction Fingerprint.

ACS Omega. 2024 Sep 3;9(37):38957-38969. doi: 10.1021/acsomega.4c05433. eCollection 2024 Sep 17.

An overview of recent advances and challenges in predicting compound-protein interaction (CPI).

Med Rev (2021). 2023 Oct 6;3(6):465-486. doi: 10.1515/mr-2023-0030. eCollection 2023 Dec.

Mind the Gap-Deciphering GPCR Pharmacology Using 3D Pharmacophores and Artificial Intelligence.

Pharmaceuticals (Basel). 2022 Oct 22;15(11):1304. doi: 10.3390/ph15111304.

Development of machine learning models for the screening of potential HSP90 inhibitors.

Front Mol Biosci. 2022 Oct 19;9:967510. doi: 10.3389/fmolb.2022.967510. eCollection 2022.

Pharmacophore Modeling Using Machine Learning for Screening the Blood-Brain Barrier Permeation of Xenobiotics.

Int J Environ Res Public Health. 2022 Oct 18;19(20):13471. doi: 10.3390/ijerph192013471.

Application of Machine Learning in Developing Quantitative Structure-Property Relationship for Electronic Properties of Polyaromatic Compounds.

ACS Omega. 2022 Jun 17;7(26):22879-22888. doi: 10.1021/acsomega.2c02650. eCollection 2022 Jul 5.

Improved method of structure-based virtual screening based on ensemble learning.

RSC Adv. 2020 Feb 19;10(13):7609-7618. doi: 10.1039/c9ra09211k. eCollection 2020 Feb 18.

ELIXIR-A: An Interactive Visualization Tool for Multi-Target Pharmacophore Refinement.

ACS Omega. 2022 Apr 5;7(15):12707-12715. doi: 10.1021/acsomega.1c07144. eCollection 2022 Apr 19.
