基于结构的药物筛选以及结合机器学习的基于配体的药物筛选。

Structure-based drug screening and ligand-based drug screening with machine learning.

作者信息

Fukunishi Yoshifumi

机构信息

Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-41-6 Aomi, Koto-ku, Tokyo, Japan.

出版信息

Comb Chem High Throughput Screen. 2009 May;12(4):397-408. doi: 10.2174/138620709788167890.

DOI:10.2174/138620709788167890

PMID:19442067

Abstract

The initial stage of drug development is the hit (active) compound search from a pool of millions of compounds; for this process, in silico (virtual) screening has been successfully applied. One of the problems of in silico screening, however, is the low hit ratio in relation to the high computational cost and the long CPU time. This problem becomes serious in structure-based in silico screening. The major reason is the low accuracy of the estimation of protein-compound binding free energy. The problem of ligand-based in silico screening is that the conventional quantitative structure-activity relationship (QSAR) approach is not effective at predicting new hit compounds with new scaffolds. Recently, machine-learning approaches have been applied to in silico drug screening to overcome the above problems. We review here machine-learning approaches for both structure-based and ligand-based drug screening. Machine learning is used to improve database enrichment in two ways, namely by improving the docking score calculated by the protein-compound docking program and by calculating the optimal distance between the feature vectors of active and inactive compounds. Both approaches require compounds that are known to be active with respect to the target protein. In structure-based screening, the former approach is mainly used with a protein-compound affinity matrix. In ligand-based screening, both the former and latter approaches are used, and the latter approach can be applied to various kinds of descriptors, such as 1D/2D descriptors/fingerprints and the affinity fingerprint given by the protein-compound affinity matrix.

摘要

药物研发的初始阶段是从数百万种化合物中寻找活性化合物；在此过程中，计算机虚拟筛选已得到成功应用。然而，计算机虚拟筛选存在的一个问题是，与高计算成本和长CPU时间相关的命中率较低。在基于结构的计算机虚拟筛选中，这个问题变得更加严重。主要原因是蛋白质-化合物结合自由能估计的准确性较低。基于配体的计算机虚拟筛选的问题在于，传统的定量构效关系（QSAR）方法在预测具有新骨架的新活性化合物方面效果不佳。最近，机器学习方法已被应用于计算机虚拟药物筛选，以克服上述问题。我们在此回顾基于结构和基于配体的药物筛选的机器学习方法。机器学习用于通过两种方式改善数据库富集，即通过改进蛋白质-化合物对接程序计算的对接分数，以及通过计算活性和非活性化合物特征向量之间的最佳距离。这两种方法都需要已知对目标蛋白有活性的化合物。在基于结构的筛选中，前一种方法主要与蛋白质-化合物亲和力矩阵一起使用。在基于配体的筛选中，两种方法都被使用，后一种方法可以应用于各种描述符，如1D/2D描述符/指纹以及由蛋白质-化合物亲和力矩阵给出的亲和力指纹。

相似文献

Structure-based drug screening and ligand-based drug screening with machine learning.

Comb Chem High Throughput Screen. 2009 May;12(4):397-408. doi: 10.2174/138620709788167890.

Comparative analysis of machine learning methods in ligand-based virtual screening of large compound libraries.

Comb Chem High Throughput Screen. 2009 May;12(4):344-57. doi: 10.2174/138620709788167944.

Performance of machine learning methods for ligand-based virtual screening.

Comb Chem High Throughput Screen. 2009 May;12(4):358-68. doi: 10.2174/138620709788167962.

Machine learning for virtual screening (part 1).

Comb Chem High Throughput Screen. 2009 May;12(4):330-1. doi: 10.2174/138620709788167999.

Noise reduction method for molecular interaction energy: application to in silico drug screening and in silico target protein screening.

J Chem Inf Model. 2006 Sep-Oct;46(5):2071-84. doi: 10.1021/ci060152z.

Virtual high-throughput screening of molecular databases.

Curr Opin Drug Discov Devel. 2007 May;10(3):298-307.

Virtual screening with support vector machines and structure kernels.

Comb Chem High Throughput Screen. 2009 May;12(4):409-23. doi: 10.2174/138620709788167926.

Multiple target screening method for robust and accurate in silico ligand screening.

J Mol Graph Model. 2006 Sep;25(1):61-70. doi: 10.1016/j.jmgm.2005.11.006. Epub 2005 Dec 22.

An efficient in silico screening method based on the protein-compound affinity matrix and its application to the design of a focused library for cytochrome P450 (CYP) ligands.

J Chem Inf Model. 2006 Nov-Dec;46(6):2610-22. doi: 10.1021/ci600334u.

Machine learning in virtual screening.

Comb Chem High Throughput Screen. 2009 May;12(4):332-43. doi: 10.2174/138620709788167980.

引用本文的文献

Artificial intelligence-driven prediction and validation of blood-brain barrier permeability and absorption, distribution, metabolism, excretion profiles in natural product research laboratory compounds.

Biomedicine (Taipei). 2024 Dec 1;14(4):82-91. doi: 10.37796/2211-8039.1474. eCollection 2024.

Binding free-energy landscapes of small molecule binder and non-binder to FMN riboswitch: All-atom molecular dynamics.

Biophys Physicobiol. 2023 Dec 13;20(4):e200047. doi: 10.2142/biophysico.bppb-v20.0047. eCollection 2023.

Artificial intelligence and machine-learning approaches in structure and ligand-based discovery of drugs affecting central nervous system.

Mol Divers. 2023 Apr;27(2):959-985. doi: 10.1007/s11030-022-10489-3. Epub 2022 Jul 11.

ACS Omega. 2022 Feb 3;7(6):4769-4786. doi: 10.1021/acsomega.1c04587. eCollection 2022 Feb 15.

Molecules. 2021 Nov 3;26(21):6669. doi: 10.3390/molecules26216669.

How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques.

Molecules. 2020 Mar 23;25(6):1452. doi: 10.3390/molecules25061452.

Free-energy landscape of molecular interactions between endothelin 1 and human endothelin type B receptor: fly-casting mechanism.

Protein Eng Des Sel. 2019 Dec 31;32(7):297-308. doi: 10.1093/protein/gzz029.

SimCAL: a flexible tool to compute biochemical reaction similarity.

BMC Bioinformatics. 2018 Jul 3;19(1):254. doi: 10.1186/s12859-018-2248-5.

Prospective evaluation of shape similarity based pose prediction method in D3R Grand Challenge 2015.

J Comput Aided Mol Des. 2016 Sep;30(9):685-693. doi: 10.1007/s10822-016-9931-2. Epub 2016 Aug 2.

A pose prediction approach based on ligand 3D shape similarity.

J Comput Aided Mol Des. 2016 Jun;30(6):457-69. doi: 10.1007/s10822-016-9923-2. Epub 2016 Jul 5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于结构的药物筛选以及结合机器学习的基于配体的药物筛选。

Structure-based drug screening and ligand-based drug screening with machine learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献