MILCDock：用于药物发现虚拟筛选的机器学习增强共识对接。

MILCDock: Machine Learning Enhanced Consensus Docking for Virtual Screening in Drug Discovery.

机构信息

Department of Physics and Astronomy, Brigham Young University, Provo, Utah84602, United States.

Department of Computer Science, Brigham Young University, Provo, Utah84602, United States.

出版信息

J Chem Inf Model. 2022 Nov 28;62(22):5342-5350. doi: 10.1021/acs.jcim.2c00705. Epub 2022 Nov 7.

DOI:10.1021/acs.jcim.2c00705

PMID:36342217

Abstract

Molecular docking tools are regularly used to computationally identify new molecules in virtual screening for drug discovery. However, docking tools suffer from inaccurate scoring functions with widely varying performance on different proteins. To enable more accurate ranking of active over inactive ligands in virtual screening, we created a machine learning consensus docking tool, MILCDock, that uses predictions from five traditional molecular docking tools to predict the probability a ligand binds to a protein. MILCDock was trained and tested on data from both the DUD-E and LIT-PCBA docking datasets and shows improved performance over traditional molecular docking tools and other consensus docking methods on the DUD-E dataset. LIT-PCBA targets proved to be difficult for all methods tested. We also find that DUD-E data, although biased, can be effective in training machine learning tools if care is taken to avoid DUD-E's biases during training.

摘要

分子对接工具常用于药物发现的虚拟筛选中，以计算识别新分子。然而，对接工具的评分函数不准确，在不同的蛋白质上性能差异很大。为了在虚拟筛选中更准确地对活性配体和非活性配体进行排序，我们创建了一个机器学习共识对接工具 MILCDock，该工具使用来自五个传统分子对接工具的预测来预测配体与蛋白质结合的概率。MILCDock 在 DUD-E 和 LIT-PCBA 对接数据集上进行了训练和测试，在 DUD-E 数据集上的性能优于传统分子对接工具和其他共识对接方法。对于所有测试的方法来说，LIT-PCBA 靶点都被证明是困难的。我们还发现，尽管 DUD-E 数据存在偏差，但如果在训练过程中注意避免 DUD-E 的偏差，它仍然可以有效地用于训练机器学习工具。

相似文献

MILCDock: Machine Learning Enhanced Consensus Docking for Virtual Screening in Drug Discovery.

J Chem Inf Model. 2022 Nov 28;62(22):5342-5350. doi: 10.1021/acs.jcim.2c00705. Epub 2022 Nov 7.

LIT-PCBA: An Unbiased Data Set for Machine Learning and Virtual Screening.

J Chem Inf Model. 2020 Sep 28;60(9):4263-4273. doi: 10.1021/acs.jcim.0c00155. Epub 2020 Apr 23.

SCORCH: Improving structure-based virtual screening with machine learning classifiers, data augmentation, and uncertainty estimation.

J Adv Res. 2023 Apr;46:135-147. doi: 10.1016/j.jare.2022.07.001. Epub 2022 Jul 25.

Docking Score ML: Target-Specific Machine Learning Models Improving Docking-Based Virtual Screening in 155 Targets.

J Chem Inf Model. 2024 Jul 22;64(14):5413-5426. doi: 10.1021/acs.jcim.4c00072. Epub 2024 Jul 3.

TocoDecoy: A New Approach to Design Unbiased Datasets for Training and Benchmarking Machine-Learning Scoring Functions.

J Med Chem. 2022 Jun 9;65(11):7918-7932. doi: 10.1021/acs.jmedchem.2c00460. Epub 2022 Jun 1.

Protein-Ligand Docking in the Machine-Learning Era.

Molecules. 2022 Jul 18;27(14):4568. doi: 10.3390/molecules27144568.

Towards Effective Consensus Scoring in Structure-Based Virtual Screening.

Interdiscip Sci. 2023 Mar;15(1):131-145. doi: 10.1007/s12539-022-00546-8. Epub 2022 Dec 23.

Improving protein-ligand docking and screening accuracies by incorporating a scoring function correction term.

Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac051.

Machine Learning Consensus Scoring Improves Performance Across Targets in Structure-Based Virtual Screening.

J Chem Inf Model. 2017 Jul 24;57(7):1579-1590. doi: 10.1021/acs.jcim.7b00153. Epub 2017 Jul 12.

PharmRF: A machine-learning scoring function to identify the best protein-ligand complexes for structure-based pharmacophore screening with high enrichments.

J Comput Chem. 2022 May 5;43(12):847-863. doi: 10.1002/jcc.26840. Epub 2022 Mar 18.

引用本文的文献

Resolution of physics and deep learning-based protein engineering filters: A case study with a lipase for industrial substrate hydrolysis.

PLoS One. 2025 Sep 12;20(9):e0332409. doi: 10.1371/journal.pone.0332409. eCollection 2025.

Teaching old docks new tricks with machine learning enhanced ensemble docking.

Sci Rep. 2024 Sep 5;14(1):20722. doi: 10.1038/s41598-024-71699-3.

Integrating QSAR modelling and deep learning in drug discovery: the emergence of deep QSAR.

Nat Rev Drug Discov. 2024 Feb;23(2):141-155. doi: 10.1038/s41573-023-00832-0. Epub 2023 Dec 8.

Big Data and Artificial Intelligence in Drug Discovery for Gastric Cancer: Current Applications and Future Perspectives.

Curr Med Chem. 2025;32(10):1968-1986. doi: 10.2174/0929867331666230913105829.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

MILCDock：用于药物发现虚拟筛选的机器学习增强共识对接。

MILCDock: Machine Learning Enhanced Consensus Docking for Virtual Screening in Drug Discovery.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献