MEMES：用于增强分子筛选的机器学习框架。

MEMES: Machine learning framework for Enhanced MolEcular Screening.

作者信息

Mehta Sarvesh, Laghuvarapu Siddhartha, Pathak Yashaswi, Sethi Aaftaab, Alvala Mallika, Priyakumar U Deva

机构信息

Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology Hyderabad 500 032 India

Department of Medicinal Chemistry, National Institute of Pharmaceutical Education and Research Hyderabad 500 037 India.

出版信息

Chem Sci. 2021 Jul 26;12(35):11710-11721. doi: 10.1039/d1sc02783b. eCollection 2021 Sep 15.

DOI:10.1039/d1sc02783b

PMID:34659706

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8442698/

Abstract

In drug discovery applications, high throughput virtual screening exercises are routinely performed to determine an initial set of candidate molecules referred to as "hits". In such an experiment, each molecule from a large small-molecule drug library is evaluated in terms of physical properties such as the docking score against a target receptor. In real-life drug discovery experiments, drug libraries are extremely large but still there is only a minor representation of the essentially infinite chemical space, and evaluation of physical properties for each molecule in the library is not computationally feasible. In the current study, a novel Machine learning framework for Enhanced MolEcular Screening (MEMES) based on Bayesian optimization is proposed for efficient sampling of the chemical space. The proposed framework is demonstrated to identify 90% of the top-1000 molecules from a molecular library of size about 100 million, while calculating the docking score only for about 6% of the complete library. We believe that such a framework would tremendously help to reduce the computational effort in not only drug-discovery but also areas that require such high-throughput experiments.

摘要

在药物发现应用中，通常会进行高通量虚拟筛选操作，以确定一组初始的候选分子，即所谓的“命中分子”。在这样的实验中，来自大型小分子药物库的每个分子都会根据诸如与目标受体的对接分数等物理性质进行评估。在实际的药物发现实验中，药物库非常大，但仍然只是本质上无限的化学空间的一小部分，并且对库中每个分子的物理性质进行评估在计算上是不可行的。在当前的研究中，提出了一种基于贝叶斯优化的用于增强分子筛选（MEMES）的新型机器学习框架，以对化学空间进行高效采样。所提出的框架被证明能够从大小约为1亿的分子库中识别出前1000个分子中的90%，同时仅对完整库的约6%计算对接分数。我们相信，这样的框架将极大地有助于减少不仅在药物发现中，而且在需要此类高通量实验的领域中的计算工作量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e0b/8442698/e8e4dccb1f56/d1sc02783b-f1.jpg

相似文献

MEMES: Machine learning framework for Enhanced MolEcular Screening.

Chem Sci. 2021 Jul 26;12(35):11710-11721. doi: 10.1039/d1sc02783b. eCollection 2021 Sep 15.

MO-MEMES: A method for accelerating virtual screening using multi-objective Bayesian optimization.

Front Med (Lausanne). 2022 Sep 23;9:916481. doi: 10.3389/fmed.2022.916481. eCollection 2022.

Machine Learning-Boosted Docking Enables the Efficient Structure-Based Virtual Screening of Giga-Scale Enumerated Chemical Libraries.

J Chem Inf Model. 2023 Sep 25;63(18):5773-5783. doi: 10.1021/acs.jcim.3c01239. Epub 2023 Sep 1.

HIt Discovery using docking ENriched by GEnerative Modeling (HIDDEN GEM): A novel computational workflow for accelerated virtual screening of ultra-large chemical libraries.

Mol Inform. 2024 Jan;43(1):e202300207. doi: 10.1002/minf.202300207. Epub 2023 Dec 19.

Deep Learning with Geometry-Enhanced Molecular Representation for Augmentation of Large-Scale Docking-Based Virtual Screening.

J Chem Inf Model. 2023 Nov 13;63(21):6501-6514. doi: 10.1021/acs.jcim.3c01371. Epub 2023 Oct 26.

NeuralDock: Rapid and Conformation-Agnostic Docking of Small Molecules.

Front Mol Biosci. 2022 Mar 22;9:867241. doi: 10.3389/fmolb.2022.867241. eCollection 2022.

A graph-based approach to construct target-focused libraries for virtual screening.

J Cheminform. 2016 Mar 15;8:14. doi: 10.1186/s13321-016-0126-6. eCollection 2016.

Large-Scale Pretraining Improves Sample Efficiency of Active Learning-Based Virtual Screening.

J Chem Inf Model. 2024 Mar 25;64(6):1882-1891. doi: 10.1021/acs.jcim.3c01938. Epub 2024 Mar 5.

Accelerating high-throughput virtual screening through molecular pool-based active learning.

Chem Sci. 2021 Apr 29;12(22):7866-7881. doi: 10.1039/d0sc06805e.

Efficient Exploration of Chemical Space with Docking and Deep Learning.

J Chem Theory Comput. 2021 Nov 9;17(11):7106-7119. doi: 10.1021/acs.jctc.1c00810. Epub 2021 Sep 30.

引用本文的文献

Linker-GPT: design of Antibody-drug conjugates linkers with molecular generators and reinforcement learning.

Sci Rep. 2025 Jul 1;15(1):20525. doi: 10.1038/s41598-025-05555-3.

Navigating the Expansive Landscapes of Soft Materials: A User Guide for High-Throughput Workflows.

ACS Polym Au. 2023 Dec 5;3(6):406-427. doi: 10.1021/acspolymersau.3c00025. eCollection 2023 Dec 13.

Integrating QSAR modelling and deep learning in drug discovery: the emergence of deep QSAR.

Nat Rev Drug Discov. 2024 Feb;23(2):141-155. doi: 10.1038/s41573-023-00832-0. Epub 2023 Dec 8.

Streamlining pipeline efficiency: a novel model-agnostic technique for accelerating conditional generative and virtual screening pipelines.

Sci Rep. 2023 Nov 29;13(1):21069. doi: 10.1038/s41598-023-42952-y.

Generative Models Should at Least Be Able to Design Molecules That Dock Well: A New Benchmark.

J Chem Inf Model. 2023 Jun 12;63(11):3238-3247. doi: 10.1021/acs.jcim.2c01355. Epub 2023 May 24.

Machine learning for optical chemical multi-analyte imaging : Why we should dare and why it's not without risks.

Anal Bioanal Chem. 2023 Jun;415(14):2749-2761. doi: 10.1007/s00216-023-04678-8. Epub 2023 Apr 18.

MO-MEMES: A method for accelerating virtual screening using multi-objective Bayesian optimization.

Front Med (Lausanne). 2022 Sep 23;9:916481. doi: 10.3389/fmed.2022.916481. eCollection 2022.

SCORCH: Improving structure-based virtual screening with machine learning classifiers, data augmentation, and uncertainty estimation.

J Adv Res. 2023 Apr;46:135-147. doi: 10.1016/j.jare.2022.07.001. Epub 2022 Jul 25.

A transfer learning approach for reaction discovery in small data situations using generative model.

iScience. 2022 Jun 22;25(7):104661. doi: 10.1016/j.isci.2022.104661. eCollection 2022 Jul 15.

本文引用的文献

Generative Models Should at Least Be Able to Design Molecules That Dock Well: A New Benchmark.

J Chem Inf Model. 2023 Jun 12;63(11):3238-3247. doi: 10.1021/acs.jcim.2c01355. Epub 2023 May 24.

MolGPT: Molecular Generation Using a Transformer-Decoder Model.

J Chem Inf Model. 2022 May 9;62(9):2064-2076. doi: 10.1021/acs.jcim.1c00600. Epub 2021 Oct 25.

SCONES: Self-Consistent Neural Network for Protein Stability Prediction Upon Mutation.

J Phys Chem B. 2021 Sep 30;125(38):10657-10671. doi: 10.1021/acs.jpcb.1c04913. Epub 2021 Sep 21.

DeepPocket: Ligand Binding Site Detection and Segmentation using 3D Convolutional Neural Networks.

J Chem Inf Model. 2022 Nov 14;62(21):5069-5079. doi: 10.1021/acs.jcim.1c00799. Epub 2021 Aug 10.

Bayesian reaction optimization as a tool for chemical synthesis.

Nature. 2021 Feb;590(7844):89-96. doi: 10.1038/s41586-021-03213-y. Epub 2021 Feb 3.

Deep learning enabled inorganic material generator.

Phys Chem Chem Phys. 2020 Dec 7;22(46):26935-26943. doi: 10.1039/d0cp03508d.

Neural Network Potential Energy Surfaces for Small Molecules and Reactions.

Chem Rev. 2021 Aug 25;121(16):10187-10217. doi: 10.1021/acs.chemrev.0c00665. Epub 2020 Oct 6.

Can easy chemistry produce complex, diverse, and novel molecules?

Drug Discov Today. 2020 Dec;25(12):2174-2181. doi: 10.1016/j.drudis.2020.09.027. Epub 2020 Oct 1.

Machine Learning for Accurate Force Calculations in Molecular Dynamics Simulations.

J Phys Chem A. 2020 Aug 27;124(34):6954-6967. doi: 10.1021/acs.jpca.0c03926. Epub 2020 Aug 14.

Deep Docking: A Deep Learning Platform for Augmentation of Structure Based Drug Discovery.

ACS Cent Sci. 2020 Jun 24;6(6):939-949. doi: 10.1021/acscentsci.0c00229. Epub 2020 May 19.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

MEMES：用于增强分子筛选的机器学习框架。

MEMES: Machine learning framework for Enhanced MolEcular Screening.

作者信息

Mehta Sarvesh, Laghuvarapu Siddhartha, Pathak Yashaswi, Sethi Aaftaab, Alvala Mallika, Priyakumar U Deva

机构信息

Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology Hyderabad 500 032 India

Department of Medicinal Chemistry, National Institute of Pharmaceutical Education and Research Hyderabad 500 037 India.

出版信息

Chem Sci. 2021 Jul 26;12(35):11710-11721. doi: 10.1039/d1sc02783b. eCollection 2021 Sep 15.

DOI:10.1039/d1sc02783b

PMID:34659706

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8442698/

Abstract

摘要

MEMES：用于增强分子筛选的机器学习框架。

MEMES: Machine learning framework for Enhanced MolEcular Screening.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

MEMES：用于增强分子筛选的机器学习框架。

MEMES: Machine learning framework for Enhanced MolEcular Screening.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献