Pandemic drugs at pandemic speed: infrastructure for accelerating COVID-19 drug discovery with hybrid machine learning- and physics-based simulations on high-performance computers.

Author information

Bhati Agastya P, Wan Shunzhou, Alfè Dario, Clyde Austin R, Bode Mathis, Tan Li, Titov Mikhail, Merzky Andre, Turilli Matteo, Jha Shantenu, Highfield Roger R, Rocchia Walter, Scafuri Nicola, Succi Sauro, Kranzlmüller Dieter, Mathias Gerald, Wifling David, Donon Yann, Di Meglio Alberto, Vallecorsa Sofia, Ma Heng, Trifan Anda, Ramanathan Arvind, Brettin Tom, Partin Alexander, Xia Fangfang, Duan Xiaotan, Stevens Rick, Coveney Peter V

Affiliations

Centre for Computational Science, University College London, Gordon Street, London WC1H 0AJ, UK.

Department of Earth Sciences, London Centre for Nanotechnology and Thomas Young Centre at University College London, University College London, Gower Street, London WC1E 6BT, UK.

Publication information

Interface Focus. 2021 Oct 12;11(6):20210018. doi: 10.1098/rsfs.2021.0018. eCollection 2021 Dec 6.

DOI:10.1098/rsfs.2021.0018

PMID:34956592

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8504892/

Abstract

The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. A major bottleneck is screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods, in this case developed for linear accelerators, and physics-based methods. The two methods each have their own advantages and limitations which, interestingly, complement each other. Here, we present an innovative infrastructural development that combines both approaches to accelerate drug discovery. The scale of the potential resulting workflow is such that it depends on supercomputing to achieve extremely high throughput. We have demonstrated the viability of this workflow for the study of inhibitors for four COVID-19 target proteins and our ability to perform the required large-scale calculations to identify lead antiviral compounds through repurposing on a variety of supercomputers.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e694/8504892/4e220802fb03/rsfs20210018f01.jpg

Similar articles

Accelerating COVID-19 Drug Discovery with High-Performance Computing.

Methods Mol Biol. 2024;2716:405-411. doi: 10.1007/978-1-0716-3449-3_19.

Artificial Intelligence for Autonomous Molecular Design: A Perspective.

Molecules. 2021 Nov 9;26(22):6761. doi: 10.3390/molecules26226761.

Machine Learning Applications in Drug Repurposing.

Interdiscip Sci. 2022 Mar;14(1):15-21. doi: 10.1007/s12539-021-00487-8. Epub 2022 Jan 23.

Exploring the artificial intelligence and machine learning models in the context of drug design difficulties and future potential for the pharmaceutical sectors.

Methods. 2023 Nov;219:82-94. doi: 10.1016/j.ymeth.2023.09.010. Epub 2023 Sep 29.

Combining Cloud-Based Free-Energy Calculations, Synthetically Aware Enumerations, and Goal-Directed Generative Machine Learning for Rapid Large-Scale Chemical Exploration and Optimization.

J Chem Inf Model. 2020 Sep 28;60(9):4311-4325. doi: 10.1021/acs.jcim.0c00120. Epub 2020 Jun 19.

Application of artificial intelligence and machine learning in drug repurposing.

Prog Mol Biol Transl Sci. 2024;205:171-211. doi: 10.1016/bs.pmbts.2024.03.030. Epub 2024 Mar 31.

Implementation of an Automated System Using Machine Learning Models to Accelerate the Process of In Silico Identification of Small Molecules As Drug Candidates.

Curr Med Chem. 2024 Oct 10. doi: 10.2174/0109298673334173241003060139.

Integrating Artificial Intelligence for Drug Discovery in the Context of Revolutionizing Drug Delivery.

Life (Basel). 2024 Feb 7;14(2):233. doi: 10.3390/life14020233.

Synergy between machine learning and natural products cheminformatics: Application to the lead discovery of anthraquinone derivatives.

Chem Biol Drug Des. 2022 Aug;100(2):185-217. doi: 10.1111/cbdd.14062. Epub 2022 May 8.

Cited by

Artificial Intelligence Must Be Made More Scientific.

J Chem Inf Model. 2024 Aug 12;64(15):5739-5741. doi: 10.1021/acs.jcim.4c01091. Epub 2024 Jul 27.

Machine Learning Integrating Protein Structure, Sequence, and Dynamics to Predict the Enzyme Activity of Bovine Enterokinase Variants.

J Chem Inf Model. 2024 Apr 8;64(7):2681-2694. doi: 10.1021/acs.jcim.3c00999. Epub 2024 Feb 22.

Accelerating COVID-19 Drug Discovery with High-Performance Computing.

Methods Mol Biol. 2024;2716:405-411. doi: 10.1007/978-1-0716-3449-3_19.

Efficient and Reliable Data Management for Biomedical Applications.

Methods Mol Biol. 2024;2716:383-403. doi: 10.1007/978-1-0716-3449-3_18.

Introduction to Computational Biomedicine.

Methods Mol Biol. 2024;2716:1-13. doi: 10.1007/978-1-0716-3449-3_1.

Drug discovery through Covid-19 genome sequencing with siamese graph convolutional neural network.

Multimed Tools Appl. 2023 May 10:1-35. doi: 10.1007/s11042-023-15270-8.

Structures of the SARS-CoV-2 spike glycoprotein and applications for novel drug development.

Front Pharmacol. 2022 Aug 9;13:955648. doi: 10.3389/fphar.2022.955648. eCollection 2022.

Pre-exascale HPC approaches for molecular dynamics simulations. Covid-19 research: A use case.

Wiley Interdiscip Rev Comput Mol Sci. 2022 May 30:e1622. doi: 10.1002/wcms.1622.

Alchemical Free Energy Estimators and Molecular Dynamics Engines: Accuracy, Precision, and Reproducibility.

J Chem Theory Comput. 2022 Jun 14;18(6):3972-3987. doi: 10.1021/acs.jctc.2c00114. Epub 2022 May 24.

Large Scale Study of Ligand-Protein Relative Binding Free Energy Calculations: Actionable Predictions from Statistically Robust Protocols.

J Chem Theory Comput. 2022 Apr 12;18(4):2687-2702. doi: 10.1021/acs.jctc.1c01288. Epub 2022 Mar 16.
