• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AI 加速的 SARS-CoV-2 蛋白-配体对接速度提高了 100 倍,而检测结果没有明显变化。

AI-accelerated protein-ligand docking for SARS-CoV-2 is 100-fold faster with no significant change in detection.

机构信息

Argonne National Laboratory, Data Science and Learning Division, Chicago, Lemont, 60439, USA.

Department of Computer Science, University of Chicago, Chicago, 60637, USA.

出版信息

Sci Rep. 2023 Feb 6;13(1):2105. doi: 10.1038/s41598-023-28785-9.

DOI:10.1038/s41598-023-28785-9
PMID:36747041
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9901402/
Abstract

Protein-ligand docking is a computational method for identifying drug leads. The method is capable of narrowing a vast library of compounds down to a tractable size for downstream simulation or experimental testing and is widely used in drug discovery. While there has been progress in accelerating scoring of compounds with artificial intelligence, few works have bridged these successes back to the virtual screening community in terms of utility and forward-looking development. We demonstrate the power of high-speed ML models by scoring 1 billion molecules in under a day (50 k predictions per GPU seconds). We showcase a workflow for docking utilizing surrogate AI-based models as a pre-filter to a standard docking workflow. Our workflow is ten times faster at screening a library of compounds than the standard technique, with an error rate less than 0.01% of detecting the underlying best scoring 0.1% of compounds. Our analysis of the speedup explains that another order of magnitude speedup must come from model accuracy rather than computing speed. In order to drive another order of magnitude of acceleration, we share a benchmark dataset consisting of 200 million 3D complex structures and 2D structure scores across a consistent set of 13 million "in-stock" molecules over 15 receptors, or binding sites, across the SARS-CoV-2 proteome. We believe this is strong evidence for the community to begin focusing on improving the accuracy of surrogate models to improve the ability to screen massive compound libraries 100 × or even 1000 × faster than current techniques and reduce missing top hits. The technique outlined aims to be a fast drop-in replacement for docking for screening billion-scale molecular libraries.

摘要

蛋白质配体对接是一种用于识别药物先导物的计算方法。该方法能够将庞大的化合物库缩小到可处理的规模,以便进行下游模拟或实验测试,因此被广泛应用于药物发现领域。虽然在利用人工智能加速化合物评分方面已经取得了进展,但很少有工作在效用和前瞻性发展方面将这些成功与虚拟筛选社区联系起来。我们通过在不到一天的时间内(每个 GPU 秒预测 50 k)对 10 亿个分子进行评分,展示了高速 ML 模型的强大功能。我们展示了一种利用基于 AI 的替代模型进行对接的工作流程,作为标准对接工作流程的预筛选。与标准技术相比,我们的工作流程在筛选化合物库方面的速度快了 10 倍,错误率低于检测到基础最佳评分的 0.01%的化合物的 0.01%。我们对加速的分析表明,另一个数量级的加速必须来自模型的准确性,而不是计算速度。为了实现另一个数量级的加速,我们共享了一个基准数据集,该数据集包含 2 亿个 3D 复合物结构和 2D 结构分数,涵盖了 15 个 SARS-CoV-2 蛋白质组中 15 个受体或结合位点上的 1300 万“现货”分子的一致集合。我们认为,这为社区提供了强有力的证据,开始关注提高替代模型的准确性,以提高筛选大规模化合物库的能力,速度比当前技术快 100 倍甚至 1000 倍,并减少错过顶级命中。概述的技术旨在成为一种快速的对接替代品,用于筛选十亿规模的分子库。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/1d8d8290a874/41598_2023_28785_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/cbb094f9fa71/41598_2023_28785_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/9eaa4c746f4c/41598_2023_28785_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/7d32c6a384ad/41598_2023_28785_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/0bb33af2b79b/41598_2023_28785_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/b0996bd8a965/41598_2023_28785_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/676893c0f74c/41598_2023_28785_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/1d8d8290a874/41598_2023_28785_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/cbb094f9fa71/41598_2023_28785_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/9eaa4c746f4c/41598_2023_28785_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/7d32c6a384ad/41598_2023_28785_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/0bb33af2b79b/41598_2023_28785_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/b0996bd8a965/41598_2023_28785_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/676893c0f74c/41598_2023_28785_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e70/9902441/1d8d8290a874/41598_2023_28785_Fig7_HTML.jpg

相似文献

1
AI-accelerated protein-ligand docking for SARS-CoV-2 is 100-fold faster with no significant change in detection.AI 加速的 SARS-CoV-2 蛋白-配体对接速度提高了 100 倍,而检测结果没有明显变化。
Sci Rep. 2023 Feb 6;13(1):2105. doi: 10.1038/s41598-023-28785-9.
2
Advances in the discovery of new chemotypes through ultra-large library docking.通过超大型文库对接发现新化学型的进展。
Expert Opin Drug Discov. 2023 Mar;18(3):303-313. doi: 10.1080/17460441.2023.2171984. Epub 2023 Feb 2.
3
SARS-CoV2 billion-compound docking.SARS-CoV2 十亿化合物对接。
Sci Data. 2023 Mar 28;10(1):173. doi: 10.1038/s41597-023-01984-9.
4
Artificial intelligence-enabled virtual screening of ultra-large chemical libraries with deep docking.基于深度对接的人工智能辅助超大规模化学库虚拟筛选。
Nat Protoc. 2022 Mar;17(3):672-697. doi: 10.1038/s41596-021-00659-2. Epub 2022 Feb 4.
5
Uni-Dock: GPU-Accelerated Docking Enables Ultralarge Virtual Screening.Uni-Dock:GPU 加速对接实现超大规模虚拟筛选。
J Chem Theory Comput. 2023 Jun 13;19(11):3336-3345. doi: 10.1021/acs.jctc.2c01145. Epub 2023 Apr 26.
6
Identification of RdRp inhibitors against SARS-CoV-2 through E-pharmacophore-based virtual screening, molecular docking and MD simulations approaches.通过基于药效基团的虚拟筛选、分子对接和 MD 模拟方法鉴定针对 SARS-CoV-2 的 RdRp 抑制剂。
Int J Biol Macromol. 2023 May 15;237:124169. doi: 10.1016/j.ijbiomac.2023.124169. Epub 2023 Mar 28.
7
Accelerating AutoDock Vina with GPUs.使用 GPU 加速 AutoDock Vina。
Molecules. 2022 May 9;27(9):3041. doi: 10.3390/molecules27093041.
8
Structure-based virtual screening, in silico docking, ADME properties prediction and molecular dynamics studies for the identification of potential inhibitors against SARS-CoV-2 M.基于结构的虚拟筛选、计算机对接、ADME 性质预测和分子动力学研究,以鉴定针对 SARS-CoV-2 M 的潜在抑制剂。
Mol Divers. 2022 Jun;26(3):1645-1661. doi: 10.1007/s11030-021-10298-0. Epub 2021 Sep 4.
9
Targeting SARS-CoV-2 main protease: structure based virtual screening, in silico ADMET studies and molecular dynamics simulation for identification of potential inhibitors.靶向 SARS-CoV-2 主蛋白酶:基于结构的虚拟筛选、计算机 ADMET 研究和分子动力学模拟,以鉴定潜在的抑制剂。
J Biomol Struct Dyn. 2022 May;40(8):3609-3625. doi: 10.1080/07391102.2020.1848636. Epub 2020 Nov 23.
10
Synthon-based ligand discovery in virtual libraries of over 11 billion compounds.基于合成子的配体发现虚拟库超过 110 亿化合物。
Nature. 2022 Jan;601(7893):452-459. doi: 10.1038/s41586-021-04220-9. Epub 2021 Dec 15.

引用本文的文献

1
Biotransformation of medicarpin from homopterocarpin by Aspergillus niger and its biological characterization.黑曲霉将高紫檀素生物转化为紫铆因及其生物学特性
Sci Rep. 2025 Jul 1;15(1):21371. doi: 10.1038/s41598-025-06729-9.
2
HDBind: encoding of molecular structure with hyperdimensional binary representations.HDBind:采用超维二进制表示法对分子结构进行编码。
Sci Rep. 2024 Nov 23;14(1):29025. doi: 10.1038/s41598-024-80009-w.
3
Optimal Molecular Design: Generative Active Learning Combining REINVENT with Precise Binding Free Energy Ranking Simulations.

本文引用的文献

1
High-Throughput Virtual Screening and Validation of a SARS-CoV-2 Main Protease Noncovalent Inhibitor.高通量虚拟筛选和 SARS-CoV-2 主蛋白酶非共价抑制剂的验证。
J Chem Inf Model. 2022 Jan 10;62(1):116-128. doi: 10.1021/acs.jcim.1c00851. Epub 2021 Nov 18.
2
Drugs repurposed for COVID-19 by virtual screening of 6,218 drugs and cell-based assay.通过对 6218 种药物和基于细胞的检测进行虚拟筛选,为 COVID-19 重新利用药物。
Proc Natl Acad Sci U S A. 2021 Jul 27;118(30). doi: 10.1073/pnas.2024302118.
3
Learning curves for drug response prediction in cancer cell lines.
最优分子设计:结合REINVENT与精确结合自由能排序模拟的生成式主动学习
J Chem Theory Comput. 2024 Sep 3;20(18):8308-28. doi: 10.1021/acs.jctc.4c00576.
4
Correlation of protein binding pocket properties with hits' chemistries used in generation of ultra-large virtual libraries.与生成超大虚拟库时所用命中化合物的化学性质相关的蛋白质结合口袋特性的相关性。
J Comput Aided Mol Des. 2024 May 16;38(1):22. doi: 10.1007/s10822-024-00562-4.
5
Considerations Around Structure-Based Drug Discovery for KRAS Using DOCK.考虑使用 DOCK 进行 KRAS 的基于结构的药物发现。
Methods Mol Biol. 2024;2797:67-90. doi: 10.1007/978-1-0716-3822-4_6.
6
Molecular Property Diagnostic Suite for COVID-19 (MPDS): an open-source disease-specific drug discovery portal.用于COVID-19的分子特性诊断套件(MPDS):一个开源的针对特定疾病的药物发现平台。
GigaByte. 2024 Mar 14;2024:gigabyte114. doi: 10.46471/gigabyte.114. eCollection 2024.
7
Integration of Computational Docking into Anti-Cancer Drug Response Prediction Models.将计算对接技术整合到抗癌药物反应预测模型中。
Cancers (Basel). 2023 Dec 21;16(1):50. doi: 10.3390/cancers16010050.
肿瘤细胞系药物反应预测的学习曲线。
BMC Bioinformatics. 2021 May 17;22(1):252. doi: 10.1186/s12859-021-04163-y.
4
Lean-Docking: Exploiting Ligands' Predicted Docking Scores to Accelerate Molecular Docking.瘦 docking:利用配体预测的 docking 分数来加速分子 docking。
J Chem Inf Model. 2021 May 24;61(5):2341-2352. doi: 10.1021/acs.jcim.0c01452. Epub 2021 Apr 16.
5
An update review of emerging small-molecule therapeutic options for COVID-19.新冠病毒小分子治疗药物的最新研究进展综述。
Biomed Pharmacother. 2021 May;137:111313. doi: 10.1016/j.biopha.2021.111313. Epub 2021 Feb 3.
6
A multi-pronged approach targeting SARS-CoV-2 proteins using ultra-large virtual screening.一种使用超大型虚拟筛选针对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)蛋白的多管齐下方法。
iScience. 2021 Feb 19;24(2):102021. doi: 10.1016/j.isci.2020.102021. Epub 2021 Jan 5.
7
Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models.分子集(MOSES):分子生成模型的基准测试平台。
Front Pharmacol. 2020 Dec 18;11:565644. doi: 10.3389/fphar.2020.565644. eCollection 2020.
8
Supercomputer-Based Ensemble Docking Drug Discovery Pipeline with Application to Covid-19.基于超级计算机的集成对接药物发现管道及其在新冠病毒中的应用。
J Chem Inf Model. 2020 Dec 28;60(12):5832-5852. doi: 10.1021/acs.jcim.0c01010. Epub 2020 Dec 16.
9
SAVI, in silico generation of billions of easily synthesizable compounds through expert-system type rules.SAVI,通过专家系统类型规则在计算机中生成数十亿种易于合成的化合物。
Sci Data. 2020 Nov 11;7(1):384. doi: 10.1038/s41597-020-00727-4.
10
Anti-COVID-19 terpenoid from marine sources: A docking, admet and molecular dynamics study.来自海洋来源的抗新冠病毒萜类化合物:对接、ADMET 和分子动力学研究
J Mol Struct. 2021 Mar 15;1228:129433. doi: 10.1016/j.molstruc.2020.129433. Epub 2020 Oct 10.