• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度学习的深度全卷积神经网络(DFCNN)在超大规模虚拟筛选中的验证及其在胰蛋白酶I蛋白酶抑制剂发现中的应用

Validation of Deep Learning-Based DFCNN in Extremely Large-Scale Virtual Screening and Application in Trypsin I Protease Inhibitor Discovery.

作者信息

Zhang Haiping, Lin Xiao, Wei Yanjie, Zhang Huiling, Liao Linbu, Wu Hao, Pan Yi, Wu Xuli

机构信息

Center for High Performance Computing, Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.

School of Medicine, Shenzhen University, Shenzhen, China.

出版信息

Front Mol Biosci. 2022 Jun 1;9:872086. doi: 10.3389/fmolb.2022.872086. eCollection 2022.

DOI:10.3389/fmolb.2022.872086
PMID:35720125
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9200220/
Abstract

Computational methods with affordable computational resources are highly desirable for identifying active drug leads from millions of compounds. This requires a model that is both highly efficient and relatively accurate, which cannot be achieved by most of the current methods. In real virtual screening (VS) application scenarios, the desired method should perform much better in selecting active compounds by prediction than by random chance. Here, we systematically evaluate the performance of our previously developed DFCNN model in large-scale virtual screening, and the results show our method has approximately 22 times the success rate compared to the random chance on average with a score cutoff of 0.99. Of the 102 test cases, 10 cases have more than 98 times the success rate of a random guess. Interestingly, in three cases, the prediction success rate is 99 times that of a random guess by a score cutoff of 0.99. This indicates that in most situations after our extremely large-scale VS, the dataset can be reduced 20 to 100 times for the next step of virtual screening based on docking or MD simulation. Furthermore, we have employed an experimental method to verify our computational method by finding several activity inhibitors for Trypsin I Protease. In addition, we also show its proof-of-concept application in drug screening. The results indicate the massive potential of this method in the first step of the real drug development workflow. Moreover, DFCNN only takes about 0.0000225s for one protein-compound prediction on average with 80 Intel CPU cores (2.00 GHz) and 60 GB RAM, which is at least tens of thousands of times faster than AutoDock Vina or Schrödinger high-throughput virtual screening. Additionally, an online webserver based on DFCNN for large-scale screening is available at http://cbblab.siat.ac.cn/DFCNN/index.php for the convenience of the users.

摘要

对于从数百万种化合物中识别出有活性的药物先导物而言,具备可承受计算资源的计算方法是非常理想的。这需要一个既高效又相对准确的模型,而这是目前大多数方法无法实现的。在实际的虚拟筛选(VS)应用场景中,理想的方法在通过预测选择活性化合物方面应比随机选择表现得好得多。在此,我们系统地评估了我们之前开发的DFCNN模型在大规模虚拟筛选中的性能,结果表明,在分数截止值为0.99时,我们的方法平均成功率约为随机选择的22倍。在102个测试案例中,有10个案例的成功率比随机猜测高98倍以上。有趣的是,在三个案例中,分数截止值为0.99时预测成功率是随机猜测的99倍。这表明在我们进行超大规模VS后的大多数情况下,对于基于对接或分子动力学(MD)模拟的下一步虚拟筛选,数据集可以减少20到100倍。此外,我们采用了一种实验方法,通过找到几种胰蛋白酶I蛋白酶的活性抑制剂来验证我们的计算方法。此外,我们还展示了其在药物筛选中的概念验证应用。结果表明该方法在实际药物开发流程的第一步中具有巨大潜力。而且,在配备80个英特尔CPU核心(2.00 GHz)和60 GB内存的情况下,DFCNN平均预测一个蛋白质 - 化合物对仅需约0.0000225秒,这比AutoDock Vina或薛定谔高通量虚拟筛选至少快数万倍。此外,为方便用户,基于DFCNN的大规模筛选在线网络服务器可在http://cbblab.siat.ac.cn/DFCNN/index.php获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/66fa6401b1bc/fmolb-09-872086-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/d20a02d18edf/fmolb-09-872086-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/562ec875aa67/fmolb-09-872086-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/d05191231c50/fmolb-09-872086-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/3d6141b922b3/fmolb-09-872086-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/0c6bd009dca3/fmolb-09-872086-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/86f19a9e58b0/fmolb-09-872086-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/66fa6401b1bc/fmolb-09-872086-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/d20a02d18edf/fmolb-09-872086-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/562ec875aa67/fmolb-09-872086-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/d05191231c50/fmolb-09-872086-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/3d6141b922b3/fmolb-09-872086-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/0c6bd009dca3/fmolb-09-872086-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/86f19a9e58b0/fmolb-09-872086-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f09/9200220/66fa6401b1bc/fmolb-09-872086-g007.jpg

相似文献

1
Validation of Deep Learning-Based DFCNN in Extremely Large-Scale Virtual Screening and Application in Trypsin I Protease Inhibitor Discovery.基于深度学习的深度全卷积神经网络(DFCNN)在超大规模虚拟筛选中的验证及其在胰蛋白酶I蛋白酶抑制剂发现中的应用
Front Mol Biosci. 2022 Jun 1;9:872086. doi: 10.3389/fmolb.2022.872086. eCollection 2022.
2
DeepBindBC: A practical deep learning method for identifying native-like protein-ligand complexes in virtual screening.DeepBindBC:一种实用的深度学习方法,用于在虚拟筛选中识别天然样蛋白-配体复合物。
Methods. 2022 Sep;205:247-262. doi: 10.1016/j.ymeth.2022.07.009. Epub 2022 Jul 22.
3
Generating and screening de novo compounds against given targets using ultrafast deep learning models as core components.利用超快深度学习模型作为核心组件,针对给定的靶标生成和筛选从头化合物。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac226.
4
Accelerating AutoDock Vina with GPUs.使用 GPU 加速 AutoDock Vina。
Molecules. 2022 May 9;27(9):3041. doi: 10.3390/molecules27093041.
5
IVS2vec: A tool of Inverse Virtual Screening based on word2vec and deep learning techniques.IVS2vec:一种基于 word2vec 和深度学习技术的反向虚拟筛选工具。
Methods. 2019 Aug 15;166:57-65. doi: 10.1016/j.ymeth.2019.03.012. Epub 2019 Mar 22.
6
Novel ANOVA-Statistic-Reduced Deep Fully Connected Neural Network for the Damage Grade Prediction of Post-Earthquake Buildings.用于地震后建筑物损伤等级预测的新型方差分析统计量简化深度全连接神经网络
Sensors (Basel). 2023 Jul 16;23(14):6439. doi: 10.3390/s23146439.
7
Calculating an optimal box size for ligand docking and virtual screening against experimental and predicted binding pockets.针对实验性和预测性结合口袋计算用于配体对接和虚拟筛选的最佳盒子大小。
J Cheminform. 2015 May 15;7:18. doi: 10.1186/s13321-015-0067-5. eCollection 2015.
8
Chaos-embedded particle swarm optimization approach for protein-ligand docking and virtual screening.用于蛋白质-配体对接和虚拟筛选的嵌入混沌的粒子群优化方法
J Cheminform. 2018 Dec 14;10(1):62. doi: 10.1186/s13321-018-0320-9.
9
Virtual Screening of Novel Glucosamine-6-Phosphate Synthase Inhibitors.新型6-磷酸葡萄糖胺合酶抑制剂的虚拟筛选
Comb Chem High Throughput Screen. 2018;21(3):182-193. doi: 10.2174/1386207321666180330114457.
10
Consensus scoring criteria for improving enrichment in virtual screening.用于提高虚拟筛选富集度的共识评分标准。
J Chem Inf Model. 2005 Jul-Aug;45(4):1134-46. doi: 10.1021/ci050034w.

引用本文的文献

1
Revolutionizing GPCR-ligand predictions: DeepGPCR with experimental validation for high-precision drug discovery.颠覆 GPCR 配体预测:具有实验验证的 DeepGPCR 助力高精度药物发现。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae281.
2
DeepBindGCN: Integrating Molecular Vector Representation with Graph Convolutional Neural Networks for Protein-Ligand Interaction Prediction.DeepBindGCN:将分子向量表示与图卷积神经网络集成用于蛋白质-配体相互作用预测。
Molecules. 2023 Jun 10;28(12):4691. doi: 10.3390/molecules28124691.

本文引用的文献

1
An Integrated Deep Learning and Molecular Dynamics Simulation-Based Screening Pipeline Identifies Inhibitors of a New Cancer Drug Target TIPE2.一种基于深度学习与分子动力学模拟的整合筛选流程鉴定出新型癌症药物靶点TIPE2的抑制剂。
Front Pharmacol. 2021 Nov 23;12:772296. doi: 10.3389/fphar.2021.772296. eCollection 2021.
2
A novel virtual screening procedure identifies Pralatrexate as inhibitor of SARS-CoV-2 RdRp and it reduces viral replication in vitro.一种新的虚拟筛选程序将普拉曲沙鉴定为 SARS-CoV-2 RdRp 的抑制剂,并在体外降低病毒复制。
PLoS Comput Biol. 2020 Dec 31;16(12):e1008489. doi: 10.1371/journal.pcbi.1008489. eCollection 2020 Dec.
3
Deep Learning Based Drug Screening for Novel Coronavirus 2019-nCov.
基于深度学习的新型冠状病毒 2019-nCov 药物筛选。
Interdiscip Sci. 2020 Sep;12(3):368-376. doi: 10.1007/s12539-020-00376-6. Epub 2020 Jun 1.
4
An open-source drug discovery platform enables ultra-large virtual screens.一个开源药物发现平台可实现超大规模虚拟筛选。
Nature. 2020 Apr;580(7805):663-668. doi: 10.1038/s41586-020-2117-z. Epub 2020 Mar 9.
5
RCSB Protein Data Bank: Enabling biomedical research and drug discovery.RCSB 蛋白质数据库:推动生物医学研究和药物发现。
Protein Sci. 2020 Jan;29(1):52-65. doi: 10.1002/pro.3730. Epub 2019 Nov 29.
6
Convolutional Neural Network of Atomic Surface Structures To Predict Binding Energies for High-Throughput Screening of Catalysts.用于预测催化剂高通量筛选结合能的原子表面结构卷积神经网络
J Phys Chem Lett. 2019 Aug 1;10(15):4401-4408. doi: 10.1021/acs.jpclett.9b01428. Epub 2019 Jul 22.
7
IVS2vec: A tool of Inverse Virtual Screening based on word2vec and deep learning techniques.IVS2vec:一种基于 word2vec 和深度学习技术的反向虚拟筛选工具。
Methods. 2019 Aug 15;166:57-65. doi: 10.1016/j.ymeth.2019.03.012. Epub 2019 Mar 22.
8
Deep reinforcement learning for de novo drug design.基于深度强化学习的从头药物设计。
Sci Adv. 2018 Jul 25;4(7):eaap7885. doi: 10.1126/sciadv.aap7885. eCollection 2018 Jul.
9
Is It Reliable to Take the Molecular Docking Top Scoring Position as the Best Solution without Considering Available Structural Data?不考虑现有结构数据,仅将分子对接得分最高的构象作为最佳解决方案是否可靠?
Molecules. 2018 Apr 28;23(5):1038. doi: 10.3390/molecules23051038.
10
The rise of deep learning in drug discovery.深度学习在药物发现中的崛起。
Drug Discov Today. 2018 Jun;23(6):1241-1250. doi: 10.1016/j.drudis.2018.01.039. Epub 2018 Jan 31.