• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一刀切并不适用:基于结构的模型在药物发现中的局限性。

One Size Does Not Fit All: The Limits of Structure-Based Models in Drug Discovery.

作者信息

Ross Gregory A, Morris Garrett M, Biggin Philip C

机构信息

Structural Bioinformatics and Computational Biochemistry, Department of Biochemistry, University of Oxford , South Parks Road, Oxford, Oxfordshire OX1 3QU, United Kingdom.

出版信息

J Chem Theory Comput. 2013 Sep 10;9(9):4266-4274. doi: 10.1021/ct4004228. Epub 2013 Aug 5.

DOI:10.1021/ct4004228
PMID:24124403
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3793897/
Abstract

A major goal in computational chemistry has been to discover the set of rules that can accurately predict the binding affinity of any protein-drug complex, using only a single snapshot of its three-dimensional structure. Despite the continual development of structure-based models, predictive accuracy remains low, and the fundamental factors that inhibit the inference of all-encompassing rules have yet to be fully explored. Using statistical learning theory and information theory, here we prove that even the very best generalized structure-based model is inherently limited in its accuracy, and protein-specific models are always likely to be better. Our results refute the prevailing assumption that large data sets and advanced machine learning techniques will yield accurate, universally applicable models. We anticipate that the results will aid the development of more robust virtual screening strategies and scoring function error estimations.

摘要

计算化学的一个主要目标是发现一套规则,仅使用蛋白质-药物复合物三维结构的单个快照,就能准确预测其结合亲和力。尽管基于结构的模型不断发展,但预测准确性仍然很低,抑制推导全面规则的基本因素尚未得到充分探索。利用统计学习理论和信息论,我们在此证明,即使是最好的广义基于结构的模型,其准确性也存在固有局限性,特定于蛋白质的模型往往可能更好。我们的结果驳斥了一种普遍的假设,即大数据集和先进的机器学习技术将产生准确的、普遍适用的模型。我们预计这些结果将有助于开发更强大的虚拟筛选策略和评分函数误差估计。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b3d/3793897/6a7b70fe6711/ct-2013-004228_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b3d/3793897/d1ba14112237/ct-2013-004228_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b3d/3793897/335634d3034c/ct-2013-004228_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b3d/3793897/5985511e8e29/ct-2013-004228_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b3d/3793897/6a7b70fe6711/ct-2013-004228_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b3d/3793897/d1ba14112237/ct-2013-004228_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b3d/3793897/335634d3034c/ct-2013-004228_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b3d/3793897/5985511e8e29/ct-2013-004228_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b3d/3793897/6a7b70fe6711/ct-2013-004228_0004.jpg

相似文献

1
One Size Does Not Fit All: The Limits of Structure-Based Models in Drug Discovery.一刀切并不适用:基于结构的模型在药物发现中的局限性。
J Chem Theory Comput. 2013 Sep 10;9(9):4266-4274. doi: 10.1021/ct4004228. Epub 2013 Aug 5.
2
Machine learning in computational docking.计算对接中的机器学习。
Artif Intell Med. 2015 Mar;63(3):135-52. doi: 10.1016/j.artmed.2015.02.002. Epub 2015 Feb 16.
3
Application of Machine Learning Techniques to Predict Binding Affinity for Drug Targets: A Study of Cyclin-Dependent Kinase 2.机器学习技术在预测药物靶点结合亲和力中的应用:以细胞周期蛋白依赖性激酶 2 为例的研究。
Curr Med Chem. 2021;28(2):253-265. doi: 10.2174/2213275912666191102162959.
4
Improving the accuracy of high-throughput protein-protein affinity prediction may require better training data.提高高通量蛋白质-蛋白质亲和力预测的准确性可能需要更好的训练数据。
BMC Bioinformatics. 2017 Mar 23;18(Suppl 5):102. doi: 10.1186/s12859-017-1533-z.
5
A Comparative Assessment of Predictive Accuracies of Conventional and Machine Learning Scoring Functions for Protein-Ligand Binding Affinity Prediction.传统评分函数与机器学习评分函数在蛋白质-配体结合亲和力预测中的预测准确性比较评估
IEEE/ACM Trans Comput Biol Bioinform. 2015 Mar-Apr;12(2):335-47. doi: 10.1109/TCBB.2014.2351824.
6
BgN-Score and BsN-Score: bagging and boosting based ensemble neural networks scoring functions for accurate binding affinity prediction of protein-ligand complexes.BgN分数和BsN分数:基于装袋法和提升法的集成神经网络评分函数,用于准确预测蛋白质-配体复合物的结合亲和力。
BMC Bioinformatics. 2015;16 Suppl 4(Suppl 4):S8. doi: 10.1186/1471-2105-16-S4-S8. Epub 2015 Feb 23.
7
NMR-based screening methods for lead discovery.基于核磁共振的先导化合物发现筛选方法。
EXS. 2003(93):183-202. doi: 10.1007/978-3-0348-7997-2_9.
8
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
9
Empirical Scoring Functions for Structure-Based Virtual Screening: Applications, Critical Aspects, and Challenges.基于结构的虚拟筛选的经验评分函数:应用、关键方面及挑战
Front Pharmacol. 2018 Sep 24;9:1089. doi: 10.3389/fphar.2018.01089. eCollection 2018.
10
PHOENIX: a scoring function for affinity prediction derived using high-resolution crystal structures and calorimetry measurements.PHOENIX:一种基于高分辨率晶体结构和量热测量数据的亲和力预测打分函数。
J Chem Inf Model. 2011 Feb 28;51(2):214-28. doi: 10.1021/ci100257s. Epub 2011 Jan 7.

引用本文的文献

1
Exploring Scoring Function Space: Developing Computational Models for Drug Discovery.探索评分函数空间:开发药物发现的计算模型。
Curr Med Chem. 2024;31(17):2361-2377. doi: 10.2174/0929867330666230321103731.
2
Sunsetting Binding MOAD with its last data update and the addition of 3D-ligand polypharmacology tools.停用 Binding MOAD,最后一次更新数据,并增加 3D 配体多药效学工具。
Sci Rep. 2023 Feb 21;13(1):3008. doi: 10.1038/s41598-023-29996-w.
3
Chemoinformatics and artificial intelligence colloquium: progress and challenges in developing bioactive compounds.

本文引用的文献

1
Automated Optimization of Potential Parameters.潜在参数的自动优化
J Chem Theory Comput. 2013 Aug 13;9(8):3311-3320. doi: 10.1021/ct400313n.
2
Calculating the sensitivity and robustness of binding free energy calculations to force field parameters.计算结合自由能计算对力场参数的敏感性和稳健性。
J Chem Theory Comput. 2013 Jul 9;9(7):3072-3083. doi: 10.1021/ct400315q.
3
The Effects of Computational Modeling Errors on the Estimation of Statistical Mechanical Variables.计算建模误差对统计力学变量估计的影响。
化学信息学与人工智能研讨会:生物活性化合物开发的进展与挑战
J Cheminform. 2022 Dec 2;14(1):82. doi: 10.1186/s13321-022-00661-0.
4
Novel computational pipelines in antiviral structure‑based drug design (Review).基于抗病毒结构的药物设计中的新型计算流程(综述)
Biomed Rep. 2022 Oct 24;17(6):97. doi: 10.3892/br.2022.1580. eCollection 2022 Dec.
5
Scoring Functions for Protein-Ligand Binding Affinity Prediction using Structure-Based Deep Learning: A Review.基于结构的深度学习预测蛋白质-配体结合亲和力的评分函数综述
Front Bioinform. 2022 Jun 17;2. doi: 10.3389/fbinf.2022.885983.
6
Deciphering the language of antibodies using self-supervised learning.利用自监督学习破解抗体语言。
Patterns (N Y). 2022 May 18;3(7):100513. doi: 10.1016/j.patter.2022.100513. eCollection 2022 Jul 8.
7
An pipeline for the discovery of multitarget ligands: A case study for epi-polypharmacology based on DNMT1/HDAC2 inhibition.一种多靶点配体发现的流程:基于DNMT1/HDAC2抑制的表观多药理学案例研究
Artif Intell Life Sci. 2021 Dec;1. doi: 10.1016/j.ailsci.2021.100008. Epub 2021 Sep 12.
8
A virtual drug-screening approach to conquer huge chemical libraries.一种攻克海量化学文库的虚拟药物筛选方法。
Nature. 2022 Jan;601(7893):322-323. doi: 10.1038/d41586-021-03682-1.
9
Learning protein-ligand binding affinity with atomic environment vectors.利用原子环境向量学习蛋白质-配体结合亲和力。
J Cheminform. 2021 Aug 14;13(1):59. doi: 10.1186/s13321-021-00536-w.
10
Machine learning and AI-based approaches for bioactive ligand discovery and GPCR-ligand recognition.基于机器学习和人工智能的生物活性配体发现和 GPCR 配体识别方法。
Methods. 2020 Aug 1;180:89-110. doi: 10.1016/j.ymeth.2020.06.016. Epub 2020 Jul 6.
J Chem Theory Comput. 2012 Oct 9;8(10):3769-3776. doi: 10.1021/ct300024z. Epub 2012 Mar 12.
4
Automated design of ligands to polypharmacological profiles.配体的多药效特性自动化设计。
Nature. 2012 Dec 13;492(7428):215-20. doi: 10.1038/nature11691.
5
Efficient discovery of potent anti-HIV agents targeting the Tyr181Cys variant of HIV reverse transcriptase.高效发现针对 HIV 逆转录酶 Tyr181Cys 变异体的强效抗 HIV 药物。
J Am Chem Soc. 2011 Oct 5;133(39):15686-96. doi: 10.1021/ja2058583. Epub 2011 Sep 9.
6
CSAR benchmark exercise of 2010: combined evaluation across all submitted scoring functions.2010 年的 CSAR 基准测试练习:所有提交的评分函数的综合评估。
J Chem Inf Model. 2011 Sep 26;51(9):2115-31. doi: 10.1021/ci200269q. Epub 2011 Aug 29.
7
Formal Estimation of Errors in Computed Absolute Interaction Energies of Protein-ligand Complexes.蛋白质-配体复合物计算绝对相互作用能中误差的形式化估计
J Chem Theory Comput. 2011 Mar 8;7(3):790-797. doi: 10.1021/ct100563b.
8
Leave-cluster-out cross-validation is appropriate for scoring functions derived from diverse protein data sets.留群外交叉验证适用于从不同蛋白质数据集得出的评分函数。
J Chem Inf Model. 2010 Nov 22;50(11):1961-9. doi: 10.1021/ci100264e. Epub 2010 Oct 12.
9
NNScore: a neural-network-based scoring function for the characterization of protein-ligand complexes.NNScore:一种基于神经网络的打分函数,用于描述蛋白质-配体复合物。
J Chem Inf Model. 2010 Oct 25;50(10):1865-71. doi: 10.1021/ci100244v.
10
Scoring functions and their evaluation methods for protein-ligand docking: recent advances and future directions.蛋白质-配体对接的评分函数及其评价方法:最新进展和未来方向。
Phys Chem Chem Phys. 2010 Oct 28;12(40):12899-908. doi: 10.1039/c0cp00151a. Epub 2010 Aug 23.