Assessing the reliability of a QSAR model's predictions.

Author information

He Linnan, Jurs Peter C

Affiliation

Department of Chemistry, The Pennsylvania State University, 104 Chemistry Building, University Park, PA 16802, USA.

Publication information

J Mol Graph Model. 2005 Jun;23(6):503-23. doi: 10.1016/j.jmgm.2005.03.003.

DOI:10.1016/j.jmgm.2005.03.003

PMID:15896992

Abstract

Quantitative structure-activity relationships (QSAR) are one of the well-developed areas of computational chemistry, and many successful predictive models have been developed for property, activity, and toxicity prediction. However, the predictive power of these models for new query compounds, and hence their breadth of applicability, is often not well characterized. In other words, given a QSAR model and a specific query compound, can the model be used reliably for the desired prediction? In this study, we assessed the reliability of QSAR model predictions for query compounds. Our approach, which employs hierarchical clustering, was developed and tested on a dataset of 322 organic compounds with fathead minnow acute aquatic toxicity as the activity of interest. The hypothesis was that a query compound that is more similar to the compounds used to generate the QSAR model should be predicted more accurately. The core of the approach is therefore to determine the relationship between the similarity of query compounds to the model's training set compounds and the prediction accuracy the model achieves. This relationship was determined by comparing the results of the two major components of the approach: object clustering and activity prediction. The information from these two steps showed a direct relationship between similarity and prediction accuracy.
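The two components named in the abstract, clustering and activity prediction, can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the single made-up descriptor, the quadratic "true" activity, the straight-line model, the average-linkage rule, and the choice of two clusters. The abstract does not state the descriptors, clustering settings, or model types the authors used. The sketch clusters training and query compounds together, treats a query as "similar" when its cluster also contains training compounds, and compares the mean absolute prediction error of the two groups.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two descriptor vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def average_linkage(points, n_clusters):
    """Naive agglomerative clustering; returns lists of point indices."""
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > n_clusters:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                total = sum(euclidean(points[a], points[b])
                            for a in clusters[i] for b in clusters[j])
                dist = total / (len(clusters[i]) * len(clusters[j]))
                if best is None or dist < best[0]:
                    best = (dist, i, j)
        _, i, j = best
        clusters[i].extend(clusters[j])  # merge the closest pair
        del clusters[j]                  # j > i, so index i stays valid
    return clusters

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return slope, my - slope * mx

def true_activity(x):
    """Hypothetical activity, used only to generate toy data."""
    return x ** 2

# Hypothetical one-descriptor compounds: training set and query set.
train_x = [i / 10 for i in range(11)]
query_x = [0.25, 0.55, 0.85, 3.0, 3.2, 3.4]

# Stand-in "QSAR model": a straight line fitted to the training set only.
slope, intercept = fit_line(train_x, [true_activity(x) for x in train_x])

# Step 1: cluster training and query compounds together.
points = [(x,) for x in train_x + query_x]
n_train = len(train_x)
clusters = average_linkage(points, n_clusters=2)

# Step 2: predict each query and group its error by cluster membership.
errors = {True: [], False: []}
for cluster in clusters:
    has_training = any(idx < n_train for idx in cluster)
    for idx in cluster:
        if idx >= n_train:
            x = points[idx][0]
            predicted = slope * x + intercept
            errors[has_training].append(abs(predicted - true_activity(x)))

mae_similar = sum(errors[True]) / len(errors[True])
mae_dissimilar = sum(errors[False]) / len(errors[False])
print("MAE, queries clustered with training compounds:", mae_similar)
print("MAE, queries in a cluster without training compounds:", mae_dissimilar)
```

In this toy setup the queries that fall in a cluster with no training compounds are predicted far less accurately, which is the kind of relationship the abstract hypothesizes. A real study would use many molecular descriptors and a validated model, and could use an established clustering library in place of the naive loop above.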

Similar articles

Assessing the reliability of a QSAR model's predictions.

J Mol Graph Model. 2005 Jun;23(6):503-23. doi: 10.1016/j.jmgm.2005.03.003.

Combinatorial QSAR modeling of chemical toxicants tested against Tetrahymena pyriformis.

J Chem Inf Model. 2008 Apr;48(4):766-84. doi: 10.1021/ci700443v. Epub 2008 Mar 1.

Local and global quantitative structure-activity relationship modeling and prediction for the baseline toxicity.

J Chem Inf Model. 2007 Jan-Feb;47(1):159-69. doi: 10.1021/ci600299j.

Critical assessment of QSAR models of environmental toxicity against Tetrahymena pyriformis: focusing on applicability domain and overfitting by variable selection.

J Chem Inf Model. 2008 Sep;48(9):1733-46. doi: 10.1021/ci800151m. Epub 2008 Aug 26.

Application of predictive QSAR models to database mining: identification and experimental validation of novel anticonvulsant compounds.

J Med Chem. 2004 Apr 22;47(9):2356-64. doi: 10.1021/jm030584q.

Prediction of fathead minnow acute toxicity of organic compounds from molecular structure.

Chem Res Toxicol. 1999 Jul;12(7):670-8. doi: 10.1021/tx980273w.

Mode of action-based local QSAR modeling for the prediction of acute toxicity in the fathead minnow.

J Mol Graph Model. 2007 Jul;26(1):327-35. doi: 10.1016/j.jmgm.2006.12.009. Epub 2006 Dec 16.

Unified QSAR approach to antimicrobials. Part 3: first multi-tasking QSAR model for input-coded prediction, structural back-projection, and complex networks clustering of antiprotozoal compounds.

Bioorg Med Chem. 2008 Jun 1;16(11):5871-80. doi: 10.1016/j.bmc.2008.04.068. Epub 2008 Apr 29.

Validation of a QSAR model for acute toxicity.

SAR QSAR Environ Res. 2006 Apr;17(2):147-71. doi: 10.1080/10659360600636253.

Ligand-based virtual screening and in silico design of new antimalarial compounds using nonstochastic and stochastic total and atom-type quadratic maps.

J Chem Inf Model. 2005 Jul-Aug;45(4):1082-100. doi: 10.1021/ci050085t.

Cited by

Electronic Properties of Small Psychotropic Substances in Water: Phenylamines.

ACS Omega. 2025 Aug 15;10(33):37383-37397. doi: 10.1021/acsomega.5c03078. eCollection 2025 Aug 26.

Computer-aided discovery of novel SmDHODH inhibitors for schistosomiasis therapy: Ligand-based drug design, molecular docking, molecular dynamic simulations, drug-likeness, and ADMET studies.

PLoS Negl Trop Dis. 2024 Sep 12;18(9):e0012453. doi: 10.1371/journal.pntd.0012453. eCollection 2024 Sep.

DeepARV: ensemble deep learning to predict drug-drug interaction of clinical relevance with antiretroviral therapy.

NPJ Syst Biol Appl. 2024 May 6;10(1):48. doi: 10.1038/s41540-024-00374-0.

Chapter 9 Molecular Similarity: Advances in Methods, Applications and Validations in Virtual Screening and QSAR.

Annu Rep Comput Chem. 2006;2:141-168. doi: 10.1016/S1574-1400(06)02009-3. Epub 2006 Nov 7.

How Precise Are Our Quantitative Structure-Activity Relationship Derived Predictions for New Query Chemicals?

ACS Omega. 2018 Sep 19;3(9):11392-11406. doi: 10.1021/acsomega.8b01647. eCollection 2018 Sep 30.

Cross-validation pitfalls when selecting and assessing regression and classification models.

J Cheminform. 2014 Mar 29;6(1):10. doi: 10.1186/1758-2946-6-10.

Reliably assessing prediction reliability for high dimensional QSAR data.

Mol Divers. 2013 Feb;17(1):63-73. doi: 10.1007/s11030-012-9415-9. Epub 2012 Dec 19.

Rank order entropy: why one metric is not enough.

J Chem Inf Model. 2011 Sep 26;51(9):2302-19. doi: 10.1021/ci200170k. Epub 2011 Aug 29.

QSAR models for CXCR2 receptor antagonists based on the genetic algorithm for data preprocessing prior to application of the PLS linear regression method and design of the new compounds using in silico virtual screening.

Molecules. 2011 Feb 25;16(3):1928-55. doi: 10.3390/molecules16031928.

DPRESS: Localizing estimates of predictive uncertainty.

J Cheminform. 2009 Jul 14;1(1):11. doi: 10.1186/1758-2946-1-11.
