关于定量构效关系模型的解释与可解释性

On the interpretation and interpretability of quantitative structure-activity relationship models.

作者信息

Guha Rajarshi

机构信息

School of Informatics, Indiana University, Bloomington, IN 47408, USA.

出版信息

J Comput Aided Mol Des. 2008 Dec;22(12):857-71. doi: 10.1007/s10822-008-9240-5. Epub 2008 Sep 11.

DOI:10.1007/s10822-008-9240-5

PMID:18784976

Abstract

The goal of a quantitative structure-activity relationship (QSAR) model is to encode the relationship between molecular structure and biological activity or physical property. Based on this encoding, such models can be used for predictive purposes. Assuming the use of relevant and meaningful descriptors, and a statistically significant model, extraction of the encoded structure-activity relationships (SARs) can provide insight into what makes a molecule active or inactive. Such analyses by QSAR models are useful in a number of scenarios, such as suggesting structural modifications to enhance activity, explanation of outliers and exploratory analysis of novel SARs. In this paper we discuss the need for interpretation and an overview of the factors that affect interpretability of QSAR models. We then describe interpretation protocols for different types of models, highlighting the different types of interpretations, ranging from very broad, global, trends to very specific, case-by-case, descriptions of the SAR, using examples from the training set. Finally, we discuss a number of case studies where workers have provide some form of interpretation of a QSAR model.

摘要

定量构效关系（QSAR）模型的目标是对分子结构与生物活性或物理性质之间的关系进行编码。基于这种编码，此类模型可用于预测目的。假设使用相关且有意义的描述符以及具有统计学意义的模型，提取编码的构效关系（SAR）能够深入了解使分子具有活性或无活性的因素。QSAR模型的此类分析在许多情况下都很有用，例如建议进行结构修饰以增强活性、解释异常值以及对新型SAR进行探索性分析。在本文中，我们讨论了解释的必要性以及影响QSAR模型可解释性的因素概述。然后，我们描述了不同类型模型的解释协议，突出了不同类型的解释，从非常宽泛的全局趋势到非常具体的逐个案例的SAR描述，并使用训练集中的示例进行说明。最后，我们讨论了一些案例研究，其中研究人员对QSAR模型进行了某种形式的解释。

相似文献

On the interpretation and interpretability of quantitative structure-activity relationship models.

J Comput Aided Mol Des. 2008 Dec;22(12):857-71. doi: 10.1007/s10822-008-9240-5. Epub 2008 Sep 11.

QSAR using evolved neural networks for the inhibition of mutant PfDHFR by pyrimethamine derivatives.

Biosystems. 2008 Apr;92(1):10-5. doi: 10.1016/j.biosystems.2007.10.005. Epub 2007 Nov 17.

Free energy force field (FEFF) 3D-QSAR analysis of a set of Plasmodium falciparum dihydrofolate reductase inhibitors.

J Comput Aided Mol Des. 2001 Sep;15(9):787-810. doi: 10.1023/a:1013199108020.

Quantitative structure-activity relationships by evolved neural networks for the inhibition of dihydrofolate reductase by pyrimidines.

Biosystems. 2002 Feb;65(1):37-47. doi: 10.1016/s0303-2647(01)00192-7.

Quantitative structure activity relationship study of 2,4,6-trisubstituted-s-triazine derivatives as antimalarial inhibitors of Plasmodium falciparum dihydrofolate reductase.

Chem Biol Drug Des. 2011 Jan;77(1):57-62. doi: 10.1111/j.1747-0285.2010.01045.x. Epub 2010 Oct 19.

Characterization of dihydrofolate reductases from multiple strains of Plasmodium falciparum using mathematical descriptors of their inhibitors.

Chem Biodivers. 2011 Mar;8(3):440-53. doi: 10.1002/cbdv.201000111.

Three-dimensional quantitative structure-activity relationship analysis of a set of Plasmodium falciparum dihydrofolate reductase inhibitors using a pharmacophore generation approach.

J Med Chem. 2004 Aug 12;47(17):4258-67. doi: 10.1021/jm040769c.

A DFT-based QSAR study on inhibition of human dihydrofolate reductase.

J Mol Graph Model. 2016 Nov;70:23-29. doi: 10.1016/j.jmgm.2016.09.005. Epub 2016 Sep 6.

Structure-based approach to pharmacophore identification, in silico screening, and three-dimensional quantitative structure-activity relationship studies for inhibitors of Trypanosoma cruzi dihydrofolate reductase function.

Proteins. 2008 Dec;73(4):889-901. doi: 10.1002/prot.22115.

A search for sources of drug resistance by the 4D-QSAR analysis of a set of antimalarial dihydrofolate reductase inhibitors.

J Comput Aided Mol Des. 2001 Jan;15(1):1-12. doi: 10.1023/a:1011152818340.

引用本文的文献

Multi-Target In-Silico modeling strategies to discover novel angiotensin converting enzyme and neprilysin dual inhibitors.

Sci Rep. 2024 Jul 10;14(1):15991. doi: 10.1038/s41598-024-66230-7.

Advances, opportunities, and challenges in methods for interrogating the structure activity relationships of natural products.

Nat Prod Rep. 2024 Oct 17;41(10):1543-1578. doi: 10.1039/d4np00009a.

Generalizability Improvement of Interpretable Symbolic Regression Models for Quantitative Structure-Activity Relationships.

ACS Omega. 2024 Feb 16;9(8):9463-9474. doi: 10.1021/acsomega.3c09047. eCollection 2024 Feb 27.

Design, synthesis and evaluation of newer 1,4-dihydropyridine based amlodipine bio-isosteres as promising antihypertensive agents.

RSC Adv. 2023 Nov 22;13(48):34239-34248. doi: 10.1039/d3ra06387a. eCollection 2023 Nov 16.

Guidance for good practice in the application of machine learning in development of toxicological quantitative structure-activity relationships (QSARs).

PLoS One. 2023 May 10;18(5):e0282924. doi: 10.1371/journal.pone.0282924. eCollection 2023.

Machine Learning and Artificial Intelligence in Toxicological Sciences.

Toxicol Sci. 2022 Aug 25;189(1):7-19. doi: 10.1093/toxsci/kfac075.

Progress on open chemoinformatic tools for expanding and exploring the chemical space.

J Comput Aided Mol Des. 2022 May;36(5):341-354. doi: 10.1007/s10822-021-00399-1. Epub 2021 Jun 18.

The kernel-weighted local polynomial regression (KwLPR) approach: an efficient, novel tool for development of QSAR/QSAAR toxicity extrapolation models.

J Cheminform. 2021 Feb 12;13(1):9. doi: 10.1186/s13321-021-00484-5.

Multi-Target Chemometric Modelling, Fragment Analysis and Virtual Screening with ERK Inhibitors as Potential Anticancer Agents.

Molecules. 2019 Oct 30;24(21):3909. doi: 10.3390/molecules24213909.

Development of Multi-Target Chemometric Models for the Inhibition of Class I PI3K Enzyme Isoforms: A Case Study Using QSAR-Co Tool.

Int J Mol Sci. 2019 Aug 27;20(17):4191. doi: 10.3390/ijms20174191.

本文引用的文献

Using multivariate adaptive regression splines to QSAR studies of dihydroartemisinin derivatives.

Eur J Med Chem. 1996;31(10):797-803. doi: 10.1016/0223-5234(96)83973-0.

Scores of extended connectivity fingerprint as descriptors in QSPR study of melting point and aqueous solubility.

J Chem Inf Model. 2008 May;48(5):981-7. doi: 10.1021/ci800024c. Epub 2008 May 9.

Nonlinear support vector machine visualization for risk factor analysis using nomograms and localized radial basis function kernels.

IEEE Trans Inf Technol Biomed. 2008 Mar;12(2):247-56. doi: 10.1109/TITB.2007.902300.

Some new trends in chemical graph theory.

Chem Rev. 2008 Mar;108(3):1127-69. doi: 10.1021/cr0780006. Epub 2008 Feb 27.

Utilizing high throughput screening data for predictive toxicology models: protocols and application to MLSCN assays.

J Comput Aided Mol Des. 2008 Jun-Jul;22(6-7):367-84. doi: 10.1007/s10822-008-9192-9. Epub 2008 Feb 19.

Sharing chemical information without sharing chemical structure.

J Chem Inf Model. 2008 Feb;48(2):256-61. doi: 10.1021/ci600383v. Epub 2008 Feb 7.

QSAR: dead or alive?

J Comput Aided Mol Des. 2008 Feb;22(2):81-9. doi: 10.1007/s10822-007-9162-7. Epub 2008 Jan 9.

Application of quantitative structure-activity relationships to the modeling of antitubercular compounds. 1. The hydrazide family.

J Med Chem. 2008 Feb 14;51(3):612-24. doi: 10.1021/jm701048s. Epub 2008 Jan 5.

The trouble with QSAR (or how I learned to stop worrying and embrace fallacy).

J Chem Inf Model. 2008 Jan;48(1):25-6. doi: 10.1021/ci700332k. Epub 2007 Dec 28.

A composite model for HERG blockade.

ChemMedChem. 2008 Feb;3(2):254-65. doi: 10.1002/cmdc.200700221.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

关于定量构效关系模型的解释与可解释性

On the interpretation and interpretability of quantitative structure-activity relationship models.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献