• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人工智能定量构效关系(aiQSAR)的方法:一种用于定量构效关系建模的特定基团方法。

Methodology of aiQSAR: a group-specific approach to QSAR modelling.

作者信息

Vukovic Kristijan, Gadaleta Domenico, Benfenati Emilio

机构信息

Istituto di Ricerche Farmacologiche Mario Negri-IRCCS, Via Mario Negri 2, 20156, Milan, Italy.

Jozef Stefan International Postgraduate School, Jamova cesta 39, 1000, Ljubljana, Slovenia.

出版信息

J Cheminform. 2019 Apr 3;11(1):27. doi: 10.1186/s13321-019-0350-y.

DOI:10.1186/s13321-019-0350-y
PMID:30945010
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6446381/
Abstract

BACKGROUND

Several QSAR methodology developments have shown promise in recent years. These include the consensus approach to generate the final prediction of a model, utilizing new, advanced machine learning algorithms and streamlining, standardization and automation of various QSAR steps. One approach that seems under-explored is at-the-runtime generation of local models specific to individual compounds. This approach was quite likely limited by the computational requirements, but with current increases in processing power and the widespread availability of cluster-computing infrastructure, this limitation is no longer that severe.

RESULTS

We propose a new QSAR methodology: aiQSAR, whose aim is to generate endpoint predictions directly from the input dataset by building an array of local models generated at-the-runtime and specific for each compound in the dataset. The local group of each compound is selected on the basis of fingerprint similarities and the final prediction is calculated by integrating the results of a number of autonomous mathematical models. The method is applicable to regression, binary classification and multi-class classification and was tested on one dataset for each endpoint type: bioconcentration factor (BCF) for regression, Ames test for binary classification and Environmental Protection Agency (EPA) acute rat oral toxicity ranking for multi-class classification. As part of this method, the applicability domain of each prediction is assessed through the applicability domain measure, calculated on the basis of the fingerprint similarities in each local group of compounds.

CONCLUSIONS

We outline the methodology for a new QSAR-based predictive tool whose advantages are automation, group-specific approach to modelling and simplicity of execution. Our aim now will be to develop this method into a stand-alone software tool. We hope that eventual adoption of our tool would make QSAR modelling more accessible and transparent. Our methodology could be used as an initial modelling step, to predict new compounds by simply loading the training dataset as an input. Predictions could then be further evaluated and refined either by other tools or through optimization of aiQSAR parameters.

摘要

背景

近年来,几种定量构效关系(QSAR)方法的发展显示出了前景。这些包括用于生成模型最终预测的共识方法,利用新的、先进的机器学习算法以及简化、标准化和自动化各种QSAR步骤。一种似乎未被充分探索的方法是在运行时生成特定于单个化合物的局部模型。这种方法很可能受到计算要求的限制,但随着当前处理能力的提高和集群计算基础设施的广泛可用,这种限制不再那么严重。

结果

我们提出了一种新的QSAR方法:aiQSAR,其目的是通过构建在运行时生成的、特定于数据集中每个化合物的局部模型数组,直接从输入数据集中生成终点预测。每个化合物的局部组基于指纹相似性进行选择,最终预测通过整合多个自主数学模型的结果来计算。该方法适用于回归、二元分类和多类分类,并针对每种终点类型在一个数据集上进行了测试:用于回归的生物富集因子(BCF)、用于二元分类的艾姆斯试验以及用于多类分类的美国环境保护局(EPA)急性大鼠经口毒性排名。作为该方法的一部分,通过基于每个局部化合物组中的指纹相似性计算的适用性域度量来评估每个预测的适用性域。

结论

我们概述了一种基于QSAR的新预测工具的方法,其优点是自动化、针对建模的组特异性方法和执行的简单性。我们现在的目标是将该方法开发成一个独立的软件工具。我们希望最终采用我们的工具将使QSAR建模更易于使用和透明。我们的方法可以用作初始建模步骤,通过简单地加载训练数据集作为输入来预测新化合物。然后可以通过其他工具或通过优化aiQSAR参数来进一步评估和完善预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdd/6446381/cd01ea249406/13321_2019_350_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdd/6446381/746f85a2c58f/13321_2019_350_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdd/6446381/9e4779803641/13321_2019_350_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdd/6446381/929af2a1d10f/13321_2019_350_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdd/6446381/cd01ea249406/13321_2019_350_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdd/6446381/746f85a2c58f/13321_2019_350_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdd/6446381/9e4779803641/13321_2019_350_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdd/6446381/929af2a1d10f/13321_2019_350_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdd/6446381/cd01ea249406/13321_2019_350_Fig5_HTML.jpg

相似文献

1
Methodology of aiQSAR: a group-specific approach to QSAR modelling.人工智能定量构效关系(aiQSAR)的方法:一种用于定量构效关系建模的特定基团方法。
J Cheminform. 2019 Apr 3;11(1):27. doi: 10.1186/s13321-019-0350-y.
2
QSAR Modelling of Rat Acute Toxicity on the Basis of PASS Prediction.基于 PASS 预测的大鼠急性毒性 QSAR 建模。
Mol Inform. 2011 Mar 14;30(2-3):241-50. doi: 10.1002/minf.201000151. Epub 2011 Mar 18.
3
Integrated QSAR Models to Predict Acute Oral Systemic Toxicity.整合的定量构效关系模型预测急性口服系统毒性。
Mol Inform. 2019 Aug;38(8-9):e1800124. doi: 10.1002/minf.201800124. Epub 2018 Dec 14.
4
Evaluation and comparison of benchmark QSAR models to predict a relevant REACH endpoint: The bioconcentration factor (BCF).评估和比较基准定量构效关系模型,以预测相关的 REACH 终点:生物浓缩因子 (BCF)。
Environ Res. 2015 Feb;137:398-409. doi: 10.1016/j.envres.2014.12.019. Epub 2015 Jan 21.
5
Study of the Applicability Domain of the QSAR Classification Models by Means of the Rivality and Modelability Indexes.应用域的定量构效关系分类模型的研究通过竞争和可模型化指数。
Molecules. 2018 Oct 24;23(11):2756. doi: 10.3390/molecules23112756.
6
Assessing the reliability of a QSAR model's predictions.评估定量构效关系(QSAR)模型预测的可靠性。
J Mol Graph Model. 2005 Jun;23(6):503-23. doi: 10.1016/j.jmgm.2005.03.003.
7
Predicting PBT and CMR properties of substances of very high concern (SVHCs) using QSAR models, and application for K-REACH.使用定量构效关系(QSAR)模型预测高关注度物质(SVHCs)的持久性、生物累积性和毒性(PBT)及化学物质的其他性质,并应用于韩国化学品注册、评估、许可和限制制度(K-REACH)。
Toxicol Rep. 2020 Aug 15;7:995-1000. doi: 10.1016/j.toxrep.2020.08.014. eCollection 2020.
8
QSAR modelling study of the bioconcentration factor and toxicity of organic compounds to aquatic organisms using machine learning and ensemble methods.基于机器学习和集成方法的有机化合物对水生生物的生物浓缩因子和毒性的定量构效关系建模研究。
Ecotoxicol Environ Saf. 2019 Sep 15;179:71-78. doi: 10.1016/j.ecoenv.2019.04.035. Epub 2019 Apr 23.
9
An ensemble model of QSAR tools for regulatory risk assessment.用于监管风险评估的QSAR工具集成模型。
J Cheminform. 2016 Sep 22;8:48. doi: 10.1186/s13321-016-0164-0. eCollection 2016.
10
Ecotoxicological QSAR modeling of organic compounds against fish: Application of fragment based descriptors in feature analysis.有机化合物对鱼类的生态毒理学定量构效关系模型研究:基于片段描述符的特征分析应用。
Aquat Toxicol. 2019 Jul;212:162-174. doi: 10.1016/j.aquatox.2019.05.011. Epub 2019 May 17.

引用本文的文献

1
Role of artificial intelligence in revolutionizing drug discovery.人工智能在变革药物研发中的作用。
Fundam Res. 2024 May 9;5(3):1273-1287. doi: 10.1016/j.fmre.2024.04.021. eCollection 2025 May.
2
Molecular docking analysis of novel quercetin derivatives for combating SARS-CoV-2.用于对抗新型冠状病毒 2 的新型槲皮素衍生物的分子对接分析。
Bioinformation. 2023 Feb 28;19(2):178-183. doi: 10.6026/97320630019178. eCollection 2023.
3
Comparing LD/LC Machine Learning Models for Multiple Species.比较多种物种的LD/LC机器学习模型

本文引用的文献

1
A new semi-automated workflow for chemical data retrieval and quality checking for modeling applications.一种用于建模应用的化学数据检索和质量检查的新型半自动工作流程。
J Cheminform. 2018 Dec 10;10(1):60. doi: 10.1186/s13321-018-0315-6.
2
Predictive Models for Acute Oral Systemic Toxicity: A Workshop to Bridge the Gap from Research to Regulation.急性经口全身毒性预测模型:弥合从研究到监管差距的研讨会
Comput Toxicol. 2018 Nov;8(11):21-24. doi: 10.1016/j.comtox.2018.08.002.
3
QSAR Modeling of ToxCast Assays Relevant to the Molecular Initiating Events of AOPs Leading to Hepatic Steatosis.
J Chem Health Saf. 2023 Mar 27;30(2):83-97. doi: 10.1021/acs.chas.2c00088. Epub 2023 Feb 23.
4
PredAOT: a computational framework for prediction of acute oral toxicity based on multiple random forest models.PredAOT:一种基于多个随机森林模型的急性口服毒性预测计算框架。
BMC Bioinformatics. 2023 Feb 24;24(1):66. doi: 10.1186/s12859-023-05176-5.
5
Principles and Procedures for Assessment of Acute Toxicity Incorporating Methods.包含多种方法的急性毒性评估原则与程序
Comput Toxicol. 2022 Nov;24. doi: 10.1016/j.comtox.2022.100237. Epub 2022 Jul 14.
6
Direct Prediction of Physicochemical Properties and Toxicities of Chemicals from Analytical Descriptors by GC-MS.基于 GC-MS 的分析描述符直接预测化学品的物理化学性质和毒性。
Anal Chem. 2022 Jun 28;94(25):9149-9157. doi: 10.1021/acs.analchem.2c01667. Epub 2022 Jun 14.
7
[Ensemble hologram quantitative structure activity relationship model of the chromatographic retention index of aldehydes and ketones].[醛酮类化合物色谱保留指数的集成全息定量构效关系模型]
Se Pu. 2021 Mar;39(3):331-337. doi: 10.3724/SP.J.1123.2020.06011.
8
CATMoS: Collaborative Acute Toxicity Modeling Suite.CATMoS:协作急性毒性建模套件。
Environ Health Perspect. 2021 Apr;129(4):47013. doi: 10.1289/EHP8495. Epub 2021 Apr 30.
9
SAR and QSAR modeling of a large collection of LD rat acute oral toxicity data.对大量大鼠急性经口毒性数据进行构效关系(SAR)和定量构效关系(QSAR)建模。
J Cheminform. 2019 Aug 30;11(1):58. doi: 10.1186/s13321-019-0383-2.
QSAR 模型构建——预测导致脂肪性肝病的 AOPs 的分子起始事件的 ToxCast 检测结果。
J Chem Inf Model. 2018 Aug 27;58(8):1501-1517. doi: 10.1021/acs.jcim.8b00297. Epub 2018 Jul 26.
4
DPubChem: a web tool for QSAR modeling and high-throughput virtual screening.DPubChem:一个用于定量构效关系建模和高通量虚拟筛选的网络工具。
Sci Rep. 2018 Jun 14;8(1):9110. doi: 10.1038/s41598-018-27495-x.
5
ChemSAR: an online pipelining platform for molecular SAR modeling.化学结构活性关系(ChemSAR):一个用于分子结构活性关系建模的在线流水线平台。
J Cheminform. 2017 May 4;9(1):27. doi: 10.1186/s13321-017-0215-1.
6
Integrating computational methods to predict mutagenicity of aromatic azo compounds.整合计算方法以预测芳香族偶氮化合物的致突变性。
J Environ Sci Health C Environ Carcinog Ecotoxicol Rev. 2017 Oct 2;35(4):239-257. doi: 10.1080/10590501.2017.1391521. Epub 2017 Nov 27.
7
Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations?为什么田本系数是基于指纹的相似性计算的合适选择?
J Cheminform. 2015 May 20;7:20. doi: 10.1186/s13321-015-0069-3. eCollection 2015.
8
Russell and Burch's 3Rs then and now: the need for clarity in definition and purpose.拉塞尔和伯奇的3R原则:过去与现在,定义和目的需清晰明确。
J Am Assoc Lab Anim Sci. 2015 Mar;54(2):120-32.
9
QSAR modeling: where have you been? Where are you going to?定量构效关系模型:你从何处来?你将往何处去?
J Med Chem. 2014 Jun 26;57(12):4977-5010. doi: 10.1021/jm4004285. Epub 2014 Jan 6.
10
lazar: a modular predictive toxicology framework.拉扎尔:一个模块化的预测毒理学框架。
Front Pharmacol. 2013 Apr 9;4:38. doi: 10.3389/fphar.2013.00038. eCollection 2013.