• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于监管风险评估的QSAR工具集成模型。

An ensemble model of QSAR tools for regulatory risk assessment.

作者信息

Pradeep Prachi, Povinelli Richard J, White Shannon, Merrill Stephen J

机构信息

National Center for Computational Toxicology (ORISE Fellow), US EPA, Research Triangle Park, NC USA.

Electrical and Computer Engineering Department, Marquette University, Milwaukee, WI USA.

出版信息

J Cheminform. 2016 Sep 22;8:48. doi: 10.1186/s13321-016-0164-0. eCollection 2016.

DOI:10.1186/s13321-016-0164-0
PMID:28316646
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5034616/
Abstract

Quantitative structure activity relationships (QSARs) are theoretical models that relate a quantitative measure of chemical structure to a physical property or a biological effect. QSAR predictions can be used for chemical risk assessment for protection of human and environmental health, which makes them interesting to regulators, especially in the absence of experimental data. For compatibility with regulatory use, QSAR models should be transparent, reproducible and optimized to minimize the number of false negatives. In silico QSAR tools are gaining wide acceptance as a faster alternative to otherwise time-consuming clinical and animal testing methods. However, different QSAR tools often make conflicting predictions for a given chemical and may also vary in their predictive performance across different chemical datasets. In a regulatory context, conflicting predictions raise interpretation, validation and adequacy concerns. To address these concerns, ensemble learning techniques in the machine learning paradigm can be used to integrate predictions from multiple tools. By leveraging various underlying QSAR algorithms and training datasets, the resulting consensus prediction should yield better overall predictive ability. We present a novel ensemble QSAR model using Bayesian classification. The model allows for varying a cut-off parameter that allows for a selection in the desirable trade-off between model sensitivity and specificity. The predictive performance of the ensemble model is compared with four in silico tools (Toxtree, Lazar, OECD Toolbox, and Danish QSAR) to predict carcinogenicity for a dataset of air toxins (332 chemicals) and a subset of the gold carcinogenic potency database (480 chemicals). Leave-one-out cross validation results show that the ensemble model achieves the best trade-off between sensitivity and specificity (accuracy: 83.8 % and 80.4 %, and balanced accuracy: 80.6 % and 80.8 %) and highest inter-rater agreement [kappa (): 0.63 and 0.62] for both the datasets. The ROC curves demonstrate the utility of the cut-off feature in the predictive ability of the ensemble model. This feature provides an additional control to the regulators in grading a chemical based on the severity of the toxic endpoint under study.

摘要

定量构效关系(QSARs)是将化学结构的定量测量与物理性质或生物效应相关联的理论模型。QSAR预测可用于化学风险评估,以保护人类和环境健康,这使得它们对监管机构很有吸引力,尤其是在缺乏实验数据的情况下。为了与监管用途兼容,QSAR模型应具有透明度、可重复性,并进行优化以尽量减少假阴性的数量。计算机模拟QSAR工具作为耗时的临床和动物测试方法的更快替代方案正获得广泛认可。然而,不同的QSAR工具对于给定的化学物质往往会做出相互矛盾的预测,并且在不同化学数据集上的预测性能也可能有所不同。在监管背景下,相互矛盾的预测引发了对解释、验证和充分性的担忧。为了解决这些担忧,可以使用机器学习范式中的集成学习技术来整合来自多个工具的预测。通过利用各种潜在的QSAR算法和训练数据集,所得的共识预测应具有更好的整体预测能力。我们提出了一种使用贝叶斯分类的新型集成QSAR模型。该模型允许改变截止参数,从而可以在模型敏感性和特异性之间进行理想的权衡选择。将集成模型的预测性能与四种计算机模拟工具(Toxtree、Lazar、经合组织工具箱和丹麦QSAR)进行比较,以预测空气毒素数据集(332种化学物质)和黄金致癌潜力数据库子集(480种化学物质)的致癌性。留一法交叉验证结果表明,对于这两个数据集,集成模型在敏感性和特异性之间实现了最佳权衡(准确率:83.8%和80.4%,平衡准确率:80.6%和80.8%),并且具有最高的评分者间一致性[kappa():0.63和0.62]。ROC曲线证明了截止特征在集成模型预测能力中的效用。此特征为监管机构根据所研究的毒性终点的严重程度对化学物质进行分级提供了额外的控制。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3816/5034616/ec69a352177d/13321_2016_164_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3816/5034616/bcc3681816da/13321_2016_164_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3816/5034616/ec69a352177d/13321_2016_164_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3816/5034616/bcc3681816da/13321_2016_164_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3816/5034616/ec69a352177d/13321_2016_164_Fig2_HTML.jpg

相似文献

1
An ensemble model of QSAR tools for regulatory risk assessment.用于监管风险评估的QSAR工具集成模型。
J Cheminform. 2016 Sep 22;8:48. doi: 10.1186/s13321-016-0164-0. eCollection 2016.
2
Prediction of rodent carcinogenic potential of naturally occurring chemicals in the human diet using high-throughput QSAR predictive modeling.使用高通量定量构效关系预测模型预测人类饮食中天然存在的化学物质的啮齿动物致癌潜力。
Toxicol Appl Pharmacol. 2007 Jul 1;222(1):1-16. doi: 10.1016/j.taap.2007.03.012. Epub 2007 Mar 24.
3
In Silico Study of In Vitro GPCR Assays by QSAR Modeling.通过定量构效关系(QSAR)建模对体外G蛋白偶联受体(GPCR)分析进行计算机模拟研究。
Methods Mol Biol. 2016;1425:361-81. doi: 10.1007/978-1-4939-3609-0_16.
4
Exploring the QSAR's predictive truthfulness of the novel N-tuple discrete derivative indices on benchmark datasets.探索新型N元组离散导数指标在基准数据集上的定量构效关系(QSAR)预测真实性。
SAR QSAR Environ Res. 2017 May;28(5):367-389. doi: 10.1080/1062936X.2017.1326403.
5
Multispecies QSAR modeling for predicting the aquatic toxicity of diverse organic chemicals for regulatory toxicology.用于预测多种有机化学品对监管毒理学的水生毒性的多物种定量构效关系建模。
Chem Res Toxicol. 2014 May 19;27(5):741-53. doi: 10.1021/tx400371w. Epub 2014 Apr 17.
6
Integrating QSAR models predicting acute contact toxicity and mode of action profiling in honey bees (A. mellifera): Data curation using open source databases, performance testing and validation.整合预测急性接触毒性和作用模式的定量构效关系模型在蜜蜂(A. mellifera)中的应用:使用开源数据库进行数据整理、性能测试和验证。
Sci Total Environ. 2020 Sep 15;735:139243. doi: 10.1016/j.scitotenv.2020.139243. Epub 2020 May 17.
7
Ecotoxicological QSAR modeling of organic compounds against fish: Application of fragment based descriptors in feature analysis.有机化合物对鱼类的生态毒理学定量构效关系模型研究:基于片段描述符的特征分析应用。
Aquat Toxicol. 2019 Jul;212:162-174. doi: 10.1016/j.aquatox.2019.05.011. Epub 2019 May 17.
8
Predicting PBT and CMR properties of substances of very high concern (SVHCs) using QSAR models, and application for K-REACH.使用定量构效关系(QSAR)模型预测高关注度物质(SVHCs)的持久性、生物累积性和毒性(PBT)及化学物质的其他性质,并应用于韩国化学品注册、评估、许可和限制制度(K-REACH)。
Toxicol Rep. 2020 Aug 15;7:995-1000. doi: 10.1016/j.toxrep.2020.08.014. eCollection 2020.
9
Improvement of quantitative structure-activity relationship (QSAR) tools for predicting Ames mutagenicity: outcomes of the Ames/QSAR International Challenge Project.用于预测埃姆斯致突变性的定量构效关系(QSAR)工具的改进:埃姆斯/QSAR国际挑战赛项目的成果
Mutagenesis. 2019 Mar 6;34(1):3-16. doi: 10.1093/mutage/gey031.
10
[Ensemble hologram quantitative structure activity relationship model of the chromatographic retention index of aldehydes and ketones].[醛酮类化合物色谱保留指数的集成全息定量构效关系模型]
Se Pu. 2021 Mar;39(3):331-337. doi: 10.3724/SP.J.1123.2020.06011.

引用本文的文献

1
Coupled In Silico Toxicology Models Reveal Equivalent Ecological Risks from BPA and Its Alternatives in Chinese Surface Waters.耦合的计算机模拟毒理学模型揭示了双酚A及其替代品在中国地表水中的等效生态风险。
Toxics. 2025 Aug 9;13(8):671. doi: 10.3390/toxics13080671.
2
Application of in silico methods to predict the acute toxicity of bicyclic organophosphorus compounds as potential chemical weapon.应用计算机模拟方法预测双环有机磷化合物作为潜在化学武器的急性毒性。
Arch Toxicol. 2025 Mar 7. doi: 10.1007/s00204-025-04000-8.
3
Application of Machine Learning in the Development of Fourth Degree Quantitative Structure-Activity Relationship Model for Triclosan Analogs Tested against 3D7.

本文引用的文献

1
Integration of QSAR models for bioconcentration suitable for REACH.适用于 REACH 的生物浓缩 QSAR 模型的整合。
Sci Total Environ. 2013 Jul 1;456-457:325-32. doi: 10.1016/j.scitotenv.2013.03.104. Epub 2013 Apr 24.
2
Interpretable, probability-based confidence metric for continuous quantitative structure-activity relationship models.基于概率的可解释性置信度度量方法,用于连续的定量构效关系模型。
J Chem Inf Model. 2013 Feb 25;53(2):368-83. doi: 10.1021/ci300554t. Epub 2013 Feb 5.
3
Toxicokinetics as a key to the integrated toxicity risk assessment based primarily on non-animal approaches.
机器学习在三氯生类似物针对3D7测试的四阶定量构效关系模型开发中的应用。
ACS Omega. 2024 Oct 25;9(44):44436-44447. doi: 10.1021/acsomega.4c05768. eCollection 2024 Nov 5.
4
A numerical compass for experiment design in chemical kinetics and molecular property estimation.化学动力学和分子性质估计实验设计的数值指南。
J Cheminform. 2024 Mar 22;16(1):34. doi: 10.1186/s13321-024-00825-0.
5
Development and application of consensus models for advancing high-throughput toxicological predictions.用于推进高通量毒理学预测的共识模型的开发与应用。
Front Pharmacol. 2024 Jan 25;15:1307905. doi: 10.3389/fphar.2024.1307905. eCollection 2024.
6
Evaluation of Existing QSAR Models and Structural Alerts and Development of New Ensemble Models for Genotoxicity Using a Newly Compiled Experimental Dataset.利用新编制的实验数据集评估现有定量构效关系(QSAR)模型和结构警示,并开发用于遗传毒性的新集成模型。
Comput Toxicol. 2021 May 1;18. doi: 10.1016/j.comtox.2021.100167.
7
Using Chemical Structure Information to Develop Predictive Models for Toxicokinetic Parameters to Inform High-throughput Risk-assessment.利用化学结构信息开发毒代动力学参数预测模型以指导高通量风险评估。
Comput Toxicol. 2020 Nov 1;16. doi: 10.1016/j.comtox.2020.100136.
8
Structure-based QSAR Models to Predict Repeat Dose Toxicity Points of Departure.基于结构的定量构效关系模型预测重复剂量毒性的起始点。
Comput Toxicol. 2020 Nov 1;16(November 2020). doi: 10.1016/j.comtox.2020.100139.
9
Computational Approaches in Preclinical Studies on Drug Discovery and Development.药物发现与开发临床前研究中的计算方法。
Front Chem. 2020 Sep 11;8:726. doi: 10.3389/fchem.2020.00726. eCollection 2020.
10
Assessment of the cytotoxic and mutagenic potential of dichlorvos (DDVP) using in silico classification model; a health hazard awareness in Nigeria.使用计算机分类模型评估敌敌畏(DDVP)的细胞毒性和致突变潜力;尼日利亚的健康危害认知
Environ Anal Health Toxicol. 2020 Sep;35(3):e2020016. doi: 10.5620/eaht.2020016. Epub 2020 Sep 28.
基于非动物方法的综合毒性风险评估,毒代动力学是关键。
Toxicol In Vitro. 2013 Aug;27(5):1570-7. doi: 10.1016/j.tiv.2012.06.012. Epub 2012 Jul 4.
4
The challenges involved in modeling toxicity data in silico: a review.计算机模拟毒性数据所涉及的挑战:综述
Curr Pharm Des. 2012;18(9):1266-91. doi: 10.2174/138161212799436359.
5
Real external predictivity of QSAR models: how to evaluate it? Comparison of different validation criteria and proposal of using the concordance correlation coefficient.QSAR 模型的真实外部预测能力:如何评估?不同验证标准的比较及使用一致性相关系数的建议。
J Chem Inf Model. 2011 Sep 26;51(9):2320-35. doi: 10.1021/ci200211n. Epub 2011 Aug 12.
6
Comparative evaluation of in silico systems for ames test mutagenicity prediction: scope and limitations.计算机系统预测 Ames 试验致突变性的比较评估:范围和局限性。
Chem Res Toxicol. 2011 Jun 20;24(6):843-54. doi: 10.1021/tx2000398. Epub 2011 May 2.
7
Ensemble QSAR: a QSAR method based on conformational ensembles and metric descriptors.集成定量构效关系:一种基于构象集合和度量描述符的定量构效关系方法。
J Comput Chem. 2011 Jul 30;32(10):2204-18. doi: 10.1002/jcc.21804. Epub 2011 Apr 21.
8
In silico toxicology models and databases as FDA Critical Path Initiative toolkits.计算机毒理学模型和数据库作为 FDA 关键路径倡议工具包。
Hum Genomics. 2011 Mar;5(3):200-7. doi: 10.1186/1479-7364-5-3-200.
9
Combined Use of MC4PC, MDL-QSAR, BioEpisteme, Leadscope PDM, and Derek for Windows Software to Achieve High-Performance, High-Confidence, Mode of Action-Based Predictions of Chemical Carcinogenesis in Rodents.结合使用 MC4PC、MDL-QSAR、BioEpisteme、Leadscope PDM 和 Derek for Windows 软件,实现基于作用模式的啮齿动物化学致癌性的高准确性、高可信度预测。
Toxicol Mech Methods. 2008;18(2-3):189-206. doi: 10.1080/15376510701857379.
10
A new hybrid system of QSAR models for predicting bioconcentration factors (BCF).一种用于预测生物富集因子(BCF)的新型QSAR模型混合系统。
Chemosphere. 2008 Dec;73(11):1701-7. doi: 10.1016/j.chemosphere.2008.09.033. Epub 2008 Oct 26.