• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于定量构效关系(QSAR)/定量构效预测(QSPR)和读靶预测的新型可扩展分子子结构集。

-A New, Extensible Set of Molecular Substructures for QSAR/QSPR and Read-Across Predictions.

机构信息

Sciome LLC, Research Triangle Park, North Carolina 27709, United States.

National Institute of Environmental Health Sciences (NIEHS), National Toxicology Program (NTP), Research Triangle Park, North Carolina 27709, United States.

出版信息

Chem Res Toxicol. 2021 Feb 15;34(2):634-640. doi: 10.1021/acs.chemrestox.0c00464. Epub 2020 Dec 25.

DOI:10.1021/acs.chemrestox.0c00464
PMID:33356152
Abstract

Molecular structure-based predictive models provide a proven alternative to costly and inefficient animal testing. However, due to a lack of interpretability of predictive models built with abstract molecular descriptors they have earned the notoriety of being black boxes. Interpretable models require interpretable descriptors to provide chemistry-backed predictive reasoning and facilitate intelligent molecular design. We developed a novel set of extensible chemistry-aware substructures, , to support interpretable predictive models and read-across protocols. Performance of in chemical characterization and search for structurally similar actives for read-across applications was compared with four publicly available fingerprint sets (MACCS (166), PubChem (881), ECFP4 (1024), ToxPrint (729)) in three benchmark sets (MUV, ULS, and Tox21) spanning ∼145 000 compounds and 78 molecular targets at 1%, 2%, 5%, and 10% false discovery rates. In 18 of the 20 comparisons, interpretable features performed better than the publicly available, but less interpretable and fixed-bit length, fingerprints. Examples are provided to show the enhanced capability of in extracting compounds with higher scaffold similarity. features are interpretable and efficiently characterize diverse chemical collections, thus making them a better choice for building interpretable predictive models and read-across protocols.

摘要

基于分子结构的预测模型为代价高昂且效率低下的动物测试提供了一种经过验证的替代方法。然而,由于用抽象分子描述符构建的预测模型缺乏可解释性,它们被认为是“黑箱”。可解释的模型需要可解释的描述符来提供有化学依据的预测推理,并促进智能分子设计。我们开发了一套新的可扩展的化学感知子结构 ,以支持可解释的预测模型和读通协议。在三个基准集(MUV、ULS 和 Tox21)中,在 1%、2%、5%和 10%的假发现率下,比较了 在化学特征描述和寻找结构相似的活性物质以进行读通应用方面的性能,与四个公开可用的指纹集(MACCS(166)、PubChem(881)、ECFP4(1024)、ToxPrint(729))进行了比较,涵盖了约 145000 种化合物和 78 个分子靶标。在 20 次比较中的 18 次中,可解释的 特征比公开的、但可解释性较低且固定位长的指纹表现更好。提供了示例来说明 特征在提取具有更高支架相似性的化合物方面的增强能力。 特征是可解释的,可以有效地描述多样化的化学集合,因此,它们是构建可解释的预测模型和读通协议的更好选择。

相似文献

1
-A New, Extensible Set of Molecular Substructures for QSAR/QSPR and Read-Across Predictions.用于定量构效关系(QSAR)/定量构效预测(QSPR)和读靶预测的新型可扩展分子子结构集。
Chem Res Toxicol. 2021 Feb 15;34(2):634-640. doi: 10.1021/acs.chemrestox.0c00464. Epub 2020 Dec 25.
2
Molecular Similarity in Predictive Toxicology with a Focus on the q-RASAR Technique.预测毒理学中的分子相似性研究——聚焦 q-RASAR 技术。
Methods Mol Biol. 2025;2834:41-63. doi: 10.1007/978-1-0716-4003-6_2.
3
Multi-Descriptor Read Across (MuDRA): A Simple and Transparent Approach for Developing Accurate Quantitative Structure-Activity Relationship Models.多描述符读通(MuDRA):一种用于开发准确定量构效关系模型的简单透明方法。
J Chem Inf Model. 2018 Jun 25;58(6):1214-1223. doi: 10.1021/acs.jcim.8b00124. Epub 2018 Jun 13.
4
Integrated decision support for assessing chemical liabilities.用于评估化学责任的综合决策支持。
J Chem Inf Model. 2011 Aug 22;51(8):1840-7. doi: 10.1021/ci200242c. Epub 2011 Aug 5.
5
Design and evaluation of a molecular fingerprint involving the transformation of property descriptor values into a binary classification scheme.一种涉及将性质描述符值转化为二元分类方案的分子指纹的设计与评估。
J Chem Inf Comput Sci. 2003 Jul-Aug;43(4):1151-7. doi: 10.1021/ci030285+.
6
Fragmental approach in QSPR.定量构效关系中的片段法。
J Chem Inf Comput Sci. 2002 Sep-Oct;42(5):1112-22. doi: 10.1021/ci020010e.
7
Genetic Algorithm and Self-Organizing Maps for QSPR Study of Some N-aryl Derivatives as Butyrylcholinesterase Inhibitors.用于某些N-芳基衍生物作为丁酰胆碱酯酶抑制剂的定量构效关系研究的遗传算法和自组织映射
Curr Drug Discov Technol. 2016;13(4):232-253. doi: 10.2174/1570163813666160725114241.
8
Molecule kernels: a descriptor- and alignment-free quantitative structure-activity relationship approach.分子内核:一种无描述符和比对的定量构效关系方法。
J Chem Inf Model. 2008 Sep;48(9):1868-81. doi: 10.1021/ci800144y. Epub 2008 Sep 4.
9
First report on pesticide sub-chronic and chronic toxicities against dogs using QSAR and chemical read-across.利用定量构效关系(QSAR)和化学物质类推法对狗进行农药亚慢性和慢性毒性的首次报告。
SAR QSAR Environ Res. 2024 Mar;35(3):241-263. doi: 10.1080/1062936X.2024.2320143. Epub 2024 Feb 23.
10
Computational models for the classification of mPGES-1 inhibitors with fingerprint descriptors.基于指纹描述符的 mPGES-1 抑制剂分类的计算模型。
Mol Divers. 2017 Aug;21(3):661-675. doi: 10.1007/s11030-017-9743-x. Epub 2017 May 8.

引用本文的文献

1
A Novel Machine Learning Model and a Web Portal for Predicting the Human Skin Sensitization Effects of Chemical Agents.一种用于预测化学试剂对人体皮肤致敏作用的新型机器学习模型及网络门户。
Toxics. 2024 Nov 7;12(11):803. doi: 10.3390/toxics12110803.
2
A systematic analysis of read-across adaptations in testing proposal evaluations by the European Chemicals Agency.欧洲化学品管理局对测试提案评估中类推法应用的系统分析。
ALTEX. 2025;42(1):22-38. doi: 10.14573/altex.2408292. Epub 2024 Nov 22.
3
An in vitro and machine learning framework for quantifying serum albumin binding of per- and polyfluoroalkyl substances.
一种用于量化全氟和多氟烷基物质血清白蛋白结合的体外和机器学习框架。
Toxicol Sci. 2025 Jan 1;203(1):67-78. doi: 10.1093/toxsci/kfae124.
4
A Systematic Analysis of Read-Across Adaptations in Testing Proposal Evaluations by the European Chemicals Agency.欧洲化学品管理局对测试提案评估中类推适应情况的系统分析。
bioRxiv. 2024 Aug 30:2024.08.29.610278. doi: 10.1101/2024.08.29.610278.
5
Characterizing PFAS hazards and risks: a human population-based in vitro cardiotoxicity assessment strategy.描述 PFAS 危害和风险:基于人群的体外心脏毒性评估策略。
Hum Genomics. 2024 Sep 2;18(1):92. doi: 10.1186/s40246-024-00665-x.
6
Hazard and risk characterization of 56 structurally diverse PFAS using a targeted battery of broad coverage assays using six human cell types.使用涵盖六种人类细胞类型的靶向广谱测定法对 56 种结构多样的全氟和多氟烷基物质进行危害和风险特征描述。
Toxicology. 2024 Mar;503:153763. doi: 10.1016/j.tox.2024.153763. Epub 2024 Feb 27.
7
Artificial intelligence (AI)-it's the end of the tox as we know it (and I feel fine).人工智能(AI)——这是我们所知道的毒理学的终结(我感觉很好)。
Arch Toxicol. 2024 Mar;98(3):735-754. doi: 10.1007/s00204-023-03666-2. Epub 2024 Jan 20.
8
Tox21Enricher-Shiny: an R Shiny application for toxicity functional annotation analysis.Tox21Enricher-Shiny:一款用于毒性功能注释分析的R Shiny应用程序。
Front Toxicol. 2023 Jun 27;5:1147608. doi: 10.3389/ftox.2023.1147608. eCollection 2023.
9
Quantifying Analogue Suitability for SAR-Based Read-Across Toxicological Assessment.基于 SAR 的类推毒性评估中分析物适用性的量化。
Chem Res Toxicol. 2023 Feb 20;36(2):230-242. doi: 10.1021/acs.chemrestox.2c00311. Epub 2023 Jan 26.
10
Predicting Prenatal Developmental Toxicity Based On the Combination of Chemical Structures and Biological Data.基于化学结构和生物数据组合预测产前发育毒性。
Environ Sci Technol. 2022 May 3;56(9):5984-5998. doi: 10.1021/acs.est.2c01040. Epub 2022 Apr 22.