• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

毒理学大数据的机器学习可实现优于动物测试重现性的读交叉结构活性关系 (RASAR)。

Machine Learning of Toxicological Big Data Enables Read-Across Structure Activity Relationships (RASAR) Outperforming Animal Test Reproducibility.

机构信息

Johns Hopkins University, Bloomberg School of Public Health, Center for Alternatives to Animal Testing (CAAT), Baltimore, Maryland 21205.

ToxTrack, Baltimore, Maryland 21209.

出版信息

Toxicol Sci. 2018 Sep 1;165(1):198-212. doi: 10.1093/toxsci/kfy152.

DOI:10.1093/toxsci/kfy152
PMID:30007363
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6135638/
Abstract

Earlier we created a chemical hazard database via natural language processing of dossiers submitted to the European Chemical Agency with approximately 10 000 chemicals. We identified repeat OECD guideline tests to establish reproducibility of acute oral and dermal toxicity, eye and skin irritation, mutagenicity and skin sensitization. Based on 350-700+ chemicals each, the probability that an OECD guideline animal test would output the same result in a repeat test was 78%-96% (sensitivity 50%-87%). An expanded database with more than 866 000 chemical properties/hazards was used as training data and to model health hazards and chemical properties. The constructed models automate and extend the read-across method of chemical classification. The novel models called RASARs (read-across structure activity relationship) use binary fingerprints and Jaccard distance to define chemical similarity. A large chemical similarity adjacency matrix is constructed from this similarity metric and is used to derive feature vectors for supervised learning. We show results on 9 health hazards from 2 kinds of RASARs-"Simple" and "Data Fusion". The "Simple" RASAR seeks to duplicate the traditional read-across method, predicting hazard from chemical analogs with known hazard data. The "Data Fusion" RASAR extends this concept by creating large feature vectors from all available property data rather than only the modeled hazard. Simple RASAR models tested in cross-validation achieve 70%-80% balanced accuracies with constraints on tested compounds. Cross validation of data fusion RASARs show balanced accuracies in the 80%-95% range across 9 health hazards with no constraints on tested compounds.

摘要

早些时候,我们通过对提交给欧洲化学品管理局的档案进行自然语言处理,创建了一个化学危害数据库,其中包含大约 10000 种化学物质。我们确定了重复的 OECD 指导方针测试,以建立急性口服和皮肤毒性、眼睛和皮肤刺激、致突变性和皮肤致敏性的重现性。基于每个测试约有 350-700+种化学物质,OECD 指导方针动物测试在重复测试中得出相同结果的概率为 78%-96%(灵敏度为 50%-87%)。一个包含超过 866000 种化学性质/危害的扩展数据库被用作训练数据,并用于模拟健康危害和化学性质。构建的模型使化学分类的读取方法自动化和扩展。这些名为 RASARs(读取相似性结构活性关系)的新型模型使用二进制指纹和 Jaccard 距离来定义化学相似性。从这种相似性度量标准构建了一个大型化学相似性邻接矩阵,并用于为监督学习生成特征向量。我们展示了来自两种 RASARs(“简单”和“数据融合”)的 9 种健康危害的结果。“简单”RASAR 试图复制传统的读取方法,根据具有已知危害数据的化学类似物预测危害。“数据融合”RASAR 通过从所有可用的属性数据而不是仅建模的危害中创建大型特征向量来扩展此概念。经过交叉验证的简单 RASAR 模型在测试化合物受限的情况下达到 70%-80%的平衡准确性。数据融合 RASAR 的交叉验证在 9 种健康危害中显示出 80%-95%的平衡准确性,对测试化合物没有限制。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/4c4e71c9ddc1/kfy152f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/af24777f58d5/kfy152f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/635092e5a461/kfy152f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/c2cda96f6159/kfy152f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/a8444de7686b/kfy152f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/27881e7824a4/kfy152f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/102615fe8ef2/kfy152f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/6a65d552c734/kfy152f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/4c4e71c9ddc1/kfy152f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/af24777f58d5/kfy152f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/635092e5a461/kfy152f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/c2cda96f6159/kfy152f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/a8444de7686b/kfy152f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/27881e7824a4/kfy152f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/102615fe8ef2/kfy152f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/6a65d552c734/kfy152f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e726/6135638/4c4e71c9ddc1/kfy152f8.jpg

相似文献

1
Machine Learning of Toxicological Big Data Enables Read-Across Structure Activity Relationships (RASAR) Outperforming Animal Test Reproducibility.毒理学大数据的机器学习可实现优于动物测试重现性的读交叉结构活性关系 (RASAR)。
Toxicol Sci. 2018 Sep 1;165(1):198-212. doi: 10.1093/toxsci/kfy152.
2
Prediction-Inspired Intelligent Training for the Development of Classification Read-across Structure-Activity Relationship (c-RASAR) Models for Organic Skin Sensitizers: Assessment of Classification Error Rate from Novel Similarity Coefficients.基于预测的智能训练在分类读靶结构-活性关系(c-RASAR)模型开发中的应用:新型相似系数分类错误率评估。
Chem Res Toxicol. 2023 Sep 18;36(9):1518-1531. doi: 10.1021/acs.chemrestox.3c00155. Epub 2023 Aug 16.
3
Predicting chemical hazard across taxa through machine learning.通过机器学习预测跨分类群的化学危害。
Environ Int. 2022 May;163:107184. doi: 10.1016/j.envint.2022.107184. Epub 2022 Mar 17.
4
Analysis of Draize eye irritation testing and its prediction by mining publicly available 2008-2014 REACH data.通过挖掘2008 - 2014年可公开获取的REACH数据对德莱兹眼刺激试验进行分析及其预测。
ALTEX. 2016;33(2):123-34. doi: 10.14573/altex.1510053. Epub 2016 Feb 11.
5
The application of chemical similarity measures in an unconventional modeling framework c-RASAR along with dimensionality reduction techniques to a representative hepatotoxicity dataset.化学相似性度量在非常规建模框架 c-RASAR 中的应用以及降维技术在具有代表性的肝毒性数据集上的应用。
Sci Rep. 2024 Sep 6;14(1):20812. doi: 10.1038/s41598-024-71892-4.
6
Supporting read-across using biological data.利用生物学数据支持交叉参照。
ALTEX. 2016;33(2):167-82. doi: 10.14573/altex.1601252. Epub 2016 Feb 11.
7
Relevance and Application of Read-Across - Mini Review of European Consensus Platform for Alternatives and Scandinavian Society for Cell Toxicology 2017 Workshop Session.基于读片的相关性和应用——对 2017 年欧洲替代方法共识平台和斯堪的纳维亚细胞毒理学学会研讨会会议的简要回顾。
Basic Clin Pharmacol Toxicol. 2018 Sep;123 Suppl 5:37-41. doi: 10.1111/bcpt.13006. Epub 2018 Apr 24.
8
Predicting toxicity of chemicals: software beats animal testing.预测化学物质的毒性:软件胜过动物试验。
EFSA J. 2019 Jul 8;17(Suppl 1):e170710. doi: 10.2903/j.efsa.2019.e170710. eCollection 2019 Jul.
9
Molecular similarity in chemical informatics and predictive toxicity modeling: from quantitative read-across (q-RA) to quantitative read-across structure-activity relationship (q-RASAR) with the application of machine learning.化学信息学和预测性毒理学建模中的分子相似性:从定量文献外推 (q-RA) 到基于机器学习的定量文献外推结构-活性关系 (q-RASAR)。
Crit Rev Toxicol. 2024 Oct;54(9):659-684. doi: 10.1080/10408444.2024.2386260. Epub 2024 Sep 3.
10
Machine learning-based q-RASAR predictions of the bioconcentration factor of organic molecules estimated following the organisation for economic co-operation and development guideline 305.基于机器学习的 q-RASAR 预测有机分子的生物浓缩因子,该预测方法是按照经济合作与发展组织的指南 305 进行估算的。
J Hazard Mater. 2024 Nov 5;479:135725. doi: 10.1016/j.jhazmat.2024.135725. Epub 2024 Sep 3.

引用本文的文献

1
BioBricks.ai: a versioned data registry for life sciences data assets.BioBricks.ai:生命科学数据资产的版本化数据注册库。
Front Artif Intell. 2025 Aug 13;8:1599412. doi: 10.3389/frai.2025.1599412. eCollection 2025.
2
From structure to strategy: chemometric modeling for the prediction of terminal half-life of pharmaceuticals and its role in future therapeutics.从结构到策略:药物终末半衰期预测的化学计量学建模及其在未来治疗中的作用
Mol Divers. 2025 Aug 21. doi: 10.1007/s11030-025-11322-3.
3
ToxACoL: an endpoint-aware and task-focused compound representation learning paradigm for acute toxicity assessment.

本文引用的文献

1
Big-data and machine learning to revamp computational toxicology and its use in risk assessment.大数据与机器学习助力革新计算毒理学及其在风险评估中的应用。
Toxicol Res (Camb). 2018 May 1;7(5):732-744. doi: 10.1039/c8tx00051d. eCollection 2018 Sep 1.
2
3S - Systematic, systemic, and systems biology and toxicology.3S - 系统生物学、系统毒理学和整体医学。
ALTEX. 2018;35(2):139-162. doi: 10.14573/altex.1804051.
3
Rebooting the generally recognized as safe (GRAS) approach for food additive safety in the US.重启美国食品添加剂安全性的“一般公认安全”(GRAS)方法。
ToxACoL:一种用于急性毒性评估的、以终点为导向且专注于任务的化合物表示学习范式。
Nat Commun. 2025 Jul 1;16(1):5992. doi: 10.1038/s41467-025-60989-7.
4
Semi-Correlations for the Simulation of Dermal Toxicity.用于模拟皮肤毒性的半相关性。
Toxics. 2025 Mar 23;13(4):235. doi: 10.3390/toxics13040235.
5
The round-robin approach applied to nanoinformatics: consensus prediction of nanomaterials zeta potential.应用于纳米信息学的循环赛方法:纳米材料zeta电位的共识预测
Beilstein J Nanotechnol. 2024 Nov 29;15:1536-1553. doi: 10.3762/bjnano.15.121. eCollection 2024.
6
A Novel Machine Learning Model and a Web Portal for Predicting the Human Skin Sensitization Effects of Chemical Agents.一种用于预测化学试剂对人体皮肤致敏作用的新型机器学习模型及网络门户。
Toxics. 2024 Nov 7;12(11):803. doi: 10.3390/toxics12110803.
7
Molecular Similarity in Predictive Toxicology with a Focus on the q-RASAR Technique.预测毒理学中的分子相似性研究——聚焦 q-RASAR 技术。
Methods Mol Biol. 2025;2834:41-63. doi: 10.1007/978-1-0716-4003-6_2.
8
BioBricks.ai: A Versioned Data Registry for Life Sciences Data Assets.BioBricks.ai:生命科学数据资产的版本化数据注册库。
ArXiv. 2024 Aug 30:arXiv:2408.17320v1.
9
The application of chemical similarity measures in an unconventional modeling framework c-RASAR along with dimensionality reduction techniques to a representative hepatotoxicity dataset.化学相似性度量在非常规建模框架 c-RASAR 中的应用以及降维技术在具有代表性的肝毒性数据集上的应用。
Sci Rep. 2024 Sep 6;14(1):20812. doi: 10.1038/s41598-024-71892-4.
10
Molecular similarity in chemical informatics and predictive toxicity modeling: from quantitative read-across (q-RA) to quantitative read-across structure-activity relationship (q-RASAR) with the application of machine learning.化学信息学和预测性毒理学建模中的分子相似性:从定量文献外推 (q-RA) 到基于机器学习的定量文献外推结构-活性关系 (q-RASAR)。
Crit Rev Toxicol. 2024 Oct;54(9):659-684. doi: 10.1080/10408444.2024.2386260. Epub 2024 Sep 3.
ALTEX. 2018;35(1):3-25. doi: 10.14573/altex.1712181.
4
Green Toxicology-Know Early About and Avoid Toxic Product Liabilities.绿色毒理学——尽早了解并避免有毒产品责任。
Toxicol Sci. 2018 Feb 1;161(2):285-289. doi: 10.1093/toxsci/kfx243.
5
Computational approaches to chemical hazard assessment.计算方法在化学危害评估中的应用。
ALTEX. 2017;34(4):459-478. doi: 10.14573/altex.1710141.
6
Thresholds of Toxicological Concern - Setting a threshold for testing below which there is little concern.毒理学关注阈值——设定一个测试阈值,低于该阈值则几乎无需担忧。
ALTEX. 2017;34(3):331-351. doi: 10.14573/altex.1707011.
7
The Threshold of Toxicological Concern for prenatal developmental toxicity in rats and rabbits.大鼠和家兔产前发育毒性的毒理学关注阈值。
Regul Toxicol Pharmacol. 2017 Aug;88:157-172. doi: 10.1016/j.yrtph.2017.06.008. Epub 2017 Jun 20.
8
Green Toxicology: a strategy for sustainable chemical and material development.绿色毒理学:可持续化学与材料开发策略
Environ Sci Eur. 2017;29(1):16. doi: 10.1186/s12302-017-0115-z. Epub 2017 Apr 4.
9
Opinion versus evidence for the need to move away from animal testing.关于摆脱动物实验必要性的观点与证据
ALTEX. 2017;34(2):193-200. doi: 10.14573/altex.1703291.
10
The need for strategic development of safety sciences.安全科学的战略发展需求。
ALTEX. 2017;34(1):3-21. doi: 10.14573/altex.1701031.