关于基于机器学习的q-RASAR方法中用于高效定量预测选定毒性终点的一些基于新颖相似性的函数。

On Some Novel Similarity-Based Functions Used in the ML-Based q-RASAR Approach for Efficient Quantitative Predictions of Selected Toxicity End Points.

作者信息

Banerjee Arkaprava, Roy Kunal

机构信息

Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700 032, India.

出版信息

Chem Res Toxicol. 2023 Mar 20;36(3):446-464. doi: 10.1021/acs.chemrestox.2c00374. Epub 2023 Feb 22.

DOI:10.1021/acs.chemrestox.2c00374

PMID:36811528

Abstract

The novel quantitative read-across structure-activity relationship (q-RASAR) approach uses read-across-derived similarity functions in the quantitative structure-activity relationship (QSAR) modeling framework in a unique way for supervised model generation. The aim of this study is to explore how this workflow enhances the external (test set) prediction quality of conventional QSAR models by the incorporation of some novel similarity-based functions as additional descriptors using the same level of chemical information. To establish this, five different toxicity data sets, for which QSAR models were reported previously, have been considered in the q-RASAR modeling exercise, which uses chemical similarity-derived measures. The identical sets of chemical features along with the same compositions of training and test sets as reported previously were used in the present analysis for ease of comparison. The RASAR descriptors were calculated based on a chosen similarity measure with the default setting of relevant hyperparameter(s) and were then clubbed with the original structural and physicochemical descriptors, and the number of selected features was further optimized by employing a grid search technique applied on the respective training sets. These features were then used to develop multiple linear regression (MLR) q-RASAR models that show enhanced predictivity as compared to the QSAR models developed previously. Moreover, various other ML algorithms like support vector machine (SVM), linear SVM, random forest, partial least squares, and ridge regression were also employed using the same feature combinations as used in the MLR models to compare the prediction qualities. The q-RASAR models for five different data sets possess at least one of the RASAR descriptors, , and , suggesting that these are important determinants of similarities that contribute to the development of predictive q-RASAR models, as also evident from the SHAP analysis of the models.

摘要

新型定量类推结构-活性关系（q-RASAR）方法在定量结构-活性关系（QSAR）建模框架中以独特方式使用类推衍生的相似性函数来生成监督模型。本研究的目的是探索这种工作流程如何通过纳入一些基于相似性的新型函数作为额外描述符，利用相同水平的化学信息来提高传统QSAR模型的外部（测试集）预测质量。为了证实这一点，在q-RASAR建模实践中考虑了五个先前已报道QSAR模型的不同毒性数据集，该实践使用化学相似性衍生的度量。为便于比较，本分析使用了与先前报道相同的化学特征集以及相同组成的训练集和测试集。基于选定的相似性度量并使用相关超参数的默认设置计算RASAR描述符，然后将其与原始结构和物理化学描述符合并，通过在各个训练集上应用网格搜索技术进一步优化所选特征的数量。然后使用这些特征开发多元线性回归（MLR）q-RASAR模型，与先前开发的QSAR模型相比，该模型显示出更高的预测能力。此外，还使用与MLR模型相同的特征组合采用了各种其他机器学习算法，如支持向量机（SVM）、线性SVM、随机森林、偏最小二乘法和岭回归，以比较预测质量。五个不同数据集的q-RASAR模型至少拥有一个RASAR描述符，、和，这表明这些是相似性的重要决定因素，有助于开发预测性q-RASAR模型，这也从模型的SHAP分析中得到证实。

相似文献

On Some Novel Similarity-Based Functions Used in the ML-Based q-RASAR Approach for Efficient Quantitative Predictions of Selected Toxicity End Points.关于基于机器学习的q-RASAR方法中用于高效定量预测选定毒性终点的一些基于新颖相似性的函数。

Chem Res Toxicol. 2023 Mar 20;36(3):446-464. doi: 10.1021/acs.chemrestox.2c00374. Epub 2023 Feb 22.

First report of q-RASAR modeling toward an approach of easy interpretability and efficient transferability.首次报告 q-RASAR 建模，旨在实现易于解释和高效可迁移性的方法。

Mol Divers. 2022 Oct;26(5):2847-2862. doi: 10.1007/s11030-022-10478-6. Epub 2022 Jun 29.

Efficient predictions of cytotoxicity of TiO-based multi-component nanoparticles using a machine learning-based q-RASAR approach.使用基于机器学习的q-RASAR方法对TiO基多组分纳米颗粒的细胞毒性进行高效预测。

Nanotoxicology. 2023 Feb;17(1):78-93. doi: 10.1080/17435390.2023.2186280. Epub 2023 Mar 8.

Machine learning - based q-RASAR modeling to predict acute contact toxicity of binary organic pesticide mixtures in honey bees.基于机器学习的 q-RASAR 模型预测二元有机农药混合物对蜜蜂的急性接触毒性。

J Hazard Mater. 2023 Oct 15;460:132358. doi: 10.1016/j.jhazmat.2023.132358. Epub 2023 Aug 22.

The application of chemical similarity measures in an unconventional modeling framework c-RASAR along with dimensionality reduction techniques to a representative hepatotoxicity dataset.化学相似性度量在非常规建模框架 c-RASAR 中的应用以及降维技术在具有代表性的肝毒性数据集上的应用。

Sci Rep. 2024 Sep 6;14(1):20812. doi: 10.1038/s41598-024-71892-4.

Prediction-Inspired Intelligent Training for the Development of Classification Read-across Structure-Activity Relationship (c-RASAR) Models for Organic Skin Sensitizers: Assessment of Classification Error Rate from Novel Similarity Coefficients.基于预测的智能训练在分类读靶结构-活性关系（c-RASAR）模型开发中的应用：新型相似系数分类错误率评估。

Chem Res Toxicol. 2023 Sep 18;36(9):1518-1531. doi: 10.1021/acs.chemrestox.3c00155. Epub 2023 Aug 16.

Quantitative read-across structure-activity relationship (q-RASAR): A novel approach to estimate the subchronic oral safety (NOAEL) of diverse organic chemicals in rats.定量结构-活性关系（q-RASAR）：一种估计大鼠中多种有机化合物亚慢性口服安全性（NOAEL）的新方法。

Toxicology. 2024 Jun;505:153824. doi: 10.1016/j.tox.2024.153824. Epub 2024 May 4.

Quantitative Read-across structure-activity relationship (q-RASAR): A new approach methodology to model aquatic toxicity of organic pesticides against different fish species.定量跨读构效关系（q-RASAR）：一种模拟有机农药对不同鱼类水生毒性的新方法学。

Aquat Toxicol. 2023 Dec;265:106776. doi: 10.1016/j.aquatox.2023.106776. Epub 2023 Nov 20.

Methods Mol Biol. 2025;2834:41-63. doi: 10.1007/978-1-0716-4003-6_2.

Machine learning-based q-RASAR predictions of the bioconcentration factor of organic molecules estimated following the organisation for economic co-operation and development guideline 305.基于机器学习的 q-RASAR 预测有机分子的生物浓缩因子，该预测方法是按照经济合作与发展组织的指南 305 进行估算的。

J Hazard Mater. 2024 Nov 5;479:135725. doi: 10.1016/j.jhazmat.2024.135725. Epub 2024 Sep 3.

引用本文的文献

From structure to strategy: chemometric modeling for the prediction of terminal half-life of pharmaceuticals and its role in future therapeutics.从结构到策略：药物终末半衰期预测的化学计量学建模及其在未来治疗中的作用

Mol Divers. 2025 Aug 21. doi: 10.1007/s11030-025-11322-3.

Assessment of the rat acute oral toxicity of quinoline-based pharmaceutical scaffold molecules using QSTR, q-RASTR and machine learning methods.使用定量结构-活性关系（QSTR）、定量响应-活性关系（q-RASTR）和机器学习方法评估喹啉类药物支架分子的大鼠急性经口毒性。

Mol Divers. 2025 Jun 27. doi: 10.1007/s11030-025-11265-9.

Machine learning assisted classification RASAR modeling for the nephrotoxicity potential of a curated set of orally active drugs.机器学习辅助分类RASAR模型用于一组精选口服活性药物的肾毒性潜力评估

Sci Rep. 2025 Jan 4;15(1):808. doi: 10.1038/s41598-024-85063-y.

Development of hybrid models by the integration of the read-across hypothesis with the QSAR framework for the assessment of developmental and reproductive toxicity (DART) tested according to OECD TG 414.通过将类推假设与QSAR框架相结合来开发混合模型，以评估根据经合组织测试指南414进行测试的发育和生殖毒性（DART）。

Toxicol Rep. 2024 Nov 19;13:101822. doi: 10.1016/j.toxrep.2024.101822. eCollection 2024 Dec.

The round-robin approach applied to nanoinformatics: consensus prediction of nanomaterials zeta potential.应用于纳米信息学的循环赛方法：纳米材料zeta电位的共识预测

Beilstein J Nanotechnol. 2024 Nov 29;15:1536-1553. doi: 10.3762/bjnano.15.121. eCollection 2024.

Methods Mol Biol. 2025;2834:41-63. doi: 10.1007/978-1-0716-4003-6_2.

Sci Rep. 2024 Sep 6;14(1):20812. doi: 10.1038/s41598-024-71892-4.

Crit Rev Toxicol. 2024 Oct;54(9):659-684. doi: 10.1080/10408444.2024.2386260. Epub 2024 Sep 3.

soil degradation and ecotoxicity analysis of veterinary pharmaceuticals on terrestrial species: first report.兽用药品对陆地物种的土壤退化及生态毒性分析：首次报告

Toxicol Res (Camb). 2024 Feb 26;13(1):tfae020. doi: 10.1093/toxres/tfae020. eCollection 2024 Feb.

Automated machine learning in nanotoxicity assessment: A comparative study of predictive model performance.纳米毒性评估中的自动化机器学习：预测模型性能的比较研究

Comput Struct Biotechnol J. 2024 Feb 9;25:9-19. doi: 10.1016/j.csbj.2024.02.003. eCollection 2024 Dec.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

关于基于机器学习的q-RASAR方法中用于高效定量预测选定毒性终点的一些基于新颖相似性的函数。

On Some Novel Similarity-Based Functions Used in the ML-Based q-RASAR Approach for Efficient Quantitative Predictions of Selected Toxicity End Points.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献