Suppr超能文献

交叉验证策略的应用可避免 2D-QSAR 模型对混合物的水生毒性预测性能的过高估计。

Application of cross-validation strategies to avoid overestimation of performance of 2D-QSAR models for the prediction of aquatic toxicity of chemical mixtures.

机构信息

Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India.

出版信息

SAR QSAR Environ Res. 2022 Jun;33(6):463-484. doi: 10.1080/1062936X.2022.2081255. Epub 2022 May 31.

Abstract

The quantitative structure-activity relationship (QSAR) modelling of mixtures is not as simple as that for individual chemicals, and it needs additional care to avoid overestimation of the performance. In this research, we have developed a 2D-QSAR model using only 2D interpretable and reproducible descriptors to predict the aquatic toxicity of mixtures of polar and non-polar narcotic substances present in the environment. Partial least squares (PLS) regression has been used to model the response variable (log 1/EC50 against ) and the structural features of 84 binary mixtures of polar and nonpolar narcotic toxicants complying with the Organization of Economic Co-operation and Development (OECD) protocols. The model was cross-validated by mixtures-out and compounds-out cross-validation to nullify the developmental bias. The reliability of prediction of the model has been judged by the Prediction Reliability Indicator (PRI) tool using a newly designed set. The new model is robust, reproducible, extremely predictive, easily interpretable, and can be used for reliable prediction of aquatic toxicity of any untested chemical mixtures within the applicability domain. We have additionally used a machine learning-based chemical read-across algorithm in this study to improve the quality of predictions for the toxicity of the mixtures with the modelled descriptors.

摘要

混合物的定量构效关系(QSAR)建模并不像单个化学物质那样简单,需要额外的注意以避免对性能的过高估计。在这项研究中,我们开发了一个仅使用二维可解释和可重现描述符的二维 QSAR 模型,以预测环境中存在的极性和非极性麻醉物质混合物的水生毒性。偏最小二乘(PLS)回归已用于对响应变量(log1/EC50 与 )和 84 种符合经济合作与发展组织(OECD)协议的极性和非极性麻醉毒性混合物的结构特征进行建模。通过混合物外和化合物外交叉验证来消除发展偏差,对模型进行了交叉验证。使用新设计的数据集,通过预测可靠性指标(PRI)工具来判断模型预测的可靠性。新模型具有稳健性、可重复性、高度预测性、易于解释性,并且可以在适用性范围内可靠地预测任何未经测试的化学混合物的水生毒性。此外,我们在这项研究中还使用了基于机器学习的化学类推算法来提高模型描述符混合物毒性预测的质量。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验