• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于基本化学结构和替代度量指标,使用深度神经网络(DNN)预测有机化合物的溶质描述符。

Predicting Solute Descriptors for Organic Chemicals by a Deep Neural Network (DNN) Using Basic Chemical Structures and a Surrogate Metric.

机构信息

Department of Civil and Environmental Engineering, Case Western Reserve University, Cleveland, Ohio 44106, United States.

出版信息

Environ Sci Technol. 2022 Feb 1;56(3):2054-2064. doi: 10.1021/acs.est.1c05398. Epub 2022 Jan 7.

DOI:10.1021/acs.est.1c05398
PMID:34995441
Abstract

Solute descriptors have been widely used to model chemical transfer processes through poly-parameter linear free energy relationships (pp-LFERs); however, there are still substantial difficulties in obtaining these descriptors accurately and quickly for new organic chemicals. In this research, models (PaDEL-DNN) that require only SMILES of chemicals were built to satisfactorily estimate pp-LFER descriptors using deep neural networks (DNN) and the PaDEL chemical representation. The PaDEL-DNN-estimated pp-LFER descriptors demonstrated good performance in modeling storage-lipid/water partitioning coefficient (log ), bioconcentration factor (BCF), aqueous solubility (ESOL), and hydration free energy (freesolve). Then, assuming that the accuracy in the estimated values of widely available properties, e.g., logP (octanol-water partition coefficient), can calibrate estimates for less available but related properties, we proposed logP as a surrogate metric for evaluating the overall accuracy of the estimated pp-LFER descriptors. When using the pp-LFER descriptors to model log, BCF, ESOL, and freesolve, we achieved around 0.1 log unit lower errors for chemicals whose estimated pp-LFER descriptors were deemed "accurate" by the surrogate metric. The interpretation of the PaDEL-DNN models revealed that, for a given test chemical, having several (around 5) "similar" chemicals in the training data set was crucial for accurate estimation while the remaining less similar training chemicals provided reasonable baseline estimates. Lastly, pp-LFER descriptors for over 2800 persistent, bioaccumulative, and toxic chemicals were reasonably estimated by combining PaDEL-DNN with the surrogate metric. Overall, the PaDEL-DNN/surrogate metric and newly estimated descriptors will greatly benefit chemical transfer modeling.

摘要

已有广泛应用溶质描述符通过多参数线性自由能关系(pp-LFER)来模拟化学传递过程;然而,对于新的有机化合物,准确快速地获取这些描述符仍然存在很大的困难。在这项研究中,构建了仅需要化学物质 SMILES 的模型(PaDEL-DNN),使用深度神经网络(DNN)和 PaDEL 化学表示来满意地估计 pp-LFER 描述符。PaDEL-DNN 估计的 pp-LFER 描述符在模拟储存脂质/水分配系数(logP)、生物浓缩系数(BCF)、水溶解度(ESOL)和水合自由能(freesolve)方面表现出良好的性能。然后,假设广泛可用性质(例如,logP(辛醇-水分配系数))的估计值的准确性可以校准较少可用但相关性质的估计值,我们提出了 logP 作为评估估计的 pp-LFER 描述符整体准确性的替代指标。当使用 pp-LFER 描述符来模拟 logP、BCF、ESOL 和 freesolve 时,对于被替代指标认为“准确”的估计 pp-LFER 描述符的化学物质,我们的模型可以实现约 0.1 log 单位的更低误差。PaDEL-DNN 模型的解释表明,对于给定的测试化学物质,在训练数据集中有几个(约 5 个)“相似”的化学物质对于准确估计至关重要,而其余较少相似的训练化学物质提供了合理的基线估计。最后,通过结合 PaDEL-DNN 和替代指标,对超过 2800 种持久性、生物累积性和毒性化学物质的 pp-LFER 描述符进行了合理估计。总的来说,PaDEL-DNN/替代指标和新估计的描述符将极大地促进化学传递模型的发展。

相似文献

1
Predicting Solute Descriptors for Organic Chemicals by a Deep Neural Network (DNN) Using Basic Chemical Structures and a Surrogate Metric.基于基本化学结构和替代度量指标,使用深度神经网络(DNN)预测有机化合物的溶质描述符。
Environ Sci Technol. 2022 Feb 1;56(3):2054-2064. doi: 10.1021/acs.est.1c05398. Epub 2022 Jan 7.
2
Predicting solvent-water partitioning of charged organic species using quantum-chemically estimated Abraham pp-LFER solute parameters.利用量子化学估算的亚伯拉罕pp-LFER溶质参数预测带电有机物种的溶剂-水分配。
Chemosphere. 2016 Dec;164:634-642. doi: 10.1016/j.chemosphere.2016.08.135. Epub 2016 Sep 13.
3
Development and exploration of an organic contaminant fate model using poly-parameter linear free energy relationships.利用多参数线性自由能关系开发和探索有机污染物归宿模型
Environ Sci Technol. 2009 Sep 1;43(17):6676-83. doi: 10.1021/es901205j.
4
Applications of polyparameter linear free energy relationships in environmental chemistry.多参数线性自由能关系在环境化学中的应用。
Environ Sci Technol. 2014 Nov 4;48(21):12477-91. doi: 10.1021/es503369t. Epub 2014 Oct 17.
5
Modeling of adipose/blood partition coefficient for environmental chemicals.环境化学物质的脂肪/血液分配系数建模。
Food Chem Toxicol. 2017 Dec;110:274-285. doi: 10.1016/j.fct.2017.10.044. Epub 2017 Oct 31.
6
Experimental determination of polyparameter linear free energy relationship (pp-LFER) substance descriptors for pesticides and other contaminants: new measurements and recommendations.实验确定农药和其他污染物的多参数线性自由能关系(pp-LFER)物质描述符:新的测量和建议。
Environ Sci Technol. 2013 Dec 17;47(24):14204-14. doi: 10.1021/es404150e. Epub 2013 Nov 22.
7
Polyparameter linear free energy relationships for estimating the equilibrium partition of organic compounds between water and the natural organic matter in soils and sediments.用于估算有机化合物在水与土壤和沉积物中天然有机物之间平衡分配的多参数线性自由能关系。
Environ Sci Technol. 2005 Feb 15;39(4):913-24. doi: 10.1021/es048839s.
8
Expanding the applicability of multimedia fate models to polar organic chemicals.扩大多媒体归宿模型对极性有机化学品的适用性。
Environ Sci Technol. 2003 Nov 1;37(21):4934-43. doi: 10.1021/es034454i.
9
Development of polyparameter linear free energy relationship models for octanol-air partition coefficients of diverse chemicals.多种化学品正辛醇-空气分配系数的多参数线性自由能关系模型的开发。
Environ Sci Process Impacts. 2017 Mar 22;19(3):300-306. doi: 10.1039/c6em00626d.
10
Prediction of polydimethylsiloxane-water partition coefficients based on the pp-LFER and QSAR models.基于 pp-LFER 和 QSAR 模型预测聚二甲基硅氧烷-水分配系数。
Ecotoxicol Environ Saf. 2019 Oct 30;182:109374. doi: 10.1016/j.ecoenv.2019.109374. Epub 2019 Jun 26.