Suppr超能文献

利用分子指纹和机器学习对环境化学品的物理化学性质进行计算机模拟预测。

In Silico Prediction of Physicochemical Properties of Environmental Chemicals Using Molecular Fingerprints and Machine Learning.

作者信息

Zang Qingda, Mansouri Kamel, Williams Antony J, Judson Richard S, Allen David G, Casey Warren M, Kleinstreuer Nicole C

机构信息

Integrated Laboratory Systems, Inc. , Research Triangle Park, North Carolina 27709, United States.

National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency , Research Triangle Park, North Carolina 27711, United States.

出版信息

J Chem Inf Model. 2017 Jan 23;57(1):36-49. doi: 10.1021/acs.jcim.6b00625. Epub 2017 Jan 9.

Abstract

There are little available toxicity data on the vast majority of chemicals in commerce. High-throughput screening (HTS) studies, such as those being carried out by the U.S. Environmental Protection Agency (EPA) ToxCast program in partnership with the federal Tox21 research program, can generate biological data to inform models for predicting potential toxicity. However, physicochemical properties are also needed to model environmental fate and transport, as well as exposure potential. The purpose of the present study was to generate an open-source quantitative structure-property relationship (QSPR) workflow to predict a variety of physicochemical properties that would have cross-platform compatibility to integrate into existing cheminformatics workflows. In this effort, decades-old experimental property data sets available within the EPA EPI Suite were reanalyzed using modern cheminformatics workflows to develop updated QSPR models capable of supplying computationally efficient, open, and transparent HTS property predictions in support of environmental modeling efforts. Models were built using updated EPI Suite data sets for the prediction of six physicochemical properties: octanol-water partition coefficient (logP), water solubility (logS), boiling point (BP), melting point (MP), vapor pressure (logVP), and bioconcentration factor (logBCF). The coefficient of determination (R) between the estimated values and experimental data for the six predicted properties ranged from 0.826 (MP) to 0.965 (BP), with model performance for five of the six properties exceeding those from the original EPI Suite models. The newly derived models can be employed for rapid estimation of physicochemical properties within an open-source HTS workflow to inform fate and toxicity prediction models of environmental chemicals.

摘要

市面上绝大多数化学品的毒性数据都很少。高通量筛选(HTS)研究,比如美国环境保护局(EPA)的ToxCast项目与联邦Tox21研究项目合作开展的那些研究,能够生成生物学数据,为预测潜在毒性的模型提供信息。然而,还需要物理化学性质来模拟环境归宿和迁移,以及暴露可能性。本研究的目的是生成一个开源的定量结构-性质关系(QSPR)工作流程,以预测各种具有跨平台兼容性的物理化学性质,从而能够整合到现有的化学信息学工作流程中。在这项工作中,利用现代化学信息学工作流程重新分析了EPA EPI Suite中已有数十年历史的实验性质数据集,以开发更新的QSPR模型,这些模型能够提供计算高效、开放且透明的高通量筛选性质预测,以支持环境建模工作。使用更新后的EPI Suite数据集建立了用于预测六种物理化学性质的模型:辛醇-水分配系数(logP)、水溶性(logS)、沸点(BP)、熔点(MP)、蒸气压(logVP)和生物富集因子(logBCF)。六种预测性质的估计值与实验数据之间的决定系数(R)范围为0.826(MP)至0.965(BP),六种性质中有五种的模型性能超过了原始EPI Suite模型。新推导的模型可用于在开源高通量筛选工作流程中快速估计物理化学性质,以为环境化学品的归宿和毒性预测模型提供信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fe1/6131700/dbccba5b209f/nihms-1504120-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验