College of Environmental and Resource Sciences, and Women's Hospital, School of Medicine, Zhejiang University, Hangzhou 310058, China.
Solid Waste and Chemicals Management Center, Ministry of Ecology and Environment of the People's Republic of China, Beijing 100029, China.
Environ Sci Technol. 2024 Apr 9;58(14):6149-6157. doi: 10.1021/acs.est.4c01201. Epub 2024 Apr 1.
The global management for persistent, mobile, and toxic (PMT) and very persistent and very mobile (vPvM) substances has been further strengthened with the rapid increase of emerging contaminants. The development of a ready-to-use and publicly available tool for the high-throughput screening of PMT/vPvM substances is thus urgently needed. However, the current model building with the coupling of conventional algorithms, small-scale data set, and simplistic features hinders the development of a robust model for screening PMT/vPvM with wide application domains. Here, we construct a graph convolutional network (GCN)-enhanced model with feature fusion of a molecular graph and molecular descriptors to effectively utilize the significant correlation between critical descriptors and PMT/vPvM substances. The model is built with 213,084 substances following the latest PMT classification criteria. The application domains of the GCN-enhanced model assessed by kernel density estimation demonstrate the high suitability for high-throughput screening PMT/vPvM substances with both a high accuracy rate (86.6%) and a low false-negative rate (6.8%). An online server named PMT/vPvM profiler is further developed with a user-friendly web interface (http://www.pmt.zj.cn/). Our study facilitates a more efficient evaluation of PMT/vPvM substances with a globally accessible screening platform.
随着新兴污染物的快速增加,持久性、移动性和毒性(PMT)以及高持久性、高流动性(vPvM)物质的全球管理得到了进一步加强。因此,迫切需要开发一种可用于高通量筛选 PMT/vPvM 物质的即用型和公开可用的工具。然而,目前使用传统算法、小规模数据集和简单特征进行模型构建,阻碍了具有广泛应用领域的筛选 PMT/vPvM 的稳健模型的发展。在这里,我们构建了一个图卷积网络(GCN)增强模型,该模型融合了分子图和分子描述符的特征融合,以有效利用关键描述符与 PMT/vPvM 物质之间的显著相关性。该模型是使用最新的 PMT 分类标准构建的,包含 213,084 种物质。通过核密度估计评估的 GCN 增强模型的应用领域表明,该模型非常适合高通量筛选 PMT/vPvM 物质,具有高准确率(86.6%)和低假阴性率(6.8%)。进一步开发了一个名为 PMT/vPvM 分析器的在线服务器,具有用户友好的 Web 界面(http://www.pmt.zj.cn/)。我们的研究为使用全球可访问的筛选平台更有效地评估 PMT/vPvM 物质提供了便利。