• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

集成迁移学习和多任务学习策略,构建用于预测化学品生物积累参数的图神经网络模型。

Integrated Transfer Learning and Multitask Learning Strategies to Construct Graph Neural Network Models for Predicting Bioaccumulation Parameters of Chemicals.

机构信息

Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China.

Key Laboratory of Integrated Regulation and Resources Development of Shallow Lakes of Ministry of Education, College of Environment, Hohai University, Nanjing 210098, China.

出版信息

Environ Sci Technol. 2024 Sep 3;58(35):15650-15660. doi: 10.1021/acs.est.4c02421. Epub 2024 Jul 25.

DOI:10.1021/acs.est.4c02421
PMID:39051472
Abstract

Accurate prediction of parameters related to the environmental exposure of chemicals is crucial for the sound management of chemicals. However, the lack of large data sets for training models may result in poor prediction accuracy and robustness. Herein, integrated transfer learning (TL) and multitask learning (MTL) was proposed for constructing a graph neural network (GNN) model (abbreviated as TL-MTL-GNN model) using -octanol/water partition coefficients as a source domain. The TL-MTL-GNN model was trained to predict three bioaccumulation parameters based on enlarged data sets that cover 2496 compounds with at least one bioaccumulation parameter. Results show that the TL-MTL-GNN model outperformed single-task GNN models with and without the TL, as well as conventional machine learning models trained with molecular descriptors or fingerprints. Applicability domains were characterized by a state-of-the-art structure-activity landscape-based (abbreviated as AD) methodology. The TL-MTL-GNN model coupled with the optimal AD was employed to predict bioaccumulation parameters for around 60,000 chemicals, with more than 13,000 compounds identified as bioaccumulative chemicals. The high predictive accuracy and robustness of the TL-MTL-GNN model demonstrate the feasibility of integrating the TL and MTL strategy in modeling small-sized data sets. The strategy holds significant potential for addressing small data challenges in modeling environmental chemicals.

摘要

准确预测与化学品环境暴露相关的参数对于化学品的合理管理至关重要。然而,缺乏用于训练模型的大型数据集可能导致预测准确性和稳健性较差。在此,提出了一种集成迁移学习(TL)和多任务学习(MTL)的方法,使用辛醇/水分配系数作为源域来构建图神经网络(GNN)模型(简称 TL-MTL-GNN 模型)。该 TL-MTL-GNN 模型基于包含至少一个生物累积参数的 2496 种化合物的扩充数据集进行训练,用于预测三个生物累积参数。结果表明,TL-MTL-GNN 模型优于具有和不具有 TL 的单任务 GNN 模型,以及使用分子描述符或指纹训练的传统机器学习模型。应用域通过一种基于最新结构-活性景观的(简称 AD)方法进行了描述。TL-MTL-GNN 模型与最优 AD 结合,用于预测约 60000 种化学品的生物累积参数,其中有 13000 多种化合物被确定为生物累积性化学品。TL-MTL-GNN 模型具有较高的预测准确性和稳健性,证明了在小数据集建模中集成 TL 和 MTL 策略的可行性。该策略在解决环境化学物质建模中的小数据集挑战方面具有重要潜力。

相似文献

1
Integrated Transfer Learning and Multitask Learning Strategies to Construct Graph Neural Network Models for Predicting Bioaccumulation Parameters of Chemicals.集成迁移学习和多任务学习策略,构建用于预测化学品生物积累参数的图神经网络模型。
Environ Sci Technol. 2024 Sep 3;58(35):15650-15660. doi: 10.1021/acs.est.4c02421. Epub 2024 Jul 25.
2
Evaluating the Use of Graph Neural Networks and Transfer Learning for Oral Bioavailability Prediction.评估图神经网络和迁移学习在口服生物利用度预测中的应用。
J Chem Inf Model. 2023 Aug 28;63(16):5035-5044. doi: 10.1021/acs.jcim.3c00554. Epub 2023 Aug 15.
3
Retention time prediction in hydrophilic interaction liquid chromatography with graph neural network and transfer learning.基于图神经网络和迁移学习的亲水相互作用液相色谱保留时间预测。
J Chromatogr A. 2021 Oct 25;1656:462536. doi: 10.1016/j.chroma.2021.462536. Epub 2021 Sep 7.
4
Applicability Domains Based on Molecular Graph Contrastive Learning Enable Graph Attention Network Models to Accurately Predict 15 Environmental End Points.基于分子图对比学习的适用性域使图注意网络模型能够准确预测 15 个环境终点。
Environ Sci Technol. 2023 Nov 7;57(44):16906-16917. doi: 10.1021/acs.est.3c03860. Epub 2023 Oct 28.
5
MD-GNN: A mechanism-data-driven graph neural network for molecular properties prediction and new material discovery.MD-GNN:一种基于机制数据的图神经网络,用于分子性质预测和新材料发现。
J Mol Graph Model. 2023 Sep;123:108506. doi: 10.1016/j.jmgm.2023.108506. Epub 2023 May 9.
6
Improved GNNs for Log  Prediction by Transferring Knowledge from Low-Fidelity Data.通过从低质量数据转移知识来改进图神经网络进行日志预测。
J Chem Inf Model. 2023 Apr 24;63(8):2345-2359. doi: 10.1021/acs.jcim.2c01564. Epub 2023 Mar 31.
7
Multi-source transfer learning with Graph Neural Network for excellent modelling the bioactivities of ligands targeting orphan G protein-coupled receptors.基于图神经网络的多源迁移学习在靶向孤儿 G 蛋白偶联受体的配体生物活性建模中的优异表现。
Math Biosci Eng. 2023 Jan;20(2):2588-2608. doi: 10.3934/mbe.2023121. Epub 2022 Nov 25.
8
Accurate Prediction of Aqueous Free Solvation Energies Using 3D Atomic Feature-Based Graph Neural Network with Transfer Learning.使用基于 3D 原子特征的图神经网络与迁移学习准确预测水相自由溶解能。
J Chem Inf Model. 2022 Apr 25;62(8):1840-1848. doi: 10.1021/acs.jcim.2c00260. Epub 2022 Apr 14.
9
FP-GNN: a versatile deep learning architecture for enhanced molecular property prediction.FP-GNN:一种用于增强分子性质预测的多功能深度学习架构。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac408.
10
Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks.使用多任务卷积神经网络从自由文本病理报告中自动提取癌症登记报告信息。
J Am Med Inform Assoc. 2020 Jan 1;27(1):89-98. doi: 10.1093/jamia/ocz153.

引用本文的文献

1
Leveraging scenario differences for cross-task generalization in water plant transfer machine learning models.利用场景差异实现水厂转移机器学习模型中的跨任务泛化。
Environ Sci Ecotechnol. 2025 Jul 23;27:100604. doi: 10.1016/j.ese.2025.100604. eCollection 2025 Sep.