Suppr超能文献

利用可转移性图谱进行快速有效的分子性质预测。

Fast and effective molecular property prediction with transferability map.

作者信息

Yao Shaolun, Song Jie, Jia Lingxiang, Cheng Lechao, Zhong Zipeng, Song Mingli, Feng Zunlei

机构信息

Collaborative Innovation Center of Artificial Intelligence by MOE and Zhejiang Provincial Government, Zhejiang University, 310027, Hangzhou, China.

College of Computer Science and Technology, Zhejiang University, 310027, Hangzhou, China.

出版信息

Commun Chem. 2024 Apr 17;7(1):85. doi: 10.1038/s42004-024-01169-4.

Abstract

Effective transfer learning for molecular property prediction has shown considerable strength in addressing insufficient labeled molecules. Many existing methods either disregard the quantitative relationship between source and target properties, risking negative transfer, or require intensive training on target tasks. To quantify transferability concerning task-relatedness, we propose Principal Gradient-based Measurement (PGM) for transferring molecular property prediction ability. First, we design an optimization-free scheme to calculate a principal gradient for approximating the direction of model optimization on a molecular property prediction dataset. We have analyzed the close connection between the principal gradient and model optimization through mathematical proof. PGM measures the transferability as the distance between the principal gradient obtained from the source dataset and that derived from the target dataset. Then, we perform PGM on various molecular property prediction datasets to build a quantitative transferability map for source dataset selection. Finally, we evaluate PGM on multiple combinations of transfer learning tasks across 12 benchmark molecular property prediction datasets and demonstrate that it can serve as fast and effective guidance to improve the performance of a target task. This work contributes to more efficient discovery of drugs, materials, and catalysts by offering a task-relatedness quantification prior to transfer learning and understanding the relationship between chemical properties.

摘要

用于分子性质预测的有效迁移学习在解决标记分子不足问题方面展现出了相当大的优势。许多现有方法要么忽视源属性与目标属性之间的定量关系,存在负迁移的风险,要么需要在目标任务上进行大量训练。为了量化与任务相关性相关的可迁移性,我们提出了基于主梯度的度量方法(PGM)来迁移分子性质预测能力。首先,我们设计了一种无需优化的方案,用于在分子性质预测数据集上计算一个主梯度,以近似模型优化的方向。我们通过数学证明分析了主梯度与模型优化之间的紧密联系。PGM将可迁移性度量为从源数据集获得的主梯度与从目标数据集导出的主梯度之间的距离。然后,我们在各种分子性质预测数据集上执行PGM,以构建用于源数据集选择的定量可迁移性图谱。最后,我们在12个基准分子性质预测数据集上的多个迁移学习任务组合上评估PGM,并证明它可以作为快速有效的指导,以提高目标任务的性能。这项工作通过在迁移学习之前提供任务相关性量化并理解化学性质之间的关系,有助于更高效地发现药物、材料和催化剂。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a644/11024153/d35a4ed9ccba/42004_2024_1169_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验