• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种数据驱动的生成策略,用于避免多目标分子设计中的奖励操纵。

A data-driven generative strategy to avoid reward hacking in multi-objective molecular design.

作者信息

Yoshizawa Tatsuya, Ishida Shoichi, Sato Tomohiro, Ohta Masateru, Honma Teruki, Terayama Kei

机构信息

Graduate School of Medical Life Science, Yokohama City University, 1-7-29, Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045, Kanagawa, Japan.

RIKEN Center for Biosystems Dynamics Research, 1-7-22, Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045, Kanagawa, Japan.

出版信息

Nat Commun. 2025 Mar 11;16(1):2409. doi: 10.1038/s41467-025-57582-3.

DOI:10.1038/s41467-025-57582-3
PMID:40069140
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11897179/
Abstract

Molecular design using data-driven generative models has emerged as a promising technology, impacting various fields such as drug discovery and the development of functional materials. However, this approach is often susceptible to optimization failure due to reward hacking, where prediction models fail to extrapolate, i.e., fail to accurately predict properties for designed molecules that considerably deviate from the training data. While methods for estimating prediction reliability, such as the applicability domain (AD), have been used for mitigating reward hacking, multi-objective optimization makes it challenging. The difficulty arises from the need to determine in advance whether the multiple ADs with some reliability levels overlap in chemical space, and to appropriately adjust the reliability levels for each property prediction. Herein, we propose a reliable design framework to perform multi-objective optimization using generative models while preventing reward hacking. To demonstrate the effectiveness of the proposed framework, we designed candidates for anticancer drugs as a typical example of multi-objective optimization. We successfully designed molecules with high predicted values and reliabilities, including an approved drug. In addition, the reliability levels can be automatically adjusted according to the property prioritization specified by the user without any detailed settings.

摘要

利用数据驱动生成模型的分子设计已成为一项很有前景的技术,影响着药物发现和功能材料开发等各个领域。然而,由于奖励操纵,这种方法往往容易出现优化失败的情况,即预测模型无法外推,也就是说,无法准确预测与训练数据有很大偏差的设计分子的性质。虽然诸如适用域(AD)等估计预测可靠性的方法已被用于减轻奖励操纵,但多目标优化使其具有挑战性。困难在于需要提前确定具有一定可靠性水平的多个适用域在化学空间中是否重叠,并为每个性质预测适当调整可靠性水平。在此,我们提出了一个可靠的设计框架,以在防止奖励操纵的同时使用生成模型进行多目标优化。为了证明所提出框架的有效性,我们将抗癌药物的候选物设计作为多目标优化的一个典型例子。我们成功地设计出了具有高预测值和可靠性的分子,包括一种已获批的药物。此外,可靠性水平可以根据用户指定的性质优先级自动调整,而无需任何详细设置。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/6e944cf872d5/41467_2025_57582_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/5f90ed7e12da/41467_2025_57582_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/7bfe08a38af0/41467_2025_57582_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/5f9d25680036/41467_2025_57582_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/85bec28ea6da/41467_2025_57582_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/e6ddd0c5fbe9/41467_2025_57582_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/6e944cf872d5/41467_2025_57582_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/5f90ed7e12da/41467_2025_57582_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/7bfe08a38af0/41467_2025_57582_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/5f9d25680036/41467_2025_57582_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/85bec28ea6da/41467_2025_57582_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/e6ddd0c5fbe9/41467_2025_57582_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c754/11897179/6e944cf872d5/41467_2025_57582_Fig6_HTML.jpg

相似文献

1
A data-driven generative strategy to avoid reward hacking in multi-objective molecular design.一种数据驱动的生成策略,用于避免多目标分子设计中的奖励操纵。
Nat Commun. 2025 Mar 11;16(1):2409. doi: 10.1038/s41467-025-57582-3.
2
Molecular Assays Simulator to Unravel Predictors Hacking in Goal-Directed Molecular Generations.分子分析模拟器,揭示定向分子生成中的黑客预测因子。
J Chem Inf Model. 2023 Jul 10;63(13):3983-3998. doi: 10.1021/acs.jcim.3c00195. Epub 2023 Jun 22.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Multi-objective latent space optimization of generative molecular design models.生成式分子设计模型的多目标潜在空间优化
Patterns (N Y). 2024 Aug 12;5(10):101042. doi: 10.1016/j.patter.2024.101042. eCollection 2024 Oct 11.
5
De novo drug design based on Stack-RNN with multi-objective reward-weighted sum and reinforcement learning.基于堆叠循环神经网络的多目标奖励加权和强化学习的从头药物设计。
J Mol Model. 2023 Mar 30;29(4):121. doi: 10.1007/s00894-023-05523-6.
6
FSM-DDTR: End-to-end feedback strategy for multi-objective De Novo drug design using transformers.FSM-DDTR:使用变压器的多目标从头药物设计的端到端反馈策略。
Comput Biol Med. 2023 Sep;164:107285. doi: 10.1016/j.compbiomed.2023.107285. Epub 2023 Jul 31.
7
Beyond generative models: superfast traversal, optimization, novelty, exploration and discovery (STONED) algorithm for molecules using SELFIES.超越生成模型:使用SELFIES的分子超快速遍历、优化、新颖性、探索与发现(STONED)算法
Chem Sci. 2021 Apr 20;12(20):7079-7090. doi: 10.1039/d1sc00231g.
8
MOLER: Incorporate Molecule-Level Reward to Enhance Deep Generative Model for Molecule Optimization.莫勒:纳入分子级奖励以增强用于分子优化的深度生成模型。
IEEE Trans Knowl Data Eng. 2022 Nov;34(11):5459-5471. doi: 10.1109/tkde.2021.3052150. Epub 2021 Jan 21.
9
Designing optimized drug candidates with Generative Adversarial Network.利用生成对抗网络设计优化的候选药物。
J Cheminform. 2022 Jun 26;14(1):40. doi: 10.1186/s13321-022-00623-6.
10
High-throughput property-driven generative design of functional organic molecules.高通量特性驱动的功能有机分子生成设计。
Nat Comput Sci. 2023 Feb;3(2):139-148. doi: 10.1038/s43588-022-00391-1. Epub 2023 Feb 6.

引用本文的文献

1
Generative Deep Learning for de Novo Drug Design─A Chemical Space Odyssey.用于从头药物设计的生成式深度学习——一场化学空间奥德赛。
J Chem Inf Model. 2025 Jul 28;65(14):7352-7372. doi: 10.1021/acs.jcim.5c00641. Epub 2025 Jul 9.

本文引用的文献

1
Mothra: Multiobjective Molecular Generation Using Monte Carlo Tree Search. mothra:基于蒙特卡洛树搜索的多目标分子生成
J Chem Inf Model. 2024 Oct 14;64(19):7291-7302. doi: 10.1021/acs.jcim.4c00759. Epub 2024 Sep 25.
2
AI-driven molecular generation of not-patented pharmaceutical compounds using world open patent data.利用世界公开专利数据,通过人工智能驱动进行非专利药物化合物的分子生成。
J Cheminform. 2023 Dec 13;15(1):120. doi: 10.1186/s13321-023-00791-z.
3
Deep Generative Models in Drug Molecule Generation.深度生成模型在药物分子生成中的应用。
J Chem Inf Model. 2024 Apr 8;64(7):2174-2194. doi: 10.1021/acs.jcim.3c01496. Epub 2023 Nov 7.
4
Prediction of organic compound aqueous solubility using machine learning: a comparison study of descriptor-based and fingerprints-based models.使用机器学习预测有机化合物的水溶性:基于描述符和基于指纹的模型的比较研究
J Cheminform. 2023 Oct 18;15(1):99. doi: 10.1186/s13321-023-00752-6.
5
Impact of Applicability Domains to Generative Artificial Intelligence.适用领域对生成式人工智能的影响。
ACS Omega. 2023 Jun 12;8(25):23148-23167. doi: 10.1021/acsomega.3c00883. eCollection 2023 Jun 27.
6
Molecular Assays Simulator to Unravel Predictors Hacking in Goal-Directed Molecular Generations.分子分析模拟器,揭示定向分子生成中的黑客预测因子。
J Chem Inf Model. 2023 Jul 10;63(13):3983-3998. doi: 10.1021/acs.jcim.3c00195. Epub 2023 Jun 22.
7
Computer-aided multi-objective optimization in small molecule discovery.小分子发现中的计算机辅助多目标优化
Patterns (N Y). 2023 Feb 10;4(2):100678. doi: 10.1016/j.patter.2023.100678.
8
Exposing the Limitations of Molecular Machine Learning with Activity Cliffs.利用活性悬崖揭示分子机器学习的局限性。
J Chem Inf Model. 2022 Dec 12;62(23):5938-5951. doi: 10.1021/acs.jcim.2c01073. Epub 2022 Dec 1.
9
Selective Inhibitor Design for Kinase Homologs Using Multiobjective Monte Carlo Tree Search.使用多目标蒙特卡罗树搜索为激酶同源物设计选择性抑制剂。
J Chem Inf Model. 2022 Nov 28;62(22):5351-5360. doi: 10.1021/acs.jcim.2c00787. Epub 2022 Nov 5.
10
Understanding the evolution of a de novo molecule generator via characteristic functional group monitoring.通过特征官能团监测理解从头分子生成器的演变。
Sci Technol Adv Mater. 2022 Jun 1;23(1):352-360. doi: 10.1080/14686996.2022.2075240. eCollection 2022.