• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人在回路辅助的从头分子设计

Human-in-the-loop assisted de novo molecular design.

作者信息

Sundin Iiris, Voronov Alexey, Xiao Haoping, Papadopoulos Kostas, Bjerrum Esben Jannik, Heinonen Markus, Patronov Atanas, Kaski Samuel, Engkvist Ola

机构信息

Department of Computer Science, Aalto University, Espoo, Finland.

Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden.

出版信息

J Cheminform. 2022 Dec 28;14(1):86. doi: 10.1186/s13321-022-00667-8.

DOI:10.1186/s13321-022-00667-8
PMID:36578043
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9795720/
Abstract

A de novo molecular design workflow can be used together with technologies such as reinforcement learning to navigate the chemical space. A bottleneck in the workflow that remains to be solved is how to integrate human feedback in the exploration of the chemical space to optimize molecules. A human drug designer still needs to design the goal, expressed as a scoring function for the molecules that captures the designer's implicit knowledge about the optimization task. Little support for this task exists and, consequently, a chemist usually resorts to iteratively building the objective function of multi-parameter optimization (MPO) in de novo design. We propose a principled approach to use human-in-the-loop machine learning to help the chemist to adapt the MPO scoring function to better match their goal. An advantage is that the method can learn the scoring function directly from the user's feedback while they browse the output of the molecule generator, instead of the current manual tuning of the scoring function with trial and error. The proposed method uses a probabilistic model that captures the user's idea and uncertainty about the scoring function, and it uses active learning to interact with the user. We present two case studies for this: In the first use-case, the parameters of an MPO are learned, and in the second use-case a non-parametric component of the scoring function to capture human domain knowledge is developed. The results show the effectiveness of the methods in two simulated example cases with an oracle, achieving significant improvement in less than 200 feedback queries, for the goals of a high QED score and identifying potent molecules for the DRD2 receptor, respectively. We further demonstrate the performance gains with a medicinal chemist interacting with the system.

摘要

从头开始的分子设计工作流程可以与强化学习等技术一起使用,以探索化学空间。该工作流程中仍有待解决的一个瓶颈是如何在化学空间探索中整合人类反馈以优化分子。人类药物设计师仍需要设计目标,将其表示为分子的评分函数,该函数捕获设计师关于优化任务的隐性知识。目前对此任务的支持很少,因此,化学家通常在从头设计中采用迭代构建多参数优化(MPO)目标函数的方法。我们提出了一种有原则的方法,使用人在回路机器学习来帮助化学家调整MPO评分函数,以更好地匹配他们的目标。一个优点是,该方法可以在用户浏览分子生成器的输出时直接从用户反馈中学习评分函数,而不是目前通过反复试验对手动调整评分函数。所提出的方法使用一个概率模型来捕捉用户对评分函数的想法和不确定性,并使用主动学习与用户进行交互。我们为此展示了两个案例研究:在第一个用例中,学习MPO的参数,在第二个用例中,开发评分函数的非参数组件以捕捉人类领域知识。结果表明,在两个带有预言机的模拟示例案例中,这些方法是有效的,分别针对高QED分数和识别DRD2受体的强效分子的目标,在不到200次反馈查询中取得了显著改进。我们进一步展示了药物化学家与系统交互时的性能提升。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/b6af9d28a64c/13321_2022_667_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/75917d41d8a3/13321_2022_667_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/83b0d759412e/13321_2022_667_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/c818e646737a/13321_2022_667_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/ef46ca5df914/13321_2022_667_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/94f209126010/13321_2022_667_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/b6af9d28a64c/13321_2022_667_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/75917d41d8a3/13321_2022_667_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/83b0d759412e/13321_2022_667_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/c818e646737a/13321_2022_667_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/ef46ca5df914/13321_2022_667_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/94f209126010/13321_2022_667_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab00/9795720/b6af9d28a64c/13321_2022_667_Fig6_HTML.jpg

相似文献

1
Human-in-the-loop assisted de novo molecular design.人在回路辅助的从头分子设计
J Cheminform. 2022 Dec 28;14(1):86. doi: 10.1186/s13321-022-00667-8.
2
Metis: a python-based user interface to collect expert feedback for generative chemistry models.梅蒂斯:一个基于Python的用户界面,用于收集生成化学模型的专家反馈。
J Cheminform. 2024 Aug 14;16(1):100. doi: 10.1186/s13321-024-00892-3.
3
Augmented Hill-Climb increases reinforcement learning efficiency for language-based de novo molecule generation.增强爬山算法提高了基于语言的从头分子生成的强化学习效率。
J Cheminform. 2022 Oct 3;14(1):68. doi: 10.1186/s13321-022-00646-z.
4
Faster and more diverse de novo molecular optimization with double-loop reinforcement learning using augmented SMILES.使用增强型 SMILES 进行双环强化学习,实现更快、更多样的从头分子优化。
J Comput Aided Mol Des. 2023 Aug;37(8):373-394. doi: 10.1007/s10822-023-00512-6. Epub 2023 Jun 17.
5
Memory-assisted reinforcement learning for diverse molecular de novo design.用于多样分子从头设计的记忆辅助强化学习
J Cheminform. 2020 Nov 10;12(1):68. doi: 10.1186/s13321-020-00473-0.
6
Multi-objective molecular generation via clustered Pareto-based reinforcement learning.基于聚类 Pareto 强化学习的多目标分子生成。
Neural Netw. 2024 Nov;179:106596. doi: 10.1016/j.neunet.2024.106596. Epub 2024 Aug 6.
7
DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology.DrugEx v2:基于帕累托的多目标强化学习在多药理学中从头设计药物分子
J Cheminform. 2021 Nov 12;13(1):85. doi: 10.1186/s13321-021-00561-9.
8
Deep generative models for ligand-based de novo design applied to multi-parametric optimization.用于基于配体的从头设计的深度生成模型在多参数优化中的应用。
J Comput Chem. 2022 Apr 15;43(10):692-703. doi: 10.1002/jcc.26826. Epub 2022 Feb 26.
9
De Novo Drug Design by Multi-Objective Path Consistency Learning With Beam A Search.基于束搜索的多目标路径一致性学习的从头药物设计
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2459-2470. doi: 10.1109/TCBB.2024.3477592. Epub 2024 Dec 10.
10
Sample efficient reinforcement learning with active learning for molecular design.用于分子设计的基于主动学习的样本高效强化学习
Chem Sci. 2024 Feb 8;15(11):4146-4160. doi: 10.1039/d3sc04653b. eCollection 2024 Mar 13.

引用本文的文献

1
Toward the Uniform of Chemical Theory, Simulation, and Experiments in Metaverse Technology.迈向元宇宙技术中化学理论、模拟与实验的统一。
Precis Chem. 2023 Jun 14;1(4):192-198. doi: 10.1021/prechem.3c00045. eCollection 2023 Jun 26.
2
Human-in-the-loop active learning for goal-oriented molecule generation.用于目标导向分子生成的人工参与式主动学习
J Cheminform. 2024 Dec 9;16(1):138. doi: 10.1186/s13321-024-00924-y.
3
Metis: a python-based user interface to collect expert feedback for generative chemistry models.梅蒂斯:一个基于Python的用户界面,用于收集生成化学模型的专家反馈。

本文引用的文献

1
DockStream: a docking wrapper to enhance de novo molecular design.DockStream:一种用于增强从头分子设计的对接包装程序。
J Cheminform. 2021 Nov 17;13(1):89. doi: 10.1186/s13321-021-00563-7.
2
Has Artificial Intelligence Impacted Drug Discovery?人工智能是否影响了药物发现?
Methods Mol Biol. 2022;2390:153-176. doi: 10.1007/978-1-0716-1787-8_6.
3
De novo design with deep generative models based on 3D similarity scoring.基于 3D 相似度评分的深度生成模型从头设计。
J Cheminform. 2024 Aug 14;16(1):100. doi: 10.1186/s13321-024-00892-3.
4
Comprehensive applications of the artificial intelligence technology in new drug research and development.人工智能技术在新药研发中的综合应用。
Health Inf Sci Syst. 2024 Aug 8;12(1):41. doi: 10.1007/s13755-024-00300-y. eCollection 2024 Dec.
5
Modeling needs user modeling.
Front Artif Intell. 2023 Apr 6;6:1097891. doi: 10.3389/frai.2023.1097891. eCollection 2023.
Bioorg Med Chem. 2021 Aug 15;44:116308. doi: 10.1016/j.bmc.2021.116308. Epub 2021 Jul 9.
4
Retrosynthetic accessibility score (RAscore) - rapid machine learned synthesizability classification from AI driven retrosynthetic planning.逆合成可及性分数(RAscore)——基于人工智能驱动的逆合成规划的快速机器学习合成性分类。
Chem Sci. 2021 Jan 22;12(9):3339-3349. doi: 10.1039/d0sc05401a.
5
Multiobjective de novo drug design with recurrent neural networks and nondominated sorting.基于循环神经网络和非支配排序的多目标从头药物设计
J Cheminform. 2020 Feb 18;12(1):14. doi: 10.1186/s13321-020-00419-6.
6
REINVENT 2.0: An AI Tool for De Novo Drug Design.REINVENT 2.0:一种用于从头设计药物的人工智能工具。
J Chem Inf Model. 2020 Dec 28;60(12):5918-5922. doi: 10.1021/acs.jcim.0c00915. Epub 2020 Oct 29.
7
grünifai: interactive multiparameter optimization of molecules in a continuous vector space.grünifai:连续向量空间中分子的交互式多参数优化。
Bioinformatics. 2020 Jul 1;36(13):4093-4094. doi: 10.1093/bioinformatics/btaa271.
8
Efficient multi-objective molecular optimization in a continuous latent space.连续潜在空间中的高效多目标分子优化。
Chem Sci. 2019 Jul 8;10(34):8016-8024. doi: 10.1039/c9sc01928f. eCollection 2019 Sep 14.
9
A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space.一种基于图的遗传算法和生成模型/蒙特卡罗树搜索方法用于化学空间探索。
Chem Sci. 2019 Feb 11;10(12):3567-3572. doi: 10.1039/c8sc05372c. eCollection 2019 Mar 28.
10
Improving genomics-based predictions for precision medicine through active elicitation of expert knowledge.通过主动征集专家知识来提高基于基因组学的精准医疗预测。
Bioinformatics. 2018 Jul 1;34(13):i395-i403. doi: 10.1093/bioinformatics/bty257.