• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于图的遗传算法和生成模型/蒙特卡罗树搜索方法用于化学空间探索。

A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space.

作者信息

Jensen Jan H

机构信息

Department of Chemistry , University of Copenhagen , Copenhagen , Denmark . Email:

出版信息

Chem Sci. 2019 Feb 11;10(12):3567-3572. doi: 10.1039/c8sc05372c. eCollection 2019 Mar 28.

DOI:10.1039/c8sc05372c
PMID:30996948
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6438151/
Abstract

This paper presents a comparison of a graph-based genetic algorithm (GB-GA) and machine learning (ML) results for the optimization of log  values with a constraint for synthetic accessibility and shows that the GA is as good as or better than the ML approaches for this particular property. The molecules found by the GB-GA bear little resemblance to the molecules used to construct the initial mating pool, indicating that the GB-GA approach can traverse a relatively large distance in chemical space using relatively few (50) generations. The paper also introduces a new non-ML graph-based generative model (GB-GM) that can be parameterized using very small data sets and combined with a Monte Carlo tree search (MCTS) algorithm. The results are comparable to previously published results (, 2017, , 972-976) using a recurrent neural network (RNN) generative model, and the GB-GM-based method is several orders of magnitude faster. The MCTS results seem more dependent on the composition of the training set than the GA approach for this particular property. Our results suggest that the performance of new ML-based generative models should be compared to that of more traditional, and often simpler, approaches such a GA.

摘要

本文对基于图的遗传算法(GB-GA)和机器学习(ML)在优化log 值时的结果进行了比较,同时考虑了合成可及性的约束条件,并表明对于这一特定性质,遗传算法与机器学习方法表现相当甚至更优。GB-GA找到的分子与用于构建初始交配池的分子几乎没有相似之处,这表明GB-GA方法能够在相对较少的(50)代内跨越化学空间中相对较大的距离。本文还介绍了一种新的基于图的非机器学习生成模型(GB-GM),该模型可以使用非常小的数据集进行参数化,并与蒙特卡罗树搜索(MCTS)算法相结合。其结果与之前使用递归神经网络(RNN)生成模型发表的结果(,2017,,972 - 976)相当,并且基于GB-GM的方法要快几个数量级。对于这一特定性质,MCTS的结果似乎比GA方法更依赖于训练集的组成。我们的结果表明,新的基于机器学习的生成模型的性能应该与更传统且通常更简单的方法(如GA)进行比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/337d/6438151/1106770f1c8f/c8sc05372c-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/337d/6438151/5de358ef1801/c8sc05372c-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/337d/6438151/7163949c2da8/c8sc05372c-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/337d/6438151/19efaf1a2432/c8sc05372c-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/337d/6438151/1106770f1c8f/c8sc05372c-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/337d/6438151/5de358ef1801/c8sc05372c-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/337d/6438151/7163949c2da8/c8sc05372c-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/337d/6438151/19efaf1a2432/c8sc05372c-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/337d/6438151/1106770f1c8f/c8sc05372c-f4.jpg

相似文献

1
A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space.一种基于图的遗传算法和生成模型/蒙特卡罗树搜索方法用于化学空间探索。
Chem Sci. 2019 Feb 11;10(12):3567-3572. doi: 10.1039/c8sc05372c. eCollection 2019 Mar 28.
2
VGAE-MCTS: A New Molecular Generative Model Combining the Variational Graph Auto-Encoder and Monte Carlo Tree Search.VGAE-MCTS:一种结合变分图自编码器和蒙特卡罗树搜索的新型分子生成模型。
J Chem Inf Model. 2023 Dec 11;63(23):7392-7400. doi: 10.1021/acs.jcim.3c01220. Epub 2023 Nov 22.
3
Design of Molecules with Low Hole Reorganization Energy Based on a Quarter-Million Molecule DFT Screen: Part 2.基于二十五万分子密度泛函理论筛选的低空穴重组能分子设计:第二部分
J Phys Chem A. 2022 Sep 1;126(34):5837-5852. doi: 10.1021/acs.jpca.2c04221. Epub 2022 Aug 19.
4
A self-learning Monte Carlo tree search algorithm for robot path planning.一种用于机器人路径规划的自学习蒙特卡罗树搜索算法。
Front Neurorobot. 2023 Jul 6;17:1039644. doi: 10.3389/fnbot.2023.1039644. eCollection 2023.
5
Magicmol: a light-weighted pipeline for drug-like molecule evolution and quick chemical space exploration.Magicmol:一个轻量级的药物分子进化和快速化学空间探索的流水线。
BMC Bioinformatics. 2023 Apr 26;24(1):173. doi: 10.1186/s12859-023-05286-0.
6
Data-driven approaches for identifying hyperparameters in multi-step retrosynthesis.数据驱动的方法用于识别多步骤反合成中的超参数。
Mol Inform. 2023 Nov;42(11):e202300128. doi: 10.1002/minf.202300128. Epub 2023 Sep 27.
7
Enabling target-aware molecule generation to follow multi objectives with Pareto MCTS.使能基于目标的分子生成,以 Pareto MCTS 跟随多目标。
Commun Biol. 2024 Sep 2;7(1):1074. doi: 10.1038/s42003-024-06746-w.
8
Chemistry-informed molecular graph as reaction descriptor for machine-learned retrosynthesis planning.基于化学信息的分子图作为机器学习逆向合成规划的反应描述符。
Proc Natl Acad Sci U S A. 2022 Oct 11;119(41):e2212711119. doi: 10.1073/pnas.2212711119. Epub 2022 Oct 3.
9
Perfect prosthetic heart valve: generative design with machine learning, modeling, and optimization.完美的人工心脏瓣膜:基于机器学习、建模与优化的生成式设计
Front Bioeng Biotechnol. 2023 Sep 15;11:1238130. doi: 10.3389/fbioe.2023.1238130. eCollection 2023.
10
Towards efficient discovery of green synthetic pathways with Monte Carlo tree search and reinforcement learning.利用蒙特卡罗树搜索和强化学习实现绿色合成途径的高效发现。
Chem Sci. 2020 Sep 14;11(40):10959-10972. doi: 10.1039/d0sc04184j.

引用本文的文献

1
Multi-Objective Drug Molecule Optimization Based on Tanimoto Crowding Distance and Acceptance Probability.基于谷本系数拥挤距离和接受概率的多目标药物分子优化
Pharmaceuticals (Basel). 2025 Aug 20;18(8):1227. doi: 10.3390/ph18081227.
2
Expediting the discovery of promising photothermal cyanine molecules through a transfer learning approach.通过迁移学习方法加速有前景的光热花菁分子的发现。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf413.
3
Integrate & balance aspects for safe and sustainable innovation: Needs analysis on SSbD categories and product development stage requirements to cover the entire life cycle.

本文引用的文献

1
Hunting for Organic Molecules with Artificial Intelligence: Molecules Optimized for Desired Excitation Energies.利用人工智能寻找有机分子:针对所需激发能进行优化的分子
ACS Cent Sci. 2018 Sep 26;4(9):1126-1133. doi: 10.1021/acscentsci.8b00213. Epub 2018 Aug 20.
2
Inverse molecular design using machine learning: Generative models for matter engineering.基于机器学习的反向分子设计:物质工程生成模型。
Science. 2018 Jul 27;361(6400):360-365. doi: 10.1126/science.aat2663. Epub 2018 Jul 26.
3
Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules.
整合并平衡各方面因素以实现安全且可持续的创新:对安全系统设计(SSbD)类别和产品开发阶段要求进行需求分析,以涵盖整个生命周期。
Comput Struct Biotechnol J. 2025 Jul 17;29:201-221. doi: 10.1016/j.csbj.2025.07.030. eCollection 2025.
4
Boltz-2: Towards Accurate and Efficient Binding Affinity Prediction.Boltz-2:迈向准确高效的结合亲和力预测
bioRxiv. 2025 Jun 18:2025.06.14.659707. doi: 10.1101/2025.06.14.659707.
5
CMOMO: a deep multi-objective optimization framework for constrained molecular multi-property optimization.CMOMO:一种用于受限分子多性质优化的深度多目标优化框架。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf335.
6
Molecular Machine Learning Approach to Enantioselective C-H Bond Activation Reactions: From Generative AI to Experimental Validation.用于对映选择性C-H键活化反应的分子机器学习方法:从生成式人工智能到实验验证
Chem Sci. 2025 Jun 10. doi: 10.1039/d5sc01098e.
7
Crossover operators for molecular graphs with an application to virtual drug screening.用于分子图的交叉算子及其在虚拟药物筛选中的应用。
J Cheminform. 2025 Jun 17;17(1):97. doi: 10.1186/s13321-025-00958-w.
8
Rag2Mol: structure-based drug design based on retrieval augmented generation.Rag2Mol:基于检索增强生成的基于结构的药物设计。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf265.
9
Guided multi-objective generative AI to enhance structure-based drug design.引导式多目标生成式人工智能增强基于结构的药物设计。
Chem Sci. 2025 May 29. doi: 10.1039/d5sc01778e.
10
Artificial Intelligence in Molecular Optimization: Current Paradigms and Future Frontiers.分子优化中的人工智能:当前范式与未来前沿
Int J Mol Sci. 2025 May 19;26(10):4878. doi: 10.3390/ijms26104878.
使用数据驱动的分子连续表示法进行自动化学设计。
ACS Cent Sci. 2018 Feb 28;4(2):268-276. doi: 10.1021/acscentsci.7b00572. Epub 2018 Jan 12.
4
ChemTS: an efficient python library for molecular generation.ChemTS:一个用于分子生成的高效Python库。
Sci Technol Adv Mater. 2017 Nov 24;18(1):972-976. doi: 10.1080/14686996.2017.1401424. eCollection 2017.
5
Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks.使用递归神经网络生成用于药物发现的聚焦分子库。
ACS Cent Sci. 2018 Jan 24;4(1):120-131. doi: 10.1021/acscentsci.7b00512. Epub 2017 Dec 28.
6
Stochastic voyages into uncharted chemical space produce a representative library of all possible drug-like compounds.随机探索未知的化学空间可产生具有代表性的所有可能类药性化合物文库。
J Am Chem Soc. 2013 May 15;135(19):7296-303. doi: 10.1021/ja401184g. Epub 2013 May 2.
7
Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions.基于分子复杂性和片段贡献的类药性分子的综合可及性评分估计。
J Cheminform. 2009 Jun 10;1(1):8. doi: 10.1186/1758-2946-1-8.
8
A graph-based genetic algorithm and its application to the multiobjective evolution of median molecules.一种基于图的遗传算法及其在中位分子多目标进化中的应用。
J Chem Inf Comput Sci. 2004 May-Jun;44(3):1079-87. doi: 10.1021/ci034290p.