生成式机器学习在从头药物发现中的应用：系统评价。

Generative machine learning for de novo drug discovery: A systematic review.

机构信息

Independent researcher, United States of America.

出版信息

Comput Biol Med. 2022 Jun;145:105403. doi: 10.1016/j.compbiomed.2022.105403. Epub 2022 Mar 13.

DOI:10.1016/j.compbiomed.2022.105403

Abstract

Recent research on artificial intelligence indicates that machine learning algorithms can auto-generate novel drug-like molecules. Generative models have revolutionized de novo drug discovery, rendering the explorative process more efficient. Several model frameworks and input formats have been proposed to enhance the performance of intelligent algorithms in generative molecular design. In this systematic literature review of experimental articles and reviews over the last five years, machine learning models, challenges associated with computational molecule design along with proposed solutions, and molecular encoding methods are discussed. A query-based search of the PubMed, ScienceDirect, Springer, Wiley Online Library, arXiv, MDPI, bioRxiv, and IEEE Xplore databases yielded 87 studies. Twelve additional studies were identified via citation searching. Of the articles in which machine learning was implemented, six prominent algorithms were identified: long short-term memory recurrent neural networks (LSTM-RNNs), variational autoencoders (VAEs), generative adversarial networks (GANs), adversarial autoencoders (AAEs), evolutionary algorithms, and gated recurrent unit (GRU-RNNs). Furthermore, eight central challenges were designated: homogeneity of generated molecular libraries, deficient synthesizability, limited assay data, model interpretability, incapacity for multi-property optimization, incomparability, restricted molecule size, and uncertainty in model evaluation. Molecules were encoded either as strings, which were occasionally augmented using randomization, as 2D graphs, or as 3D graphs. Statistical analysis and visualization are performed to illustrate how approaches to machine learning in de novo drug design have evolved over the past five years. Finally, future opportunities and reservations are discussed.

摘要

最近关于人工智能的研究表明，机器学习算法可以自动生成新的类似药物的分子。生成模型彻底改变了从头药物发现，使探索过程更加高效。已经提出了几种模型框架和输入格式，以提高智能算法在生成分子设计中的性能。在过去五年的实验文章和综述的系统文献回顾中，讨论了机器学习模型、与计算分子设计相关的挑战以及提出的解决方案，以及分子编码方法。通过对 PubMed、ScienceDirect、Springer、Wiley Online Library、arXiv、MDPI、bioRxiv 和 IEEE Xplore 数据库的基于查询的搜索，共获得 87 项研究。通过引文搜索又确定了 12 项研究。在实施机器学习的文章中，确定了六个突出的算法：长短期记忆递归神经网络 (LSTM-RNN)、变分自动编码器 (VAE)、生成对抗网络 (GAN)、对抗自动编码器 (AAE)、进化算法和门控递归单元 (GRU-RNN)。此外，指定了八个核心挑战：生成分子库的同质性、合成能力不足、有限的测定数据、模型可解释性、无法进行多属性优化、不可比性、受限的分子大小和模型评估的不确定性。分子被编码为字符串，偶尔使用随机化进行扩充，也可以编码为 2D 图或 3D 图。进行统计分析和可视化，以说明过去五年中从头药物设计中机器学习方法的发展情况。最后，讨论了未来的机会和保留意见。

相似文献

Generative machine learning for de novo drug discovery: A systematic review.生成式机器学习在从头药物发现中的应用：系统评价。

Comput Biol Med. 2022 Jun;145:105403. doi: 10.1016/j.compbiomed.2022.105403. Epub 2022 Mar 13.

Accuracy of Using Generative Adversarial Networks for Glaucoma Detection: Systematic Review and Bibliometric Analysis.使用生成对抗网络进行青光眼检测的准确性：系统评价和文献计量分析。

J Med Internet Res. 2021 Sep 21;23(9):e27414. doi: 10.2196/27414.

The COVID-19 epidemic analysis and diagnosis using deep learning: A systematic literature review and future directions.利用深度学习进行 COVID-19 疫情分析和诊断：系统文献回顾及未来方向。

Comput Biol Med. 2022 Feb;141:105141. doi: 10.1016/j.compbiomed.2021.105141. Epub 2021 Dec 14.

Machine Learning and Natural Language Processing in Mental Health: Systematic Review.机器学习和自然语言处理在心理健康中的应用：系统综述。

J Med Internet Res. 2021 May 4;23(5):e15708. doi: 10.2196/15708.

Uses of Different Machine Learning Algorithms for Diagnosis of Dental Caries.不同机器学习算法在龋齿诊断中的应用。

J Healthc Eng. 2022 Mar 31;2022:5032435. doi: 10.1155/2022/5032435. eCollection 2022.

Machine learning in oral squamous cell carcinoma: Current status, clinical concerns and prospects for future-A systematic review.机器学习在口腔鳞状细胞癌中的应用：现状、临床关注点及未来展望——系统综述。

Artif Intell Med. 2021 May;115:102060. doi: 10.1016/j.artmed.2021.102060. Epub 2021 Mar 26.

Machine learning in knee arthroplasty: specific data are key-a systematic review.机器学习在膝关节置换术中的应用：特定数据是关键——系统评价。

Knee Surg Sports Traumatol Arthrosc. 2022 Feb;30(2):376-388. doi: 10.1007/s00167-021-06848-6. Epub 2022 Jan 10.

An insight into diagnosis of depression using machine learning techniques: a systematic review.利用机器学习技术进行抑郁症诊断的研究进展：系统综述。

Curr Med Res Opin. 2022 May;38(5):749-771. doi: 10.1080/03007995.2022.2038487. Epub 2022 Feb 17.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

引用本文的文献

Advances in Lipid-Based Nanomedicine: Pathway Specific siRNA Therapy and Optimizing Delivery for Hepatocellular Carcinoma.基于脂质的纳米医学进展：针对肝细胞癌的通路特异性siRNA疗法及优化递送

Int J Nanomedicine. 2025 Aug 30;20:10541-10566. doi: 10.2147/IJN.S532246. eCollection 2025.

Targeting the TRIB3-MYC axis in cancer: mechanistic insights and therapeutic disruption strategies.靶向癌症中的TRIB3-MYC轴：机制见解与治疗性破坏策略

Invest New Drugs. 2025 Sep 4. doi: 10.1007/s10637-025-01582-z.

MGMG: Cell Morphology-Guided Molecule Generation for Drug Discovery.MGMG：用于药物发现的细胞形态学引导分子生成

bioRxiv. 2025 Jul 17:2025.07.11.664424. doi: 10.1101/2025.07.11.664424.

Applications of Artificial Intelligence in Biotech Drug Discovery and Product Development.人工智能在生物技术药物发现与产品开发中的应用。

MedComm (2020). 2025 Jul 30;6(8):e70317. doi: 10.1002/mco2.70317. eCollection 2025 Aug.

Identification of nanomolar adenosine A receptor ligands using reinforcement learning and structure-based drug design.利用强化学习和基于结构的药物设计鉴定纳摩尔级别的腺苷 A 受体配体。

Nat Commun. 2025 Jul 1;16(1):5485. doi: 10.1038/s41467-025-60629-0.

Scaffold Hopping with Generative Reinforcement Learning.基于生成式强化学习的支架跳跃

J Chem Inf Model. 2025 Jul 14;65(13):6513-6525. doi: 10.1021/acs.jcim.5c00029. Epub 2025 Jun 26.

In Silico Validation of AI-Assisted Drugs in Healthcare.医疗保健中人工智能辅助药物的计算机模拟验证

Methods Mol Biol. 2025;2952:445-458. doi: 10.1007/978-1-0716-4690-8_24.

AI-Driven Antimicrobial Peptide Discovery: Mining and Generation.人工智能驱动的抗菌肽发现：挖掘与生成

Acc Chem Res. 2025 Jun 17;58(12):1831-1846. doi: 10.1021/acs.accounts.0c00594. Epub 2025 Jun 3.

The Role of Artificial Intelligence in Drug Discovery and Pharmaceutical Development: A Paradigm Shift in the History of Pharmaceutical Industries.人工智能在药物发现与制药研发中的作用：制药行业历史上的一次范式转变。

AAPS PharmSciTech. 2025 May 14;26(5):133. doi: 10.1208/s12249-025-03134-3.

A Review of the Applications, Benefits, and Challenges of Generative AI for Sustainable Toxicology.生成式人工智能在可持续毒理学中的应用、益处及挑战综述

Curr Res Toxicol. 2025 Apr 21;8:100232. doi: 10.1016/j.crtox.2025.100232. eCollection 2025.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

生成式机器学习在从头药物发现中的应用：系统评价。

Generative machine learning for de novo drug discovery: A systematic review.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献