• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

小分子设计机器学习的实验者指南。

The Experimentalist's Guide to Machine Learning for Small Molecule Design.

机构信息

Department of Bioengineering, University of Illinois, Urbana-Champaign, Illinois 61801, United States.

Department of Chemical and Biomolecular Engineering, University of Illinois, Urbana-Champaign, Illinois 61801, United States.

出版信息

ACS Appl Bio Mater. 2024 Feb 19;7(2):657-684. doi: 10.1021/acsabm.3c00054. Epub 2023 Aug 3.

DOI:10.1021/acsabm.3c00054
PMID:37535819
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10880109/
Abstract

Initially part of the field of artificial intelligence, machine learning (ML) has become a booming research area since branching out into its own field in the 1990s. After three decades of refinement, ML algorithms have accelerated scientific developments across a variety of research topics. The field of small molecule design is no exception, and an increasing number of researchers are applying ML techniques in their pursuit of discovering, generating, and optimizing small molecule compounds. The goal of this review is to provide simple, yet descriptive, explanations of some of the most commonly utilized ML algorithms in the field of small molecule design along with those that are highly applicable to an experimentally focused audience. The algorithms discussed here span across three ML paradigms: supervised learning, unsupervised learning, and ensemble methods. Examples from the published literature will be provided for each algorithm. Some common pitfalls of applying ML to biological and chemical data sets will also be explained, alongside a brief summary of a few more advanced paradigms, including reinforcement learning and semi-supervised learning.

摘要

最初是人工智能领域的一部分,机器学习(ML)自 20 世纪 90 年代发展成为一个独立的领域以来,已经成为一个蓬勃发展的研究领域。经过三十年的完善,机器学习算法加速了各个研究课题的科学发展。小分子设计领域也不例外,越来越多的研究人员在探索、生成和优化小分子化合物的过程中应用机器学习技术。本文的目的是提供一些在小分子设计领域中最常用的机器学习算法的简单但描述性的解释,以及那些非常适用于实验性受众的算法。这里讨论的算法涵盖了三种机器学习范式:监督学习、无监督学习和集成方法。将为每个算法提供来自已发表文献的示例。还将解释将机器学习应用于生物和化学数据集的一些常见陷阱,并简要介绍包括强化学习和半监督学习在内的几个更高级的范例。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/8486de03f1a1/mt3c00054_0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/0ac407cb09e1/mt3c00054_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/4ef7cb9132a8/mt3c00054_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/996ba83ab3ac/mt3c00054_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/bdff08ed7585/mt3c00054_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/d28f1a44e53f/mt3c00054_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/4b8c02aa5796/mt3c00054_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/131d695745e5/mt3c00054_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/c9da36aca8d1/mt3c00054_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/78224ddd8f05/mt3c00054_0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/b879d4ff42b4/mt3c00054_0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/927c66779758/mt3c00054_0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/2b48414c2863/mt3c00054_0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/8486de03f1a1/mt3c00054_0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/0ac407cb09e1/mt3c00054_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/4ef7cb9132a8/mt3c00054_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/996ba83ab3ac/mt3c00054_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/bdff08ed7585/mt3c00054_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/d28f1a44e53f/mt3c00054_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/4b8c02aa5796/mt3c00054_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/131d695745e5/mt3c00054_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/c9da36aca8d1/mt3c00054_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/78224ddd8f05/mt3c00054_0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/b879d4ff42b4/mt3c00054_0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/927c66779758/mt3c00054_0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/2b48414c2863/mt3c00054_0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2843/10880109/8486de03f1a1/mt3c00054_0013.jpg

相似文献

1
The Experimentalist's Guide to Machine Learning for Small Molecule Design.小分子设计机器学习的实验者指南。
ACS Appl Bio Mater. 2024 Feb 19;7(2):657-684. doi: 10.1021/acsabm.3c00054. Epub 2023 Aug 3.
2
Artificial intelligence in spine care: current applications and future utility.人工智能在脊柱护理中的应用:当前的应用和未来的效用。
Eur Spine J. 2022 Aug;31(8):2057-2081. doi: 10.1007/s00586-022-07176-0. Epub 2022 Mar 27.
3
Artificial intelligence and machine learning overview in pathology & laboratory medicine: A general review of data preprocessing and basic supervised concepts.病理学与检验医学中的人工智能和机器学习概述:数据预处理及基本监督概念的综合回顾
Semin Diagn Pathol. 2023 Mar;40(2):71-87. doi: 10.1053/j.semdp.2023.02.002. Epub 2023 Feb 16.
4
Artificial intelligence and machine learning: Definition of terms and current concepts in critical care research.人工智能与机器学习:重症监护研究中的术语定义及当前概念
J Crit Care. 2024 Aug;82:154792. doi: 10.1016/j.jcrc.2024.154792. Epub 2024 Mar 29.
5
A Comprehensive Review on Machine Learning in Healthcare Industry: Classification, Restrictions, Opportunities and Challenges.机器学习在医疗行业的全面综述:分类、限制、机遇和挑战。
Sensors (Basel). 2023 Apr 22;23(9):4178. doi: 10.3390/s23094178.
6
Artificial Intelligence in Cardiology.人工智能在心脏病学中的应用。
J Am Coll Cardiol. 2018 Jun 12;71(23):2668-2679. doi: 10.1016/j.jacc.2018.03.521.
7
Conceptual Structure and Current Trends in Artificial Intelligence, Machine Learning, and Deep Learning Research in Sports: A Bibliometric Review.体育领域人工智能、机器学习和深度学习研究的概念结构和当前趋势:文献计量学综述。
Int J Environ Res Public Health. 2022 Dec 22;20(1):173. doi: 10.3390/ijerph20010173.
8
Artificial Intelligence and Machine Learning for Improving Glycemic Control in Diabetes: Best Practices, Pitfalls, and Opportunities.人工智能和机器学习在糖尿病血糖控制中的应用:最佳实践、陷阱和机遇。
IEEE Rev Biomed Eng. 2024;17:19-41. doi: 10.1109/RBME.2023.3331297. Epub 2024 Jan 12.
9
SemiBoost: boosting for semi-supervised learning.半增强算法:用于半监督学习的增强算法
IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2000-14. doi: 10.1109/TPAMI.2008.235.
10
Machine learning in health care and laboratory medicine: General overview of supervised learning and Auto-ML.医疗保健与检验医学中的机器学习:监督学习和自动机器学习概述
Int J Lab Hematol. 2021 Jul;43 Suppl 1:15-22. doi: 10.1111/ijlh.13537.

引用本文的文献

1
Database of Nonaqueous Proton-Conducting Materials.非水电质子传导材料数据库
ACS Appl Mater Interfaces. 2025 Mar 19;17(11):16901-16908. doi: 10.1021/acsami.4c22618. Epub 2025 Mar 9.
2
Testing the predictive power of reverse screening to infer drug targets, with the help of machine learning.借助机器学习测试反向筛选推断药物靶点的预测能力。
Commun Chem. 2024 May 9;7(1):105. doi: 10.1038/s42004-024-01179-2.
3
An ensemble machine learning model generates a focused screening library for the identification of CDK8 inhibitors.

本文引用的文献

1
Machine Learning Guided Design of High-Affinity ACE2 Decoys for SARS-CoV-2 Neutralization.机器学习指导设计高亲和力 ACE2 诱饵以中和 SARS-CoV-2。
J Phys Chem B. 2023 Mar 9;127(9):1995-2001. doi: 10.1021/acs.jpcb.3c00469. Epub 2023 Feb 24.
2
TAIGET: A small-molecule target identification and annotation web server.TAIGET:一个小分子靶点识别与注释网络服务器。
Front Pharmacol. 2022 Aug 29;13:898519. doi: 10.3389/fphar.2022.898519. eCollection 2022.
3
Recent Advances in Machine Learning Variant Effect Prediction Tools for Protein Engineering.
基于集成机器学习模型生成靶向筛选文库,用于鉴定 CDK8 抑制剂。
Protein Sci. 2024 Jun;33(6):e5007. doi: 10.1002/pro.5007.
蛋白质工程机器学习变体效应预测工具的最新进展
Ind Eng Chem Res. 2022 May 18;61(19):6235-6245. doi: 10.1021/acs.iecr.1c04943. Epub 2022 Apr 6.
4
Hierarchical Clustering and Target-Independent QSAR for Antileishmanial Oxazole and Oxadiazole Derivatives.层次聚类和无靶定量构效关系分析抗利什曼原虫唑和噁二唑衍生物。
Int J Mol Sci. 2022 Aug 10;23(16):8898. doi: 10.3390/ijms23168898.
5
Aglow: A Fluorescence Assay and Machine Learning Model to Identify Inhibitors of Intracellular Infection.发光法:一种用于鉴定细胞内感染抑制剂的荧光检测法和机器学习模型。
ACS Infect Dis. 2022 Jul 8;8(7):1280-1290. doi: 10.1021/acsinfecdis.2c00014. Epub 2022 Jun 24.
6
Merging data curation and machine learning to improve nanomedicines.将数据管理和机器学习相结合,以改善纳米医学。
Adv Drug Deliv Rev. 2022 Apr;183:114172. doi: 10.1016/j.addr.2022.114172. Epub 2022 Feb 18.
7
Predicting the Activities of Drug Excipients on Biological Targets using One-Shot Learning.基于单次学习预测药物辅料对生物靶点的活性。
J Phys Chem B. 2022 Feb 24;126(7):1492-1503. doi: 10.1021/acs.jpcb.1c10574. Epub 2022 Feb 10.
8
Machine learning approaches to optimize small-molecule inhibitors for RNA targeting.用于优化靶向RNA的小分子抑制剂的机器学习方法。
J Cheminform. 2022 Feb 2;14(1):4. doi: 10.1186/s13321-022-00583-x.
9
Exploring sodium glucose cotransporter (SGLT2) inhibitors with machine learning approach: A novel hope in anti-diabetes drug discovery.采用机器学习方法探索钠-葡萄糖协同转运蛋白 2(SGLT2)抑制剂:抗糖尿病药物发现的新希望。
J Mol Graph Model. 2022 Mar;111:108106. doi: 10.1016/j.jmgm.2021.108106. Epub 2021 Dec 10.
10
Automated metabolic assignment: Semi-supervised learning in metabolic analysis employing two dimensional Nuclear Magnetic Resonance (NMR).自动代谢物归属:采用二维核磁共振(NMR)的代谢分析中的半监督学习
Comput Struct Biotechnol J. 2021 Aug 31;19:5047-5058. doi: 10.1016/j.csbj.2021.08.048. eCollection 2021.