• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合机器学习与大语言模型以推动电化学反应探索

Integrating Machine Learning and Large Language Models to Advance Exploration of Electrochemical Reactions.

作者信息

Zheng Zhiling, Florit Federico, Jin Brooke, Wu Haoyang, Li Shih-Cheng, Nandiwale Kakasaheb Y, Salazar Chase A, Mustakis Jason G, Green William H, Jensen Klavs F

机构信息

Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02142, United States.

Chemical Research & Development, Pfizer Worldwide Research and Development, Groton, CT 06340, United States.

出版信息

Angew Chem Int Ed Engl. 2025 Feb 3;64(6):e202418074. doi: 10.1002/anie.202418074. Epub 2024 Dec 18.

DOI:10.1002/anie.202418074
PMID:39625837
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11795713/
Abstract

Electrochemical C-H oxidation reactions offer a sustainable route to functionalize hydrocarbons, yet identifying suitable substrates and optimizing synthesis remain challenging. Here, we report an integrated approach combining machine learning and large language models to streamline the exploration of electrochemical C-H oxidation reactions. Utilizing a batch rapid screening electrochemical platform, we evaluated a wide range of reactions, initially classifying substrates by their reactivity, while LLMs text-mined literature data to augment the training set. The resulting ML models for reactivity prediction achieved high accuracy (>90 %) and enabled virtual screening of a large set of commercially available molecules. To optimize reaction conditions for selected substrates, LLMs were prompted to generate code that iteratively improved yields. This human-AI collaboration proved effective, efficiently identifying high-yield conditions for 8 drug-like substances or intermediates. Notably, we benchmarked the accuracy and reliability of 12 different LLMs-including LLaMA series, Claude series, OpenAI o1, and GPT-4-on code generation and function calling related to ML based on natural language prompts given by chemists to showcase potentials for accelerating research across four diverse tasks. In addition, we collected an experimental benchmark dataset comprising 1071 reaction conditions and yields for electrochemical C-H oxidation reactions.

摘要

电化学C-H氧化反应为烃类官能团化提供了一条可持续的途径,但确定合适的底物并优化合成仍然具有挑战性。在此,我们报告了一种结合机器学习和大语言模型的综合方法,以简化电化学C-H氧化反应的探索。利用批量快速筛选电化学平台,我们评估了广泛的反应,最初根据底物的反应性对其进行分类,同时大语言模型对文献数据进行文本挖掘以扩充训练集。由此产生的用于反应性预测的机器学习模型实现了高精度(>90%),并能够对大量市售分子进行虚拟筛选。为了优化选定底物的反应条件,我们促使大语言模型生成能够迭代提高产率的代码。这种人机协作被证明是有效的,能够高效地为8种类药物物质或中间体确定高产率条件。值得注意的是,我们基于化学家给出的自然语言提示,对12种不同的大语言模型(包括LLaMA系列、Claude系列、OpenAI o1和GPT-4)在与机器学习相关的代码生成和函数调用方面的准确性和可靠性进行了基准测试,以展示其在加速四项不同任务研究方面的潜力。此外,我们收集了一个包含1071个电化学C-H氧化反应条件和产率的实验基准数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1655/11795713/615f0a468418/ANIE-64-e202418074-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1655/11795713/aa7b3708a090/ANIE-64-e202418074-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1655/11795713/a031221ad8c4/ANIE-64-e202418074-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1655/11795713/3df1213d96f8/ANIE-64-e202418074-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1655/11795713/615f0a468418/ANIE-64-e202418074-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1655/11795713/aa7b3708a090/ANIE-64-e202418074-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1655/11795713/a031221ad8c4/ANIE-64-e202418074-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1655/11795713/3df1213d96f8/ANIE-64-e202418074-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1655/11795713/615f0a468418/ANIE-64-e202418074-g005.jpg

相似文献

1
Integrating Machine Learning and Large Language Models to Advance Exploration of Electrochemical Reactions.整合机器学习与大语言模型以推动电化学反应探索
Angew Chem Int Ed Engl. 2025 Feb 3;64(6):e202418074. doi: 10.1002/anie.202418074. Epub 2024 Dec 18.
2
An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing: Algorithm Development and Validation Study.零样本临床自然语言处理中大型语言模型提示策略的实证评估:算法开发与验证研究
JMIR Med Inform. 2024 Apr 8;12:e55318. doi: 10.2196/55318.
3
Learning to Make Rare and Complex Diagnoses With Generative AI Assistance: Qualitative Study of Popular Large Language Models.利用生成式人工智能辅助学习罕见且复杂的诊断:对流行的大型语言模型的定性研究。
JMIR Med Educ. 2024 Feb 13;10:e51391. doi: 10.2196/51391.
4
Programming Chatbots Using Natural Language: Generating Cervical Spine MRI Impressions.使用自然语言编程聊天机器人:生成颈椎MRI影像报告
Cureus. 2024 Sep 14;16(9):e69410. doi: 10.7759/cureus.69410. eCollection 2024 Sep.
5
Emotional prompting amplifies disinformation generation in AI large language models.情感提示会放大人工智能大语言模型中的虚假信息生成。
Front Artif Intell. 2025 Apr 7;8:1543603. doi: 10.3389/frai.2025.1543603. eCollection 2025.
6
Evaluating Large Language Models for the National Premedical Exam in India: Comparative Analysis of GPT-3.5, GPT-4, and Bard.评估印度全国医预考用大型语言模型:GPT-3.5、GPT-4 和 Bard 的比较分析。
JMIR Med Educ. 2024 Feb 21;10:e51523. doi: 10.2196/51523.
7
GPT for RCTs? Using AI to determine adherence to clinical trial reporting guidelines.用于随机对照试验的GPT?利用人工智能确定对临床试验报告指南的遵循情况。
BMJ Open. 2025 Mar 18;15(3):e088735. doi: 10.1136/bmjopen-2024-088735.
8
Accuracy of latest large language models in answering multiple choice questions in dentistry: A comparative study.最新大语言模型在回答牙科多项选择题方面的准确性:一项比较研究。
PLoS One. 2025 Jan 29;20(1):e0317423. doi: 10.1371/journal.pone.0317423. eCollection 2025.
9
Can large language models understand molecules?大语言模型能理解分子吗?
BMC Bioinformatics. 2024 Jun 26;25(1):225. doi: 10.1186/s12859-024-05847-x.
10
Current safeguards, risk mitigation, and transparency measures of large language models against the generation of health disinformation: repeated cross sectional analysis.大型语言模型防范生成健康类虚假信息的现行保障措施、风险缓解措施和透明度措施:重复横断面分析。
BMJ. 2024 Mar 20;384:e078538. doi: 10.1136/bmj-2023-078538.

引用本文的文献

1
AI Approaches to Homogeneous Catalysis with Transition Metal Complexes.过渡金属配合物均相催化的人工智能方法
ACS Catal. 2025 May 14;15(11):9089-9105. doi: 10.1021/acscatal.5c01202. eCollection 2025 Jun 6.
2
Unraveling the Complexity of Divalent Hydride Electrolytes in Solid-State Batteries via a Data-Driven Framework with Large Language Model.通过基于大语言模型的数据驱动框架揭示固态电池中二价氢化物电解质的复杂性
Angew Chem Int Ed Engl. 2025 Jun 17;64(25):e202506573. doi: 10.1002/anie.202506573. Epub 2025 Apr 25.

本文引用的文献

1
An automatic end-to-end chemical synthesis development platform powered by large language models.基于大型语言模型的自动化端到端化学合成开发平台。
Nat Commun. 2024 Nov 23;15(1):10160. doi: 10.1038/s41467-024-54457-x.
2
Fine-tuning large language models for chemical text mining.针对化学文本挖掘对大语言模型进行微调。
Chem Sci. 2024 Jun 7;15(27):10600-10611. doi: 10.1039/d4sc00924j. eCollection 2024 Jul 10.
3
Augmenting large language models with chemistry tools.用化学工具增强大语言模型。
Nat Mach Intell. 2024;6(5):525-535. doi: 10.1038/s42256-024-00832-8. Epub 2024 May 8.
4
Autonomous, multiproperty-driven molecular discovery: From predictions to measurements and back.自主的、多属性驱动的分子发现:从预测到测量再回归。
Science. 2023 Dec 22;382(6677):eadi1407. doi: 10.1126/science.adi1407.
5
Autonomous chemical research with large language models.大语言模型驱动的自主化学研究。
Nature. 2023 Dec;624(7992):570-578. doi: 10.1038/s41586-023-06792-0. Epub 2023 Dec 20.
6
ChatGPT Research Group for Optimizing the Crystallinity of MOFs and COFs.用于优化金属有机框架材料(MOFs)和共价有机框架材料(COFs)结晶度的ChatGPT研究小组。
ACS Cent Sci. 2023 Nov 10;9(11):2161-2170. doi: 10.1021/acscentsci.3c01087. eCollection 2023 Nov 22.
7
A GPT-4 Reticular Chemist for Guiding MOF Discovery.用于指导金属有机框架材料发现的GPT-4网状化学家。
Angew Chem Int Ed Engl. 2023 Nov 13;62(46):e202311983. doi: 10.1002/anie.202311983. Epub 2023 Oct 13.
8
ChatGPT Chemistry Assistant for Text Mining and the Prediction of MOF Synthesis.用于文本挖掘和金属有机框架合成预测的ChatGPT化学助手
J Am Chem Soc. 2023 Aug 16;145(32):18048-18062. doi: 10.1021/jacs.3c05819. Epub 2023 Aug 7.
9
Assessment of chemistry knowledge in large language models that generate code.对生成代码的大语言模型中的化学知识评估。
Digit Discov. 2023 Jan 26;2(2):368-376. doi: 10.1039/d2dd00087c. eCollection 2023 Apr 11.
10
Machine-Learning-Guided Discovery of Electrochemical Reactions.机器学习指导下的电化学反应发现。
J Am Chem Soc. 2022 Dec 14;144(49):22599-22610. doi: 10.1021/jacs.2c08997. Epub 2022 Dec 2.