Reasoning Language Model as Rule Finder: A Case Study on C-H Bond Activation Using 2D Metal-Organic Frameworks

Authors

Lin He, Cui Xiaoqi, Dai Binglin, Chen Jiawei, Su Pengkun, Su Zhaomin, Hu Huihui, Jiang Yibin, Wang Cheng

Affiliations

iChem, State Key Laboratory of Physical Chemistry of Solid Surfaces, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, P. R. China.

Jiangsu Key Laboratory for Science and Applications of Molecular Ferroelectrics, Southeast University, Nanjing 211189, P. R. China.

Publication information

ACS Cent Sci. 2025 Jun 13;11(7):1135-1146. doi: 10.1021/acscentsci.5c00561. eCollection 2025 Jul 23.

DOI:10.1021/acscentsci.5c00561

PMID:40726799

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC12291106/

Abstract

Unraveling the structure-activity relationship in catalysis requires interpretable models that can extract governing principles from complex data sets. This study explores reasoning large language models (LLMs) as rule-finders for predicting C(sp³)-H activation outcomes catalyzed by 2D Fe-terpyridine MOFs. Surface modifications with molecular modifiers systematically modulate the catalytic microenvironment, but linking modifier structure to activity remains challenging. While traditional descriptors offer high predictive accuracy, LLM-derived rules provide interpretable insights. Integrating LLM reasoning with experimental features (e.g., Fe loading, modifier ratios) identified para-substituted benzoates with electron-withdrawing or coordinating groups as performance boosters. Validated by machine learning, this rule achieved 82.6% prediction accuracy. Notably, the coordinating group can become electron-withdrawing upon Fe coordination or protonation. The LLM revealed that modifiers tune the catalyst's electronic state rather than directly interacting with intermediates or transition states, bridging data-driven predictions with mechanistic understanding. This highlights the potential of LLMs to derive chemically meaningful rules in catalysis.
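To make the stated rule concrete, the sketch below encodes "para-substituted benzoate with an electron-withdrawing or coordinating group" as a boolean feature and scores it against labelled examples. It is only an illustration: the `Modifier` record, the substituent sets, and the function names are hypothetical and are not taken from the paper, whose 82.6% figure comes from its own machine-learning validation with additional experimental features.

```python
from dataclasses import dataclass

# Illustrative substituent classes. These sets are assumptions for the
# sketch, not the modifier library used in the study.
ELECTRON_WITHDRAWING = {"NO2", "CN", "CF3"}
COORDINATING = {"NH2", "pyridyl", "SH"}


@dataclass(frozen=True)
class Modifier:
    core: str         # e.g. "benzoate"
    position: str     # "ortho", "meta", or "para"
    substituent: str  # e.g. "NO2"


def predicted_booster(m: Modifier) -> bool:
    """Return True if the modifier matches the rule stated in the abstract."""
    return (
        m.core == "benzoate"
        and m.position == "para"
        and m.substituent in (ELECTRON_WITHDRAWING | COORDINATING)
    )


def rule_accuracy(labelled) -> float:
    """Fraction of (modifier, observed_boost) pairs the rule classifies correctly."""
    hits = sum(predicted_booster(m) == observed for m, observed in labelled)
    return hits / len(labelled)
```

In practice such a boolean would be one interpretable column alongside measured features such as Fe loading and modifier ratio, rather than a standalone predictor.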



