• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BatGPT-Chem:一种用于化学工程的基础大模型。

BatGPT-Chem: A Foundation Large Model for Chemical Engineering.

作者信息

Yang Yifei, Shi Runhan, Li Zuchao, Jiang Shu, Lu Bao-Liang, Zhao Qibin, Yang Yang, Zhao Hai

机构信息

School of Computer Science, Shanghai Jiao Tong University, Shanghai 200240, China.

Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai Jiao Tong University, Shanghai 200240, China.

出版信息

Research (Wash D C). 2025 Sep 10;8:0827. doi: 10.34133/research.0827. eCollection 2025.

DOI:10.34133/research.0827
PMID:40936797
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12421729/
Abstract

Large language models (LLMs) have showcased remarkable capabilities in the realm of AI for Science, and chemistry has greatly benefited from the advancement of AI tools. With a strong capacity for learning sequential data like natural language, LLMs offer immense potential. Despite this promise, the application of LLMs in chemistry remains limited, with few models specifically designed for chemical data and tasks. Hence, we propose leveraging LLMs to comprehensively model both chemical sequences and natural language sequences, aiming to tackle diverse chemical tasks. We introduce BatGPT-Chem, a general foundation large-scale model with 15 billion parameters tailored for chemical engineering. Built on a corpus of over 100 million chemical instances, BatGPT-Chem specializes in 5 core tasks: retrosynthesis prediction, molecule design, molecule description, product inference, and yield prediction. BatGPT-Chem comprehensively models the information flow between chemical language and natural language, enabling full-spectrum prediction across chemical tasks. It is one of the largest bilingual chemistry-specific LLMs, supporting both English and Chinese for input and output. BatGPT-Chem is also the first automated retrosynthesis tool capable of explicitly predicting reaction conditions, a critical but often overlooked aspect in previous models. Through rigorous zero-shot evaluations, BatGPT-Chem demonstrates state-of-the-art performance, surpassing both existing chemical LLMs and general-purpose models in accuracy and validity across a diverse range of tasks. Notably, it demonstrates superior ability in predicting both reactants and reaction conditions, as well as strong generalization in low-data settings. These results suggest that BatGPT-Chem is among the most advanced and practical chemical LLMs, with strong potential to support real-world applications in synthesis planning, drug discovery, and materials design.

摘要

大语言模型(LLMs)在人工智能用于科学领域展现出了卓越能力,化学领域也因人工智能工具的进步而受益匪浅。由于大语言模型具有强大的学习自然语言等序列数据的能力,因此具有巨大潜力。尽管有此前景,但大语言模型在化学领域的应用仍然有限,专门针对化学数据和任务设计的模型很少。因此,我们建议利用大语言模型对化学序列和自然语言序列进行全面建模,以解决各种化学任务。我们推出了BatGPT-Chem,这是一个为化学工程量身定制的具有150亿参数的通用基础大规模模型。基于超过1亿个化学实例的语料库构建,BatGPT-Chem专注于5个核心任务:逆合成预测、分子设计、分子描述、产物推断和产率预测。BatGPT-Chem全面模拟化学语言和自然语言之间的信息流,能够对各种化学任务进行全谱预测。它是最大的双语化学专用大语言模型之一,并支持中英文输入和输出。BatGPT-Chem也是第一个能够明确预测反应条件的自动化逆合成工具,这是先前模型中一个关键但经常被忽视的方面。通过严格的零样本评估,BatGPT-Chem展示了其在各种任务中的最先进性能,在准确性和有效性方面超过了现有的化学大语言模型和通用模型。值得注意的是,它在预测反应物和反应条件方面表现出卓越能力,以及在低数据环境中的强大泛化能力。这些结果表明,BatGPT-Chem是最先进且实用的化学大语言模型之一,在支持合成规划、药物发现和材料设计等实际应用方面具有强大潜力。

相似文献

1
BatGPT-Chem: A Foundation Large Model for Chemical Engineering.BatGPT-Chem:一种用于化学工程的基础大模型。
Research (Wash D C). 2025 Sep 10;8:0827. doi: 10.34133/research.0827. eCollection 2025.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Using a Diverse Test Suite to Assess Large Language Models on Fast Health Care Interoperability Resources Knowledge: Comparative Analysis.使用多样化测试套件在快速医疗保健互操作性资源知识方面评估大语言模型:比较分析
J Med Internet Res. 2025 Aug 12;27:e73540. doi: 10.2196/73540.
4
The first step is the hardest: pitfalls of representing and tokenizing temporal data for large language models.第一步是最困难的:为大型语言模型表示和标记时间数据的陷阱。
J Am Med Inform Assoc. 2024 Sep 1;31(9):2151-2158. doi: 10.1093/jamia/ocae090.
5
Resource-efficient instruction tuning of large language models for biomedical named entity recognition.用于生物医学命名实体识别的大语言模型的资源高效指令微调
J Biomed Inform. 2025 Aug 21;170:104896. doi: 10.1016/j.jbi.2025.104896.
6
Can open source large language models be used for tumor documentation in Germany?-An evaluation on urological doctors' notes.在德国,开源大语言模型可用于肿瘤记录吗?——对泌尿科医生笔记的评估
BioData Min. 2025 Jul 24;18(1):48. doi: 10.1186/s13040-025-00463-8.
7
Large Language Models and Empathy: Systematic Review.大语言模型与同理心:系统综述
J Med Internet Res. 2024 Dec 11;26:e52597. doi: 10.2196/52597.
8
Leveraging Retrieval-Augmented Large Language Models for Dietary Recommendations With Traditional Chinese Medicine's Medicine Food Homology: Algorithm Development and Validation.利用检索增强大语言模型结合中医药食同源进行饮食推荐:算法开发与验证
JMIR Med Inform. 2025 Aug 21;13:e75279. doi: 10.2196/75279.
9
Evaluating the Reasoning Capabilities of Large Language Models for Medical Coding and Hospital Readmission Risk Stratification: Zero-Shot Prompting Approach.评估大型语言模型在医学编码和医院再入院风险分层方面的推理能力:零样本提示方法。
J Med Internet Res. 2025 Jul 30;27:e74142. doi: 10.2196/74142.
10
Enhancing Pulmonary Disease Prediction Using Large Language Models With Feature Summarization and Hybrid Retrieval-Augmented Generation: Multicenter Methodological Study Based on Radiology Report.使用具有特征总结和混合检索增强生成功能的大语言模型增强肺部疾病预测:基于放射学报告的多中心方法学研究
J Med Internet Res. 2025 Jun 11;27:e72638. doi: 10.2196/72638.

本文引用的文献

1
Molecular Merged Hypergraph Neural Network for Explainable Solvation Gibbs Free Energy Prediction.用于可解释溶剂化吉布斯自由能预测的分子合并超图神经网络
Research (Wash D C). 2025 Aug 15;8:0740. doi: 10.34133/research.0740. eCollection 2025.
2
Clc-db: an open-source online database of chiral ligands and catalysts.Clc-db:一个手性配体和催化剂的开源在线数据库。
J Cheminform. 2025 Apr 3;17(1):45. doi: 10.1186/s13321-025-00991-9.
3
When Large Language Models Meet Evolutionary Algorithms: Potential Enhancements and Challenges.当大语言模型遇上进化算法:潜在的提升与挑战
Research (Wash D C). 2025 Mar 27;8:0646. doi: 10.34133/research.0646. eCollection 2025.
4
Deep Learning for Predicting Biomolecular Binding Sites of Proteins.用于预测蛋白质生物分子结合位点的深度学习
Research (Wash D C). 2025 Feb 24;8:0615. doi: 10.34133/research.0615. eCollection 2025.
5
The symmetric division Szeged index: A novel tool for predicting physical and chemical properties of complex networks.对称划分塞格德指数:预测复杂网络物理和化学性质的新工具。
Heliyon. 2025 Jan 27;11(3):e42280. doi: 10.1016/j.heliyon.2025.e42280. eCollection 2025 Feb 15.
6
Predicting enthalpy of formation of benzenoid hydrocarbons and ordering molecular trees using general multiplicative Zagreb indices.使用广义乘法 Zagreb 指数预测苯型烃的生成焓并对分子树进行排序。
Heliyon. 2024 May 15;10(10):e30913. doi: 10.1016/j.heliyon.2024.e30913. eCollection 2024 May 30.
7
Application of Transformers in Cheminformatics.Transformer 在化学信息学中的应用。
J Chem Inf Model. 2024 Jun 10;64(11):4392-4409. doi: 10.1021/acs.jcim.3c02070. Epub 2024 May 30.
8
Augmenting large language models with chemistry tools.用化学工具增强大语言模型。
Nat Mach Intell. 2024;6(5):525-535. doi: 10.1038/s42256-024-00832-8. Epub 2024 May 8.
9
Prediction of chemical reaction yields with large-scale multi-view pre-training.基于大规模多视图预训练的化学反应产率预测
J Cheminform. 2024 Feb 25;16(1):22. doi: 10.1186/s13321-024-00815-2.
10
Enhancing chemical synthesis: a two-stage deep neural network for predicting feasible reaction conditions.增强化学合成:用于预测可行反应条件的两阶段深度神经网络。
J Cheminform. 2024 Jan 24;16(1):11. doi: 10.1186/s13321-024-00805-4.