Yang Yifei, Shi Runhan, Li Zuchao, Jiang Shu, Lu Bao-Liang, Zhao Qibin, Yang Yang, Zhao Hai
School of Computer Science, Shanghai Jiao Tong University, Shanghai 200240, China.
Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai Jiao Tong University, Shanghai 200240, China.
Research (Wash D C). 2025 Sep 10;8:0827. doi: 10.34133/research.0827. eCollection 2025.
Large language models (LLMs) have showcased remarkable capabilities in AI for Science, and chemistry has benefited greatly from the advancement of AI tools. With a strong capacity for learning sequential data such as natural language, LLMs offer immense potential. Despite this promise, the application of LLMs in chemistry remains limited, with few models designed specifically for chemical data and tasks. We therefore propose leveraging LLMs to comprehensively model both chemical sequences and natural-language sequences, aiming to tackle diverse chemical tasks. We introduce BatGPT-Chem, a large-scale general foundation model with 15 billion parameters tailored for chemical engineering. Built on a corpus of over 100 million chemical instances, BatGPT-Chem specializes in 5 core tasks: retrosynthesis prediction, molecule design, molecule description, product inference, and yield prediction. BatGPT-Chem comprehensively models the information flow between chemical language and natural language, enabling full-spectrum prediction across chemical tasks. It is one of the largest bilingual chemistry-specific LLMs, supporting both English and Chinese input and output. BatGPT-Chem is also the first automated retrosynthesis tool that explicitly predicts reaction conditions, a critical but often overlooked aspect in previous models. In rigorous zero-shot evaluations, BatGPT-Chem achieves state-of-the-art performance, surpassing both existing chemical LLMs and general-purpose models in accuracy and validity across a diverse range of tasks. Notably, it shows superior ability in predicting both reactants and reaction conditions, as well as strong generalization in low-data settings. These results suggest that BatGPT-Chem is among the most advanced and practical chemical LLMs, with strong potential to support real-world applications in synthesis planning, drug discovery, and materials design.
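As a rough illustration of how a prompt-driven chemistry LLM of this kind might be queried, the sketch below loads a causal language model with the Hugging Face transformers library and asks for a retrosynthesis together with reaction conditions for a target molecule given as SMILES. The checkpoint identifier, prompt wording, and generation settings are illustrative assumptions, not the authors' published interface.

```python
# Minimal sketch of querying a chemistry LLM for retrosynthesis with conditions.
# The checkpoint name "BatGPT-Chem" and the prompt format below are assumptions
# made for illustration only; consult the model's official release for the
# actual identifier and prompting scheme.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "BatGPT-Chem"  # hypothetical checkpoint identifier
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)

# Ask for reactants and explicit reaction conditions for a target product.
prompt = (
    "Propose a retrosynthesis for the product below and state the reaction "
    "conditions (reagents, solvent, temperature).\n"
    "Product SMILES: CC(=O)Oc1ccccc1C(=O)O\n"
    "Answer:"
)

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=False)

# Decode only the newly generated tokens, skipping the echoed prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:],
                       skip_special_tokens=True))
```

In this sketch the prompt is plain natural language plus a SMILES string, mirroring the paper's framing of chemistry tasks as bilingual sequence-to-sequence generation; a Chinese-language prompt would be handled the same way by a bilingual checkpoint.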