• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过大语言模型准确预测3D晶体结构的可合成性和前体。

Accurate prediction of synthesizability and precursors of 3D crystal structures via large language models.

作者信息

Song Zhilong, Lu Shuaihua, Ju Minggang, Zhou Qionghua, Wang Jinlan

机构信息

Key Laboratory of Quantum Materials and Devices of Ministry of Education, School of Physics, Southeast University, Nanjing, China.

Suzhou Laboratory, Suzhou, China.

出版信息

Nat Commun. 2025 Jul 15;16(1):6530. doi: 10.1038/s41467-025-61778-y.

DOI:10.1038/s41467-025-61778-y
PMID:40664632
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12264133/
Abstract

Accessing the synthesizability of crystal structures is crucial for transforming theoretical materials into real-world applications. Nevertheless, there is a significant gap between actual synthesizability and thermodynamic or kinetic stability commonly used to screen synthesizable structures. Herein, we develop the Crystal Synthesis Large Language Models (CSLLM) framework, which utilizes three specialized LLMs to predict the synthesizability of arbitrary 3D crystal structures, possible synthetic methods, and suitable precursors, respectively. We construct a comprehensive dataset including synthesizable/non-synthesizable crystal structures and develop an efficient text representation for crystal structures to fine-tune LLMs. Our Synthesizability LLM achieves state-of-the-art accuracy (98.6%), significantly outperforming traditional synthesizability screening based on thermodynamic and kinetic stability. Its outstanding generalization ability is further demonstrated in experimental structures with complexity considerably exceeding that of the training data. Furthermore, both the Method and Precursor LLMs exceed 90% accuracy in classifying possible synthetic methods and identifying solid-state synthetic precursors for common binary and ternary compounds, respectively. Leveraging CSLLM, tens of thousands of synthesizable theoretical structures are successfully identified, with their 23 key properties predicted using accurate graph neural network models.

摘要

评估晶体结构的可合成性对于将理论材料转化为实际应用至关重要。然而,实际可合成性与通常用于筛选可合成结构的热力学或动力学稳定性之间存在显著差距。在此,我们开发了晶体合成大语言模型(CSLLM)框架,该框架利用三个专门的大语言模型分别预测任意三维晶体结构的可合成性、可能的合成方法和合适的前驱体。我们构建了一个包含可合成/不可合成晶体结构的综合数据集,并开发了一种用于晶体结构的高效文本表示方法来微调大语言模型。我们的可合成性大语言模型达到了目前最优的准确率(98.6%),显著优于基于热力学和动力学稳定性的传统可合成性筛选方法。在复杂度远超训练数据的实验结构中,其出色的泛化能力得到了进一步证明。此外,方法大语言模型和前驱体大语言模型在分别对常见二元和三元化合物的可能合成方法进行分类以及识别固态合成前驱体方面的准确率均超过90%。利用CSLLM,成功识别出了数以万计的可合成理论结构,并使用精确的图神经网络模型预测了它们的23个关键属性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ebe/12264133/a0abab584d37/41467_2025_61778_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ebe/12264133/aad7bf0dd8a4/41467_2025_61778_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ebe/12264133/f14b8dc105be/41467_2025_61778_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ebe/12264133/83e02a770145/41467_2025_61778_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ebe/12264133/4c9356209572/41467_2025_61778_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ebe/12264133/a0abab584d37/41467_2025_61778_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ebe/12264133/aad7bf0dd8a4/41467_2025_61778_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ebe/12264133/f14b8dc105be/41467_2025_61778_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ebe/12264133/83e02a770145/41467_2025_61778_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ebe/12264133/4c9356209572/41467_2025_61778_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ebe/12264133/a0abab584d37/41467_2025_61778_Fig5_HTML.jpg

相似文献

1
Accurate prediction of synthesizability and precursors of 3D crystal structures via large language models.通过大语言模型准确预测3D晶体结构的可合成性和前体。
Nat Commun. 2025 Jul 15;16(1):6530. doi: 10.1038/s41467-025-61778-y.
2
Evaluating and Improving Syndrome Differentiation Thinking Ability in Large Language Models: Method Development Study.评估和提高大语言模型中的辨证思维能力:方法开发研究
JMIR Med Inform. 2025 Jun 20;13:e75103. doi: 10.2196/75103.
3
Use of Large Language Models to Classify Epidemiological Characteristics in Synthetic and Real-World Social Media Posts About Conjunctivitis Outbreaks: Infodemiology Study.利用大语言模型对合成及真实世界社交媒体上有关结膜炎爆发的帖子中的流行病学特征进行分类:信息流行病学研究
J Med Internet Res. 2025 Jul 2;27:e65226. doi: 10.2196/65226.
4
Implementing Large Language Models in Health Care: Clinician-Focused Review With Interactive Guideline.在医疗保健中应用大语言模型:以临床医生为重点的回顾与交互式指南
J Med Internet Res. 2025 Jul 11;27:e71916. doi: 10.2196/71916.
5
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
6
Enhancing Pulmonary Disease Prediction Using Large Language Models With Feature Summarization and Hybrid Retrieval-Augmented Generation: Multicenter Methodological Study Based on Radiology Report.使用具有特征总结和混合检索增强生成功能的大语言模型增强肺部疾病预测:基于放射学报告的多中心方法学研究
J Med Internet Res. 2025 Jun 11;27:e72638. doi: 10.2196/72638.
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
Non-orthogonal kV imaging guided patient position verification in non-coplanar radiation therapy with dataset-free implicit neural representation.在无数据集隐式神经表示的非共面放射治疗中,基于非正交千伏成像的患者体位验证
Med Phys. 2025 May 19. doi: 10.1002/mp.17885.
9
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
10
Short-Term Memory Impairment短期记忆障碍

本文引用的文献

1
A generative model for inorganic materials design.一种用于无机材料设计的生成模型。
Nature. 2025 Mar;639(8055):624-632. doi: 10.1038/s41586-025-08628-5. Epub 2025 Jan 16.
2
Crystal structure generation with autoregressive large language modeling.基于自回归大语言模型的晶体结构生成
Nat Commun. 2024 Dec 6;15(1):10570. doi: 10.1038/s41467-024-54639-7.
3
Machine Learning-Aided Design of Highly Conductive Anion Exchange Membranes for Fuel Cells and Water Electrolyzers.用于燃料电池和水电解槽的高导电性阴离子交换膜的机器学习辅助设计
Adv Mater. 2024 Sep;36(36):e2404981. doi: 10.1002/adma.202404981. Epub 2024 Jul 29.
4
Large Language Models for Inorganic Synthesis Predictions.用于无机合成预测的大语言模型
J Am Chem Soc. 2024 Jul 24;146(29):19654-19659. doi: 10.1021/jacs.4c05840. Epub 2024 Jul 11.
5
From bulk effective mass to 2D carrier mobility accurate prediction via adversarial transfer learning.通过对抗性迁移学习实现从体有效质量到二维载流子迁移率的精确预测。
Nat Commun. 2024 Jun 25;15(1):5391. doi: 10.1038/s41467-024-49686-z.
6
Augmenting large language models with chemistry tools.用化学工具增强大语言模型。
Nat Mach Intell. 2024;6(5):525-535. doi: 10.1038/s42256-024-00832-8. Epub 2024 May 8.
7
Universal machine learning aided synthesis approach of two-dimensional perovskites in a typical laboratory.典型实验室中二维钙钛矿的通用机器学习辅助合成方法。
Nat Commun. 2024 Jan 2;15(1):138. doi: 10.1038/s41467-023-44236-5.
8
Autonomous chemical research with large language models.大语言模型驱动的自主化学研究。
Nature. 2023 Dec;624(7992):570-578. doi: 10.1038/s41586-023-06792-0. Epub 2023 Dec 20.
9
Machine-Learning-Driven G-Quartet-Based Circularly Polarized Luminescence Materials.基于机器学习驱动的G-四链体圆偏振发光材料。
Adv Mater. 2024 Jan;36(4):e2310455. doi: 10.1002/adma.202310455. Epub 2023 Nov 27.
10
A Machine Learning Study on High Thermal Conductivity Assisted to Discover Chalcogenides with Balanced Infrared Nonlinear Optical Performance.一项关于高导热性辅助发现具有平衡红外非线性光学性能硫族化物的机器学习研究。
Adv Mater. 2024 Feb;36(6):e2309675. doi: 10.1002/adma.202309675. Epub 2023 Dec 6.