• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自回归大语言模型的晶体结构生成

Crystal structure generation with autoregressive large language modeling.

作者信息

Antunes Luis M, Butler Keith T, Grau-Crespo Ricardo

机构信息

Department of Chemistry, University of Reading, Whiteknights, Reading, UK.

Department of Chemistry, University College London, London, UK.

出版信息

Nat Commun. 2024 Dec 6;15(1):10570. doi: 10.1038/s41467-024-54639-7.

DOI:10.1038/s41467-024-54639-7
PMID:39643601
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11624194/
Abstract

The generation of plausible crystal structures is often the first step in predicting the structure and properties of a material from its chemical composition. However, most current methods for crystal structure prediction are computationally expensive, slowing the pace of innovation. Seeding structure prediction algorithms with quality generated candidates can overcome a major bottleneck. Here, we introduce CrystaLLM, a methodology for the versatile generation of crystal structures, based on the autoregressive large language modeling (LLM) of the Crystallographic Information File (CIF) format. Trained on millions of CIF files, CrystaLLM focuses on modeling crystal structures through text. CrystaLLM can produce plausible crystal structures for a wide range of inorganic compounds unseen in training, as demonstrated by ab initio simulations. Our approach challenges conventional representations of crystals, and demonstrates the potential of LLMs for learning effective models of crystal chemistry, which will lead to accelerated discovery and innovation in materials science.

摘要

从化学成分预测材料的结构和性质时,生成合理的晶体结构通常是第一步。然而,当前大多数晶体结构预测方法计算成本高昂,减缓了创新步伐。用高质量生成的候选结构为结构预测算法提供种子可以克服一个主要瓶颈。在此,我们介绍CrystaLLM,这是一种基于晶体学信息文件(CIF)格式的自回归大语言建模(LLM)来通用生成晶体结构的方法。在数百万个CIF文件上进行训练后,CrystaLLM专注于通过文本对晶体结构进行建模。如从头算模拟所示,CrystaLLM可以为训练中未见过的多种无机化合物生成合理的晶体结构。我们的方法挑战了晶体的传统表示方式,并展示了大语言模型在学习有效的晶体化学模型方面的潜力,这将加速材料科学中的发现和创新。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/30887f7d9608/41467_2024_54639_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/7ab919b86f28/41467_2024_54639_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/7f515ecdd187/41467_2024_54639_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/302dde403f22/41467_2024_54639_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/34a04216f696/41467_2024_54639_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/84e35af3e094/41467_2024_54639_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/30887f7d9608/41467_2024_54639_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/7ab919b86f28/41467_2024_54639_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/7f515ecdd187/41467_2024_54639_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/302dde403f22/41467_2024_54639_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/34a04216f696/41467_2024_54639_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/84e35af3e094/41467_2024_54639_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b08/11624194/30887f7d9608/41467_2024_54639_Fig6_HTML.jpg

相似文献

1
Crystal structure generation with autoregressive large language modeling.基于自回归大语言模型的晶体结构生成
Nat Commun. 2024 Dec 6;15(1):10570. doi: 10.1038/s41467-024-54639-7.
2
Programmatic conversion of crystal structures into 3D printable files using Jmol.使用Jmol将晶体结构编程转换为3D可打印文件。
J Cheminform. 2016 Nov 23;8:66. doi: 10.1186/s13321-016-0181-z. eCollection 2016.
3
Explainable Synthesizability Prediction of Inorganic Crystal Polymorphs Using Large Language Models.使用大语言模型对无机晶体多晶型物进行可解释的可合成性预测
Angew Chem Int Ed Engl. 2025 May;64(19):e202423950. doi: 10.1002/anie.202423950. Epub 2025 Mar 22.
4
Distance Matrix-Based Crystal Structure Prediction Using Evolutionary Algorithms.基于距离矩阵的进化算法晶体结构预测。
J Phys Chem A. 2020 Dec 24;124(51):10909-10919. doi: 10.1021/acs.jpca.0c08775. Epub 2020 Dec 10.
5
Deep Learning-Based Prediction of Contact Maps and Crystal Structures of Inorganic Materials.基于深度学习的无机材料接触图和晶体结构预测
ACS Omega. 2023 Jul 11;8(29):26170-26179. doi: 10.1021/acsomega.3c02115. eCollection 2023 Jul 25.
6
Discovery of Crystalline Inorganic Solids in the Digital Age.数字时代晶体无机固体的发现。
Acc Chem Res. 2025 May 6;58(9):1355-1365. doi: 10.1021/acs.accounts.4c00694. Epub 2025 Apr 17.
7
Kinetic products in coordination networks: ab initio X-ray powder diffraction analysis.配合物网络中的动力学产物:从头算 X 射线粉末衍射分析。
Acc Chem Res. 2013 Feb 19;46(2):493-505. doi: 10.1021/ar300212v. Epub 2012 Dec 19.
8
Accelerated Organic Crystal Structure Prediction with Genetic Algorithms and Machine Learning.基于遗传算法和机器学习的加速有机晶体结构预测
J Chem Theory Comput. 2023 Dec 26;19(24):9388-9402. doi: 10.1021/acs.jctc.3c00853. Epub 2023 Dec 7.
9
Leveraging Large Language Models (LLM) for the Plastic Surgery Resident Training: Do They Have a Role?利用大语言模型进行整形外科住院医师培训:它们能发挥作用吗?
Indian J Plast Surg. 2023 Aug 28;56(5):413-420. doi: 10.1055/s-0043-1772704. eCollection 2023 Oct.
10
A database of high-pressure crystal structures from hydrogen to lanthanum.一个从氢到镧的高压晶体结构数据库。
Sci Data. 2024 Jul 12;11(1):766. doi: 10.1038/s41597-024-03447-1.

引用本文的文献

1
Catal-GPT: AI-driven directed efficient design framework for catalysts.催化GPT:用于催化剂的人工智能驱动的定向高效设计框架。
Natl Sci Rev. 2025 Jul 25;12(9):nwaf299. doi: 10.1093/nsr/nwaf299. eCollection 2025 Sep.
2
Accurate prediction of synthesizability and precursors of 3D crystal structures via large language models.通过大语言模型准确预测3D晶体结构的可合成性和前体。
Nat Commun. 2025 Jul 15;16(1):6530. doi: 10.1038/s41467-025-61778-y.
3
Disentangling autoencoders and spherical harmonics for efficient shape classification in crystal growth simulations.

本文引用的文献

1
Augmenting large language models with chemistry tools.用化学工具增强大语言模型。
Nat Mach Intell. 2024;6(5):525-535. doi: 10.1038/s42256-024-00832-8. Epub 2024 May 8.
2
Structure prediction and materials design with generative neural networks.基于生成神经网络的结构预测与材料设计。
Nat Comput Sci. 2023 Jul;3(7):572-574. doi: 10.1038/s43588-023-00471-w.
3
A universal graph deep learning interatomic potential for the periodic table.一种用于元素周期表的通用图深度学习原子间势能。
在晶体生长模拟中解开自动编码器与球谐函数以实现高效形状分类
Commun Phys. 2025;8(1):272. doi: 10.1038/s42005-025-02129-7. Epub 2025 Jul 2.
4
A Deep Generative Model for the Inverse Design of Transition Metal Ligands and Complexes.用于过渡金属配体和配合物逆向设计的深度生成模型
JACS Au. 2025 Apr 23;5(5):2294-2308. doi: 10.1021/jacsau.5c00242. eCollection 2025 May 26.
5
Exploration of crystal chemical space using text-guided generative artificial intelligence.利用文本引导的生成式人工智能探索晶体化学空间。
Nat Commun. 2025 May 12;16(1):4379. doi: 10.1038/s41467-025-59636-y.
6
A Perspective on Foundation Models in Chemistry.化学领域基础模型的视角
JACS Au. 2025 Mar 25;5(4):1499-1518. doi: 10.1021/jacsau.4c01160. eCollection 2025 Apr 28.
7
Cross-disciplinary perspectives on the potential for artificial intelligence across chemistry.关于人工智能在化学领域潜力的跨学科观点。
Chem Soc Rev. 2025 Apr 25. doi: 10.1039/d5cs00146c.
8
Molecular analysis and design using generative artificial intelligence multi-agent modeling.使用生成式人工智能多智能体建模的分子分析与设计
Mol Syst Des Eng. 2025 Jan 24;10(4):314-337. doi: 10.1039/d4me00174e. eCollection 2025 Mar 31.
Nat Comput Sci. 2022 Nov;2(11):718-728. doi: 10.1038/s43588-022-00349-3. Epub 2022 Nov 28.
4
Autonomous chemical research with large language models.大语言模型驱动的自主化学研究。
Nature. 2023 Dec;624(7992):570-578. doi: 10.1038/s41586-023-06792-0. Epub 2023 Dec 20.
5
Scaling deep learning for materials discovery.深度学习在材料发现中的应用。
Nature. 2023 Dec;624(7990):80-85. doi: 10.1038/s41586-023-06735-9. Epub 2023 Nov 29.
6
14 examples of how LLMs can transform materials science and chemistry: a reflection on a large language model hackathon.大语言模型如何改变材料科学与化学的14个实例:对一场大语言模型黑客马拉松的思考
Digit Discov. 2023 Aug 8;2(5):1233-1250. doi: 10.1039/d3dd00113j. eCollection 2023 Oct 9.
7
Lithium Intercalation into the Excitonic Insulator Candidate TaNiSe.锂嵌入激子绝缘体候选材料TaNiSe中。
Inorg Chem. 2023 Jul 31;62(30):12027-12037. doi: 10.1021/acs.inorgchem.3c01510. Epub 2023 Jul 19.
8
Optimization of Chemical Bonding through Defect Formation and Ordering─The Case of MgPtGe.通过缺陷形成和有序化优化化学键─以 MgPtGe 为例。
Inorg Chem. 2023 Jun 5;62(22):8519-8529. doi: 10.1021/acs.inorgchem.2c04312. Epub 2023 May 19.
9
Novel inorganic crystal structures predicted using autonomous simulation agents.使用自主模拟代理预测新型无机晶体结构。
Sci Data. 2022 Jun 14;9(1):302. doi: 10.1038/s41597-022-01438-8.
10
TCSP: a Template-Based Crystal Structure Prediction Algorithm for Materials Discovery.TCSP:一种基于模板的材料发现晶体结构预测算法。
Inorg Chem. 2022 Jun 6;61(22):8431-8439. doi: 10.1021/acs.inorgchem.1c03879. Epub 2022 Apr 14.