• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

聊天机器人生成的150万条素材叙述。

1.5 million materials narratives generated by chatbots.

作者信息

Park Yang Jeong, Jerng Sung Eun, Yoon Sungroh, Li Ju

机构信息

Massachusetts Institute of Technology, Department of Nuclear Science and Engineering, Cambridge, 02139, USA.

Massachusetts Institute of Technology, Department of Materials Science and Engineering, Cambridge, 02139, USA.

出版信息

Sci Data. 2024 Sep 28;11(1):1060. doi: 10.1038/s41597-024-03886-w.

DOI:10.1038/s41597-024-03886-w
PMID:39341807
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11439064/
Abstract

The advent of artificial intelligence (AI) has enabled a comprehensive exploration of materials for various applications. However, AI models often prioritize frequently encountered material examples in the scientific literature, limiting the selection of suitable candidates based on inherent physical and chemical attributes. To address this imbalance, we generated a dataset consisting of 1,453,493 natural language-material narratives from OQMD, Materials Project, JARVIS, and AFLOW2 databases based on ab initio calculation results that are more evenly distributed across the periodic table. The generated text narratives were then scored by both human experts and GPT-4, based on three rubrics: technical accuracy, language and structure, and relevance and depth of content, showing similar scores but with human-scored depth of content being the most lagging. The integration of multimodal data sources and large language models holds immense potential for AI frameworks to aid the exploration and discovery of solid-state materials for specific applications of interest.

摘要

人工智能(AI)的出现使得人们能够全面探索适用于各种应用的材料。然而,AI模型通常优先考虑科学文献中经常出现的材料示例,这限制了基于固有物理和化学属性来选择合适的候选材料。为了解决这种不平衡,我们基于从头算计算结果生成了一个数据集,该数据集包含来自OQMD、材料项目、JARVIS和AFLOW2数据库的1,453,493条自然语言-材料叙述,这些叙述在元素周期表上的分布更加均匀。然后,由人类专家和GPT-4根据三个标准对生成的文本叙述进行评分:技术准确性、语言和结构,以及内容的相关性和深度,结果显示两者得分相似,但人类评分的内容深度最为滞后。多模态数据源和大语言模型的整合为AI框架助力探索和发现用于特定感兴趣应用的固态材料具有巨大潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9625/11439064/d8c4e6d30738/41597_2024_3886_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9625/11439064/ebcbd800331a/41597_2024_3886_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9625/11439064/1d7dead07fec/41597_2024_3886_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9625/11439064/65b646f43dc0/41597_2024_3886_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9625/11439064/4d3c69fd9067/41597_2024_3886_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9625/11439064/d8c4e6d30738/41597_2024_3886_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9625/11439064/ebcbd800331a/41597_2024_3886_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9625/11439064/1d7dead07fec/41597_2024_3886_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9625/11439064/65b646f43dc0/41597_2024_3886_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9625/11439064/4d3c69fd9067/41597_2024_3886_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9625/11439064/d8c4e6d30738/41597_2024_3886_Fig5_HTML.jpg

相似文献

1
1.5 million materials narratives generated by chatbots.聊天机器人生成的150万条素材叙述。
Sci Data. 2024 Sep 28;11(1):1060. doi: 10.1038/s41597-024-03886-w.
2
Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware.人工智能聊天机器人对输精管切除术相关问题回答的准确性和可读性:公众需谨慎。
Cureus. 2024 Aug 28;16(8):e67996. doi: 10.7759/cureus.67996. eCollection 2024 Aug.
3
The Use of Artificial Intelligence-Based Conversational Agents (Chatbots) for Weight Loss: Scoping Review and Practical Recommendations.基于人工智能的对话代理(聊天机器人)在减肥中的应用:范围综述与实用建议。
JMIR Med Inform. 2022 Apr 13;10(4):e32578. doi: 10.2196/32578.
4
Diagnostic accuracy of large language models in psychiatry.精神科大语言模型的诊断准确性。
Asian J Psychiatr. 2024 Oct;100:104168. doi: 10.1016/j.ajp.2024.104168. Epub 2024 Jul 25.
5
Artificial Intelligence-Based Conversational Agents for Chronic Conditions: Systematic Literature Review.基于人工智能的慢性病对话代理:系统文献综述。
J Med Internet Res. 2020 Sep 14;22(9):e20701. doi: 10.2196/20701.
6
A systematic review of artificial intelligence-powered (AI-powered) chatbot intervention for managing chronic illness.人工智能驱动的(AI 驱动)聊天机器人干预管理慢性疾病的系统评价。
Ann Med. 2024 Dec;56(1):2302980. doi: 10.1080/07853890.2024.2302980. Epub 2024 Mar 11.
7
Artificial Intelligence-Generated Scientific Literature: A Critical Appraisal.人工智能生成的科学文献:批判性评价。
J Allergy Clin Immunol Pract. 2024 Jan;12(1):106-110. doi: 10.1016/j.jaip.2023.10.010. Epub 2023 Oct 12.
8
Artificial Intelligence Can Generate Fraudulent but Authentic-Looking Scientific Medical Articles: Pandora's Box Has Been Opened.人工智能可以生成虚假但看起来真实的科学医学文章:潘多拉的盒子已经被打开。
J Med Internet Res. 2023 May 31;25:e46924. doi: 10.2196/46924.
9
The performance of artificial intelligence chatbot large language models to address skeletal biology and bone health queries.人工智能聊天机器人大型语言模型在解决骨骼生物学和骨骼健康问题方面的表现。
J Bone Miner Res. 2024 Mar 22;39(2):106-115. doi: 10.1093/jbmr/zjad007.
10
Comparison of artificial intelligence large language model chatbots in answering frequently asked questions in anaesthesia.人工智能大语言模型聊天机器人在回答麻醉常见问题方面的比较。
BJA Open. 2024 May 8;10:100280. doi: 10.1016/j.bjao.2024.100280. eCollection 2024 Jun.

引用本文的文献

1
A simulated dataset for proactive robot task inference from streaming natural language dialogues.一个用于从流式自然语言对话中进行主动机器人任务推理的模拟数据集。
Sci Data. 2025 Aug 11;12(1):1405. doi: 10.1038/s41597-025-05727-w.
2
Evaluating large language model workflows in clinical decision support for triage and referral and diagnosis.评估用于分诊、转诊和诊断的临床决策支持中的大语言模型工作流程。
NPJ Digit Med. 2025 May 9;8(1):263. doi: 10.1038/s41746-025-01684-1.

本文引用的文献

1
Machine-enabled inverse design of inorganic solid materials: promises and challenges.无机固体材料的机器辅助逆向设计:前景与挑战。
Chem Sci. 2020 Apr 15;11(19):4871-4881. doi: 10.1039/d0sc00594k.
2
From nanoscale interface characterization to sustainable energy storage using all-solid-state batteries.从纳米级界面特性描述到全固态电池的可持续储能。
Nat Nanotechnol. 2020 Mar;15(3):170-180. doi: 10.1038/s41565-020-0657-x. Epub 2020 Mar 10.
3
Challenges and perspectives of NASICON-type solid electrolytes for all-solid-state lithium batteries.
NASICON 型固体电解质在全固态锂电池中的挑战与展望。
Nanotechnology. 2020 Mar 27;31(13):132003. doi: 10.1088/1361-6528/ab5be7. Epub 2019 Nov 26.
4
Carbon capture and conversion using metal-organic frameworks and MOF-based materials.使用金属有机框架材料和基于金属有机框架的材料进行碳捕获与转化。
Chem Soc Rev. 2019 May 20;48(10):2783-2828. doi: 10.1039/c8cs00829a.
5
The atomic simulation environment-a Python library for working with atoms.原子模拟环境——一个用于处理原子的Python库。
J Phys Condens Matter. 2017 Jul 12;29(27):273002. doi: 10.1088/1361-648X/aa680e. Epub 2017 Mar 21.
6
Towards greener and more sustainable batteries for electrical energy storage.迈向更绿色、更可持续的电化学储能电池。
Nat Chem. 2015 Jan;7(1):19-29. doi: 10.1038/nchem.2085. Epub 2014 Nov 17.