• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从文本语料库自动构建的材料术语知识图谱。

A materials terminology knowledge graph automatically constructed from text corpus.

机构信息

School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, 100083, China.

Beijing Advanced Innovation Center for Materials Genome Engineering, Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing, 100083, China.

出版信息

Sci Data. 2024 Jun 7;11(1):600. doi: 10.1038/s41597-024-03448-0.

DOI:10.1038/s41597-024-03448-0
PMID:38849436
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11161478/
Abstract

A scalable, reusable, and broad-coverage unified material knowledge representation shows its importance and will bring great benefits to data sharing among materials communities. A knowledge graph (KG) for materials terminology, which is a formal collection of term entities and relationships, is conceptually important to achieve this goal. In this work, we propose a KG for materials terminology, named Materials Genome Engineering Database Knowledge Graph (MGED-KG), which is automatically constructed from text corpus via natural language processing. MGED-KG is the most comprehensive KG for materials terminology in both Chinese and English languages, consisting of 8,660 terms and their explanations. It encompasses 11 principal categories, such as Metals, Composites, Nanomaterials, each with two or three levels of subcategories, resulting in a total of 235 distinct category labels. For further application, a knowledge web system based on MGED-KG is developed and shows its great power in improving data sharing efficiency from the aspects of query expansion, term, and data recommendation.

摘要

一个可扩展的、可重复使用的、具有广泛覆盖范围的统一材料知识表示形式,显示了其重要性,并将为材料界的数据共享带来巨大的好处。术语知识图(KG)是术语实体和关系的正式集合,对于实现这一目标具有重要的概念意义。在这项工作中,我们提出了一个材料术语的 KG,命名为 Materials Genome Engineering Database Knowledge Graph(MGED-KG),它是通过自然语言处理从文本语料库自动构建的。MGED-KG 是中英文材料术语最全面的 KG,包含 8660 个术语及其解释。它涵盖了 11 个主要类别,如金属、复合材料、纳米材料,每个类别都有两个或三个层次的子类别,总共包含 235 个不同的类别标签。为了进一步应用,我们开发了一个基于 MGED-KG 的知识网络系统,并从查询扩展、术语和数据推荐等方面展示了其提高数据共享效率的强大功能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/36cd2b84a8f3/41597_2024_3448_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/695b251e1329/41597_2024_3448_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/9575bdb43cd0/41597_2024_3448_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/40b8d6129428/41597_2024_3448_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/cd47f1e5a47b/41597_2024_3448_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/f11e5b7eb90c/41597_2024_3448_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/d6a8d10d8dfd/41597_2024_3448_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/0444d7bb9d63/41597_2024_3448_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/8ab8429fe819/41597_2024_3448_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/36cd2b84a8f3/41597_2024_3448_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/695b251e1329/41597_2024_3448_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/9575bdb43cd0/41597_2024_3448_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/40b8d6129428/41597_2024_3448_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/cd47f1e5a47b/41597_2024_3448_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/f11e5b7eb90c/41597_2024_3448_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/d6a8d10d8dfd/41597_2024_3448_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/0444d7bb9d63/41597_2024_3448_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/8ab8429fe819/41597_2024_3448_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a745/11161478/36cd2b84a8f3/41597_2024_3448_Fig9_HTML.jpg

相似文献

1
A materials terminology knowledge graph automatically constructed from text corpus.从文本语料库自动构建的材料术语知识图谱。
Sci Data. 2024 Jun 7;11(1):600. doi: 10.1038/s41597-024-03448-0.
2
Leveraging Knowledge Graphs and Natural Language Processing for Automated Web Resource Labeling and Knowledge Mobilization in Neurodevelopmental Disorders: Development and Usability Study.利用知识图谱和自然语言处理实现神经发育障碍相关网络资源自动标注和知识转化:开发和可用性研究。
J Med Internet Res. 2023 Apr 17;25:e45268. doi: 10.2196/45268.
3
Assessment of disease named entity recognition on a corpus of annotated sentences.基于带注释句子语料库的疾病命名实体识别评估。
BMC Bioinformatics. 2008 Apr 11;9 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2105-9-S3-S3.
4
COVID term: a bilingual terminology for COVID-19.新冠术语:COVID-19 的双语术语。
BMC Med Inform Decis Mak. 2021 Aug 3;21(1):231. doi: 10.1186/s12911-021-01593-9.
5
Aspect-based sentiment analysis with graph convolution over syntactic dependencies.基于句法依存关系图卷积的面向方面情感分析。
Artif Intell Med. 2021 Sep;119:102138. doi: 10.1016/j.artmed.2021.102138. Epub 2021 Aug 9.
6
A Harmonized Data Quality Assessment Terminology and Framework for the Secondary Use of Electronic Health Record Data.电子健康记录数据二次使用的统一数据质量评估术语和框架。
EGEMS (Wash DC). 2016 Sep 11;4(1):1244. doi: 10.13063/2327-9214.1244. eCollection 2016.
7
Automatic knowledge extraction from Chinese electronic medical records and rheumatoid arthritis knowledge graph construction.从中国电子病历中自动提取知识并构建类风湿性关节炎知识图谱。
Quant Imaging Med Surg. 2023 Jun 1;13(6):3873-3890. doi: 10.21037/qims-22-1158. Epub 2023 May 8.
8
Discovering novel drug-supplement interactions using SuppKG generated from the biomedical literature.利用从生物医学文献中生成的 SuppKG 发现新的药物-补充剂相互作用。
J Biomed Inform. 2022 Jul;131:104120. doi: 10.1016/j.jbi.2022.104120. Epub 2022 Jun 13.
9
Construction of Knowledge Graph English Online Homework Evaluation System Based on Multimodal Neural Network Feature Extraction.基于多模态神经网络特征提取的知识图谱英语在线作业评估系统的构建。
Comput Intell Neurosci. 2022 May 13;2022:7941414. doi: 10.1155/2022/7941414. eCollection 2022.
10
TeaBERT: An Efficient Knowledge Infused Cross-Lingual Language Model for Mapping Chinese Medical Entities to the Unified Medical Language System.茶伯特:一种高效的知识注入跨语言语言模型,用于将中文医学实体映射到统一医学语言系统。
IEEE J Biomed Health Inform. 2023 Dec;27(12):6029-6038. doi: 10.1109/JBHI.2023.3315143. Epub 2023 Dec 6.

引用本文的文献

1
A knowledge graph for crop diseases and pests in China.中国农作物病虫害知识图谱。
Sci Data. 2025 Feb 6;12(1):222. doi: 10.1038/s41597-025-04492-0.

本文引用的文献

1
MatKG: An autonomously generated knowledge graph in Material Science.MatKG:材料科学领域中自主生成的知识图谱。
Sci Data. 2024 Feb 17;11(1):217. doi: 10.1038/s41597-024-03039-z.
2
A repository for the publication and sharing of heterogeneous materials data.一个用于发布和共享异类材料数据的存储库。
Sci Data. 2022 Dec 27;9(1):787. doi: 10.1038/s41597-022-01897-z.
3
Using Machine Learning to Predict Oxygen Evolution Activity for Transition Metal Hydroxide Electrocatalysts.利用机器学习预测过渡金属氢氧化物电催化剂的析氧活性。
ACS Appl Mater Interfaces. 2022 Sep 14;14(36):41141-41148. doi: 10.1021/acsami.2c13435. Epub 2022 Aug 31.
4
Materials information extraction via automatically generated corpus.通过自动生成的语料库进行材料信息提取。
Sci Data. 2022 Jul 13;9(1):401. doi: 10.1038/s41597-022-01492-2.
5
Materials Data toward Machine Learning: Advances and Challenges.材料数据助力机器学习:进展与挑战。
J Phys Chem Lett. 2022 May 12;13(18):3965-3977. doi: 10.1021/acs.jpclett.2c00576. Epub 2022 Apr 28.
6
OPTIMADE, an API for exchanging materials data.OPTIMADE,一种用于交换材料数据的 API。
Sci Data. 2021 Aug 12;8(1):217. doi: 10.1038/s41597-021-00974-z.
7
A universal system for digitization and automatic execution of the chemical synthesis literature.一种用于化学合成文献数字化和自动执行的通用系统。
Science. 2020 Oct 2;370(6512):101-108. doi: 10.1126/science.abc2986.
8
Materials Cloud, a platform for open computational science.材料云,一个开放计算科学平台。
Sci Data. 2020 Sep 8;7(1):299. doi: 10.1038/s41597-020-00637-5.
9
The CECAM electronic structure library and the modular software development paradigm.CECAM 电子结构库与模块化软件开发范式。
J Chem Phys. 2020 Jul 14;153(2):024117. doi: 10.1063/5.0012901.
10
Inorganic Materials Synthesis Planning with Literature-Trained Neural Networks.基于文献训练的神经网络的无机材料合成规划。
J Chem Inf Model. 2020 Mar 23;60(3):1194-1201. doi: 10.1021/acs.jcim.9b00995. Epub 2020 Jan 28.