The scalable precision medicine open knowledge engine (SPOKE): a massive knowledge graph of biomedical information.

Affiliations

Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, San Francisco, CA 94158, USA.

Department of Neurology, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA 94158, USA.

Publication information

Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad080.

Abstract

MOTIVATION

Knowledge graphs (KGs) are being adopted in industry, commerce and academia. Biomedical KGs present a particular challenge because of the complexity, size and heterogeneity of the underlying information.

RESULTS

In this work, we present the Scalable Precision Medicine Open Knowledge Engine (SPOKE), a biomedical KG connecting millions of concepts via semantically meaningful relationships. SPOKE contains 27 million nodes of 21 types and 53 million edges of 55 types, downloaded from 41 databases. The graph is built on a framework of 11 ontologies that maintain its structure, enable mappings and facilitate navigation. SPOKE is rebuilt weekly by Python scripts that download each resource, check it for integrity and completeness, and then create a 'parent table' of nodes and edges. Graph queries are translated by a REST API, and users can submit searches either directly through the API or through a graphical user interface.

Conclusions/Significance: SPOKE enables the integration of seemingly disparate information to support precision medicine efforts.
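The weekly build described above (download each resource, check integrity and completeness, then assemble a 'parent table' of nodes and edges) can be illustrated with a minimal Python sketch. This is not SPOKE's actual code: the `Resource` fields, the CSV column names, the toy database names and the use of a SHA-256 checksum and a minimum row count as the integrity and completeness checks are all assumptions made for illustration, and the download step is replaced by in-memory data.

```python
import csv
import hashlib
import io
from dataclasses import dataclass


@dataclass
class Resource:
    """One upstream source. All fields are hypothetical, not SPOKE's real schema."""
    name: str
    payload: bytes   # stands in for the downloaded file
    sha256: str      # expected checksum (assumed integrity check)
    min_rows: int    # expected minimum record count (assumed completeness check)


def check_resource(res):
    """Return the parsed rows if the resource passes both checks, else raise ValueError."""
    if hashlib.sha256(res.payload).hexdigest() != res.sha256:
        raise ValueError(f"{res.name}: checksum mismatch")
    rows = list(csv.DictReader(io.StringIO(res.payload.decode("utf-8"))))
    if len(rows) < res.min_rows:
        raise ValueError(f"{res.name}: {len(rows)} rows, expected >= {res.min_rows}")
    return rows


def build_parent_table(resources):
    """Merge all validated resources into one table of nodes and one of edges."""
    nodes, edges = {}, []
    for res in resources:
        for row in check_resource(res):
            # Nodes are keyed by identifier so a concept seen in several
            # sources is stored once; each edge records where it came from.
            nodes[row["source_id"]] = row["source_type"]
            nodes[row["target_id"]] = row["target_type"]
            edges.append((row["source_id"], row["edge_type"], row["target_id"], res.name))
    return nodes, edges


def make_resource(name, text, min_rows):
    """Helper for the demo: wrap text as a Resource with a matching checksum."""
    data = text.encode("utf-8")
    return Resource(name, data, hashlib.sha256(data).hexdigest(), min_rows)


HEADER = "source_id,source_type,edge_type,target_id,target_type\n"
demo = [
    make_resource("toy_db_a", HEADER + "C1,Compound,TREATS,D1,Disease\n", 1),
    make_resource("toy_db_b", HEADER + "C1,Compound,BINDS,G1,Gene\n"
                                       "G1,Gene,ASSOCIATES,D1,Disease\n", 2),
]
nodes, edges = build_parent_table(demo)
print(len(nodes), "nodes,", len(edges), "edges")  # 3 nodes, 3 edges
```

Keying nodes by identifier is what lets two sources that mention the same concept (here `C1` and `D1`) contribute edges to a single shared node, which is the sense in which a KG connects otherwise separate databases.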

AVAILABILITY AND IMPLEMENTATION

The SPOKE neighborhood explorer is available at https://spoke.rbvi.ucsf.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
