KaBOB：基于本体的生物医学数据库语义集成

KaBOB: ontology-based semantic integration of biomedical databases.

作者信息

Livingston Kevin M, Bada Michael, Baumgartner William A, Hunter Lawrence E

机构信息

Computational Bioscience Program, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

出版信息

BMC Bioinformatics. 2015 Apr 23;16(1):126. doi: 10.1186/s12859-015-0559-3.

DOI:10.1186/s12859-015-0559-3

PMID:25903923

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4448321/

Abstract

BACKGROUND

The ability to query many independent biological databases using a common ontology-based semantic model would facilitate deeper integration and more effective utilization of these diverse and rapidly growing resources. Despite ongoing work moving toward shared data formats and linked identifiers, significant problems persist in semantic data integration in order to establish shared identity and shared meaning across heterogeneous biomedical data sources.

RESULTS

We present five processes for semantic data integration that, when applied collectively, solve seven key problems. These processes include making explicit the differences between biomedical concepts and database records, aggregating sets of identifiers denoting the same biomedical concepts across data sources, and using declaratively represented forward-chaining rules to take information that is variably represented in source databases and integrating it into a consistent biomedical representation. We demonstrate these processes and solutions by presenting KaBOB (the Knowledge Base Of Biomedicine), a knowledge base of semantically integrated data from 18 prominent biomedical databases using common representations grounded in Open Biomedical Ontologies. An instance of KaBOB with data about humans and seven major model organisms can be built using on the order of 500 million RDF triples. All source code for building KaBOB is available under an open-source license.

CONCLUSIONS

KaBOB is an integrated knowledge base of biomedical data representationally based in prominent, actively maintained Open Biomedical Ontologies, thus enabling queries of the underlying data in terms of biomedical concepts (e.g., genes and gene products, interactions and processes) rather than features of source-specific data schemas or file formats. KaBOB resolves many of the issues that routinely plague biomedical researchers intending to work with data from multiple data sources and provides a platform for ongoing data integration and development and for formal reasoning over a wealth of integrated biomedical data.

摘要

背景

使用基于通用本体的语义模型查询多个独立生物数据库的能力，将有助于更深入地整合和更有效地利用这些多样且快速增长的资源。尽管在朝着共享数据格式和链接标识符的方向不断努力，但在语义数据集成方面仍存在重大问题，以便在异构生物医学数据源之间建立共享身份和共享含义。

结果

我们提出了五个语义数据集成过程，这些过程共同应用时可解决七个关键问题。这些过程包括明确生物医学概念与数据库记录之间的差异，汇总跨数据源表示相同生物医学概念的标识符集，并使用声明式表示的前向链规则获取在源数据库中以可变方式表示的信息，并将其整合到一致的生物医学表示中。我们通过展示KaBOB（生物医学知识库）来演示这些过程和解决方案，KaBOB是一个语义集成数据的知识库，它使用基于开放生物医学本体的通用表示，整合了18个著名生物医学数据库的数据。使用大约5亿个RDF三元组可以构建一个包含人类和七种主要模式生物数据的KaBOB实例。构建KaBOB的所有源代码都可在开源许可下获取。

结论

KaBOB是一个基于著名的、积极维护的开放生物医学本体的生物医学数据集成知识库，从而能够根据生物医学概念（如基因和基因产物、相互作用和过程）而不是特定于源的数据模式或文件格式的特征来查询基础数据。KaBOB解决了许多经常困扰打算使用来自多个数据源的数据的生物医学研究人员的问题，并为正在进行的数据集成和开发以及对大量集成生物医学数据进行形式推理提供了一个平台。

相似文献

KaBOB: ontology-based semantic integration of biomedical databases.

BMC Bioinformatics. 2015 Apr 23;16(1):126. doi: 10.1186/s12859-015-0559-3.

Using the Semantic Web for Rapid Integration of WikiPathways with Other Biological Online Data Resources.

PLoS Comput Biol. 2016 Jun 23;12(6):e1004989. doi: 10.1371/journal.pcbi.1004989. eCollection 2016 Jun.

AlzPharm: integration of neurodegeneration data using RDF.

BMC Bioinformatics. 2007 May 9;8 Suppl 3(Suppl 3):S4. doi: 10.1186/1471-2105-8-S3-S4.

Semantic web for integrated network analysis in biomedicine.

Brief Bioinform. 2009 Mar;10(2):177-92. doi: 10.1093/bib/bbp002.

Exploring and linking biomedical resources through multidimensional semantic spaces.

BMC Bioinformatics. 2012 Jan 25;13 Suppl 1(Suppl 1):S6. doi: 10.1186/1471-2105-13-S1-S6.

Generation of open biomedical datasets through ontology-driven transformation and integration processes.

J Biomed Semantics. 2016 Jun 3;7:32. doi: 10.1186/s13326-016-0075-z.

A semantic web ontology for small molecules and their biological targets.

J Chem Inf Model. 2010 May 24;50(5):732-41. doi: 10.1021/ci900461j.

Linked Data Applications Through Ontology Based Data Access in Clinical Research.

Stud Health Technol Inform. 2017;235:131-135.

Building biomedical web communities using a semantically aware content management system.

Brief Bioinform. 2009 Mar;10(2):129-38. doi: 10.1093/bib/bbn052. Epub 2008 Dec 6.

Toward a view-oriented approach for aligning RDF-based biomedical repositories.

Methods Inf Med. 2015;54(1):50-5. doi: 10.3414/ME13-02-0020. Epub 2014 Apr 29.

引用本文的文献

Graph databases in systems biology: a systematic review.

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae561.

An ontology-based knowledge graph for representing interactions involving RNA molecules.

Sci Data. 2024 Aug 22;11(1):906. doi: 10.1038/s41597-024-03673-7.

An open source knowledge graph ecosystem for the life sciences.

Sci Data. 2024 Apr 11;11(1):363. doi: 10.1038/s41597-024-03171-w.

Development and validation of the early warning system scores ontology.

J Biomed Semantics. 2023 Sep 20;14(1):14. doi: 10.1186/s13326-023-00296-6.

A universal diagnosis syntax.

BMC Med Inform Decis Mak. 2023 Jul 31;23(1):143. doi: 10.1186/s12911-023-02209-0.

RTX-KG2: a system for building a semantically standardized knowledge graph for translational biomedicine.

BMC Bioinformatics. 2022 Sep 29;23(1):400. doi: 10.1186/s12859-022-04932-3.

CROssBAR: comprehensive resource of biomedical relations with knowledge graph representations.

Nucleic Acids Res. 2021 Sep 20;49(16):e96. doi: 10.1093/nar/gkab543.

Knowledge-Based Biomedical Data Science.

Annu Rev Biomed Data Sci. 2020 Jul;3:23-41. doi: 10.1146/annurev-biodatasci-010820-091627. Epub 2020 Apr 7.

Establishing a consensus for the hallmarks of cancer based on gene ontology and pathway annotations.

BMC Bioinformatics. 2021 Apr 6;22(1):178. doi: 10.1186/s12859-021-04105-8.

A Semantic-Based Approach for Managing Healthcare Big Data: A Survey.

J Healthc Eng. 2020 Nov 23;2020:8865808. doi: 10.1155/2020/8865808. eCollection 2020.

本文引用的文献

Micropublications: a semantic model for claims, evidence, arguments and annotations in biomedical communications.

J Biomed Semantics. 2014 Jul 4;5:28. doi: 10.1186/2041-1480-5-28. eCollection 2014.

The 2015 Nucleic Acids Research Database Issue and molecular biology database collection.

Nucleic Acids Res. 2015 Jan;43(Database issue):D1-5. doi: 10.1093/nar/gku1241.

Ontology-Based Querying with Bio2RDF's Linked Open Data.

J Biomed Semantics. 2013 Apr 15;4 Suppl 1(Suppl 1):S1. doi: 10.1186/2041-1480-4-S1-S1.

Expression profiles of mitochondrial genes in the frontal cortex and the caudate nucleus of developing humans and mice selectively bred for high and low fear.

PLoS One. 2012;7(11):e49183. doi: 10.1371/journal.pone.0049183. Epub 2012 Nov 13.

Identifying aberrant pathways through integrated analysis of knowledge in pharmacogenomics.

Bioinformatics. 2012 Aug 15;28(16):2169-75. doi: 10.1093/bioinformatics/bts350. Epub 2012 Jun 17.

Anesthetics isoflurane and desflurane differently affect mitochondrial function, learning, and memory.

Ann Neurol. 2012 May;71(5):687-98. doi: 10.1002/ana.23536. Epub 2012 Feb 24.

Identifiers.org and MIRIAM Registry: community resources to provide persistent identification.

Nucleic Acids Res. 2012 Jan;40(Database issue):D580-6. doi: 10.1093/nar/gkr1097. Epub 2011 Dec 2.

The Gene Ontology: enhancements for 2011.

Nucleic Acids Res. 2012 Jan;40(Database issue):D559-64. doi: 10.1093/nar/gkr1028. Epub 2011 Nov 18.

NCBO Resource Index: Ontology-Based Search and Mining of Biomedical Resources.

Web Semant. 2011 Sep 1;9(3):316-324. doi: 10.1016/j.websem.2011.06.005.

Linked open drug data for pharmaceutical research and development.

J Cheminform. 2011 May 16;3(1):19. doi: 10.1186/1758-2946-3-19.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

KaBOB：基于本体的生物医学数据库语义集成

KaBOB: ontology-based semantic integration of biomedical databases.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献