一种应用于整合生物信息学实验的语义网方法：一个基因组学数据的生物学用例。

A semantic web approach applied to integrative bioinformatics experimentation: a biological use case with genomics data.

作者信息

Post Lennart J G, Roos Marco, Marshall M Scott, van Driel Roel, Breit Timo M

机构信息

Integrative Bioinformatics Unit and Nuclear Organization Group, Swammerdam Institute for Life Sciences, University of Amsterdam, 1098 SM, Amsterdam, The Netherlands.

出版信息

Bioinformatics. 2007 Nov 15;23(22):3080-7. doi: 10.1093/bioinformatics/btm461. Epub 2007 Sep 19.

DOI:10.1093/bioinformatics/btm461

PMID:17881406

Abstract

MOTIVATION

The numerous public data resources make integrative bioinformatics experimentation increasingly important in life sciences research. However, it is severely hampered by the way the data and information are made available. The semantic web approach enhances data exchange and integration by providing standardized formats such as RDF, RDF Schema (RDFS) and OWL, to achieve a formalized computational environment. Our semantic web-enabled data integration (SWEDI) approach aims to formalize biological domains by capturing the knowledge in semantic models using ontologies as controlled vocabularies. The strategy is to build a collection of relatively small but specific knowledge and data models, which together form a 'personal semantic framework'. This can be linked to external large, general knowledge and data models. In this way, the involved scientists are familiar with the concepts and associated relationships in their models and can create semantic queries using their own terms. We studied the applicability of our SWEDI approach in the context of a biological use case by integrating genomics data sets for histone modification and transcription factor binding sites.

RESULTS

We constructed four OWL knowledge models, two RDFS data models, transformed and mapped relevant data to the data models, linked the data models to knowledge models using linkage statements, and ran semantic queries. Our biological use case demonstrates the relevance of these kinds of integrative bioinformatics experiments. Our findings show high startup costs for the SWEDI approach, but straightforward extension with similar data.

摘要

动机

众多的公共数据资源使得整合生物信息学实验在生命科学研究中变得越来越重要。然而，数据和信息的提供方式严重阻碍了这一进程。语义网方法通过提供诸如RDF、RDF模式（RDFS）和OWL等标准化格式来增强数据交换和整合，以实现形式化的计算环境。我们的语义网支持的数据集成（SWEDI）方法旨在通过使用本体作为受控词汇表在语义模型中捕获知识，从而将生物领域形式化。该策略是构建一组相对较小但特定的知识和数据模型，这些模型共同构成一个“个人语义框架”。这可以与外部的大型通用知识和数据模型相链接。通过这种方式，相关科学家熟悉其模型中的概念和相关关系，并能够使用自己的术语创建语义查询。我们通过整合用于组蛋白修饰和转录因子结合位点的基因组数据集，研究了我们的SWEDI方法在生物用例中的适用性。

结果

我们构建了四个OWL知识模型、两个RDFS数据模型，将相关数据转换并映射到数据模型，使用链接语句将数据模型与知识模型相链接，并运行语义查询。我们的生物用例证明了这类整合生物信息学实验的相关性。我们的研究结果表明，SWEDI方法的启动成本很高，但使用类似数据进行扩展却很简单。

相似文献

A semantic web approach applied to integrative bioinformatics experimentation: a biological use case with genomics data.

Bioinformatics. 2007 Nov 15;23(22):3080-7. doi: 10.1093/bioinformatics/btm461. Epub 2007 Sep 19.

Bio2RDF: towards a mashup to build bioinformatics knowledge systems.

J Biomed Inform. 2008 Oct;41(5):706-16. doi: 10.1016/j.jbi.2008.03.004. Epub 2008 Mar 21.

A semantic web ontology for small molecules and their biological targets.

J Chem Inf Model. 2010 May 24;50(5):732-41. doi: 10.1021/ci900461j.

YeastHub: a semantic web use case for integrating data in the life sciences domain.

Bioinformatics. 2005 Jun;21 Suppl 1:i85-96. doi: 10.1093/bioinformatics/bti1026.

AlzPharm: integration of neurodegeneration data using RDF.

BMC Bioinformatics. 2007 May 9;8 Suppl 3(Suppl 3):S4. doi: 10.1186/1471-2105-8-S3-S4.

Combining Semantic Web technologies with Multi-Agent Systems for integrated access to biological resources.

J Biomed Inform. 2008 Oct;41(5):848-59. doi: 10.1016/j.jbi.2008.05.007. Epub 2008 May 23.

Leveraging the structure of the Semantic Web to enhance information retrieval for proteomics.

Bioinformatics. 2007 Nov 15;23(22):3073-9. doi: 10.1093/bioinformatics/btm452. Epub 2007 Oct 7.

CelOWS: an ontology based framework for the provision of semantic web services related to biological models.

J Biomed Inform. 2010 Feb;43(1):125-36. doi: 10.1016/j.jbi.2009.08.008. Epub 2009 Aug 18.

Towards Semantic e-Science for Traditional Chinese Medicine.

BMC Bioinformatics. 2007 May 9;8 Suppl 3(Suppl 3):S6. doi: 10.1186/1471-2105-8-S3-S6.

An agent- and ontology-based system for integrating public gene, protein, and disease databases.

J Biomed Inform. 2007 Feb;40(1):17-29. doi: 10.1016/j.jbi.2006.02.014. Epub 2006 Mar 20.

引用本文的文献

FAIRification of health-related data using semantic web technologies in the Swiss Personalized Health Network.

Sci Data. 2023 Mar 10;10(1):127. doi: 10.1038/s41597-023-02028-y.

Preserving sequence annotations across reference sequences.

J Biomed Semantics. 2014 Jun 3;5(Suppl 1 Proceedings of the Bio-Ontologies Spec Interest G):S6. doi: 10.1186/2041-1480-5-S1-S6. eCollection 2014.

The Zebrafish GenomeWiki: a crowdsourcing approach to connect the long tail for zebrafish gene annotation.

Database (Oxford). 2014 Feb 26;2014:bau011. doi: 10.1093/database/bau011. Print 2014.

The Third ACGG-DB Meeting Report: Towards an international collaborative infrastructure for glycobioinformatics.

Glycobiology. 2013 Feb;23(2):144-6. doi: 10.1093/glycob/cws167.

Accelerating cancer systems biology research through Semantic Web technology.

Wiley Interdiscip Rev Syst Biol Med. 2013 Mar-Apr;5(2):135-51. doi: 10.1002/wsbm.1200. Epub 2012 Nov 27.

Current trends and new challenges of databases and web applications for systems driven biological research.

Front Physiol. 2010 Dec 3;1:147. doi: 10.3389/fphys.2010.00147. eCollection 2010.

Knowledge management for systems biology a general and visually driven framework applied to translational medicine.

BMC Syst Biol. 2011 Mar 5;5:38. doi: 10.1186/1752-0509-5-38.

Semantically enabled and statistically supported biological hypothesis testing with tissue microarray databases.

BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S51. doi: 10.1186/1471-2105-12-S1-S51.

At the intersection of public-health informatics and bioinformatics: using advanced Web technologies for phylogeography.

Epidemiology. 2010 Nov;21(6):764-8. doi: 10.1097/EDE.0b013e3181f534dd.

Data integration for dynamic and sustainable systems biology resources: challenges and lessons learned.

Chem Biodivers. 2010 May;7(5):1124-41. doi: 10.1002/cbdv.200900317.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种应用于整合生物信息学实验的语义网方法：一个基因组学数据的生物学用例。

A semantic web approach applied to integrative bioinformatics experimentation: a biological use case with genomics data.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

动机

结果

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献