将糖组学数据引入语义网。

Introducing glycomics data into the Semantic Web.

作者信息

Aoki-Kinoshita Kiyoko F, Bolleman Jerven, Campbell Matthew P, Kawano Shin, Kim Jin-Dong, Lütteke Thomas, Matsubara Masaaki, Okuda Shujiro, Ranzinger Rene, Sawaki Hiromichi, Shikanai Toshihide, Shinmachi Daisuke, Suzuki Yoshinori, Toukach Philip, Yamada Issaku, Packer Nicolle H, Narimatsu Hisashi

机构信息

Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba Central-2, Umezono 1-1-1, Tsukuba 305-8568, Japan.

出版信息

J Biomed Semantics. 2013 Nov 26;4(1):39. doi: 10.1186/2041-1480-4-39.

DOI:10.1186/2041-1480-4-39

PMID:24280648

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4177142/

Abstract

BACKGROUND

Glycoscience is a research field focusing on complex carbohydrates (otherwise known as glycans)a, which can, for example, serve as "switches" that toggle between different functions of a glycoprotein or glycolipid. Due to the advancement of glycomics technologies that are used to characterize glycan structures, many glycomics databases are now publicly available and provide useful information for glycoscience research. However, these databases have almost no link to other life science databases.

RESULTS

In order to implement support for the Semantic Web most efficiently for glycomics research, the developers of major glycomics databases agreed on a minimal standard for representing glycan structure and annotation information using RDF (Resource Description Framework). Moreover, all of the participants implemented this standard prototype and generated preliminary RDF versions of their data. To test the utility of the converted data, all of the data sets were uploaded into a Virtuoso triple store, and several SPARQL queries were tested as "proofs-of-concept" to illustrate the utility of the Semantic Web in querying across databases which were originally difficult to implement.

CONCLUSIONS

We were able to successfully retrieve information by linking UniCarbKB, GlycomeDB and JCGGDB in a single SPARQL query to obtain our target information. We also tested queries linking UniProt with GlycoEpitope as well as lectin data with GlycomeDB through PDB. As a result, we have been able to link proteomics data with glycomics data through the implementation of Semantic Web technologies, allowing for more flexible queries across these domains.

摘要

背景

糖科学是一个专注于复杂碳水化合物（又称聚糖）的研究领域，例如，聚糖可作为“开关”，在糖蛋白或糖脂的不同功能之间切换。由于用于表征聚糖结构的糖组学技术的进步，现在许多糖组学数据库都可公开获取，并为糖科学研究提供有用信息。然而，这些数据库几乎与其他生命科学数据库没有关联。

结果

为了最有效地为糖组学研究实现对语义网的支持，主要糖组学数据库的开发者就使用RDF（资源描述框架）表示聚糖结构和注释信息的最低标准达成了一致。此外，所有参与者都实现了该标准原型，并生成了其数据的初步RDF版本。为了测试转换后数据的实用性，所有数据集都上传到了Virtuoso三元组存储中，并测试了几个SPARQL查询作为“概念验证”，以说明语义网在跨原本难以实现的数据库进行查询方面的实用性。

结论

我们能够通过在单个SPARQL查询中链接UniCarbKB、GlycomeDB和JCGGDB成功检索信息，以获取我们的目标信息。我们还测试了通过PDB将UniProt与糖基表位以及凝集素数据与GlycomeDB链接的查询。结果，通过实施语义网技术，我们能够将蛋白质组学数据与糖组学数据链接起来，从而在这些领域进行更灵活的查询。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1269/4177142/4fc1e1b05ad9/2041-1480-4-39-1.jpg

相似文献

Introducing glycomics data into the Semantic Web.

J Biomed Semantics. 2013 Nov 26;4(1):39. doi: 10.1186/2041-1480-4-39.

Gauging triple stores with actual biological data.

BMC Bioinformatics. 2012 Jan 25;13 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2105-13-S1-S3.

A Querying Method over RDF-ized Health Level Seven v2.5 Messages Using Life Science Knowledge Resources.

JMIR Med Inform. 2016 Apr 5;4(2):e12. doi: 10.2196/medinform.5275.

Design and development of a linked open data-based health information representation and visualization system: potentials and preliminary evaluation.

JMIR Med Inform. 2014 Oct 25;2(2):e31. doi: 10.2196/medinform.3531.

Processing SPARQL queries with regular expressions in RDF databases.

BMC Bioinformatics. 2011 Mar 29;12 Suppl 2(Suppl 2):S6. doi: 10.1186/1471-2105-12-S2-S6.

SPANG: a SPARQL client supporting generation and reuse of queries for distributed RDF databases.

BMC Bioinformatics. 2017 Feb 8;18(1):93. doi: 10.1186/s12859-017-1531-1.

IDSM ChemWebRDF: SPARQLing small-molecule datasets.

J Cheminform. 2021 May 12;13(1):38. doi: 10.1186/s13321-021-00515-1.

A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL.

F1000Res. 2019 Oct 29;8:1822. doi: 10.12688/f1000research.21027.2. eCollection 2019.

BioCarian: search engine for exploratory searches in heterogeneous biological databases.

BMC Bioinformatics. 2017 Oct 2;18(1):435. doi: 10.1186/s12859-017-1840-4.

Semantic Web repositories for genomics data using the eXframe platform.

J Biomed Semantics. 2014 Jun 3;5(Suppl 1 Proceedings of the Bio-Ontologies Spec Interest G):S3. doi: 10.1186/2041-1480-5-S1-S3. eCollection 2014.

引用本文的文献

Functions of Glycosylation and Related Web Resources for Its Prediction.

Methods Mol Biol. 2022;2499:135-144. doi: 10.1007/978-1-0716-2317-6_6.

Informatics Ecosystems to Advance the Biology of Glycans.

Methods Mol Biol. 2022;2303:655-673. doi: 10.1007/978-1-0716-1398-6_50.

The glycoconjugate ontology (GlycoCoO) for standardizing the annotation of glycoconjugate data and its application.

Glycobiology. 2021 Aug 7;31(7):741-750. doi: 10.1093/glycob/cwab013.

GlycoPOST realizes FAIR principles for glycomics mass spectrometry data.

Nucleic Acids Res. 2021 Jan 8;49(D1):D1523-D1528. doi: 10.1093/nar/gkaa1012.

BioHackathon 2015: Semantics of data for life sciences and reproducible research.

F1000Res. 2020 Feb 24;9:136. doi: 10.12688/f1000research.18236.1. eCollection 2020.

Property Graph vs RDF Triple Store: A Comparison on Glycan Substructure Search.

PLoS One. 2015 Dec 14;10(12):e0144578. doi: 10.1371/journal.pone.0144578. eCollection 2015.

SugarBindDB, a resource of glycan-mediated host-pathogen interactions.

Nucleic Acids Res. 2016 Jan 4;44(D1):D1243-50. doi: 10.1093/nar/gkv1247. Epub 2015 Nov 17.

Carbohydrate structure database merged from bacterial, archaeal, plant and fungal parts.

Nucleic Acids Res. 2016 Jan 4;44(D1):D1229-36. doi: 10.1093/nar/gkv840. Epub 2015 Aug 18.

The Lectin Frontier Database (LfDB), and data generation based on frontal affinity chromatography.

Molecules. 2015 Jan 8;20(1):951-73. doi: 10.3390/molecules20010951.

GlycoRDF: an ontology to standardize glycomics data in RDF.

Bioinformatics. 2015 Mar 15;31(6):919-25. doi: 10.1093/bioinformatics/btu732. Epub 2014 Nov 11.

本文引用的文献

JCGGDB: Japan Consortium for Glycobiology and Glycotechnology Database.

Methods Mol Biol. 2015;1273:161-79. doi: 10.1007/978-1-4939-2343-4_12.

Using databases and web resources for glycomics research.

Mol Cell Proteomics. 2013 Apr;12(4):1036-45. doi: 10.1074/mcp.R112.026252. Epub 2013 Jan 16.

Update on activities at the Universal Protein Resource (UniProt) in 2013.

Nucleic Acids Res. 2013 Jan;41(Database issue):D43-7. doi: 10.1093/nar/gks1068. Epub 2012 Nov 17.

Large-scale identification of N-glycosylated proteins of mouse tissues and construction of a glycoprotein database, GlycoProtDB.

J Proteome Res. 2012 Sep 7;11(9):4553-66. doi: 10.1021/pr300346c. Epub 2012 Aug 13.

UniCarbKB: putting the pieces together for glycomics research.

Proteomics. 2011 Nov;11(21):4117-21. doi: 10.1002/pmic.201100302. Epub 2011 Sep 19.

Bacterial carbohydrate structure database 3: principles and realization.

J Chem Inf Model. 2011 Jan 24;51(1):159-70. doi: 10.1021/ci100150d. Epub 2010 Dec 14.

EUROCarbDB: An open-access platform for glycoinformatics.

Glycobiology. 2011 Apr;21(4):493-502. doi: 10.1093/glycob/cwq188. Epub 2010 Nov 23.

GlycomeDB--a unified database for carbohydrate structures.

Nucleic Acids Res. 2011 Jan;39(Database issue):D373-6. doi: 10.1093/nar/gkq1014. Epub 2010 Nov 2.

GlycoCT-a unifying sequence format for carbohydrates.

Carbohydr Res. 2008 Aug 11;343(12):2162-71. doi: 10.1016/j.carres.2008.03.011. Epub 2008 Mar 13.

GlycoBase and autoGU: tools for HPLC-based glycan analysis.

Bioinformatics. 2008 May 1;24(9):1214-6. doi: 10.1093/bioinformatics/btn090. Epub 2008 Mar 14.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

将糖组学数据引入语义网。

Introducing glycomics data into the Semantic Web.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献