• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

属性图与RDF三元组存储库:聚糖子结构搜索的比较

Property Graph vs RDF Triple Store: A Comparison on Glycan Substructure Search.

作者信息

Alocci Davide, Mariethoz Julien, Horlacher Oliver, Bolleman Jerven T, Campbell Matthew P, Lisacek Frederique

机构信息

Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, 1211, Switzerland.

Computer Science Department, University of Geneva, Geneva, 1227, Switzerland.

出版信息

PLoS One. 2015 Dec 14;10(12):e0144578. doi: 10.1371/journal.pone.0144578. eCollection 2015.

DOI:10.1371/journal.pone.0144578
PMID:26656740
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4684231/
Abstract

Resource description framework (RDF) and Property Graph databases are emerging technologies that are used for storing graph-structured data. We compare these technologies through a molecular biology use case: glycan substructure search. Glycans are branched tree-like molecules composed of building blocks linked together by chemical bonds. The molecular structure of a glycan can be encoded into a direct acyclic graph where each node represents a building block and each edge serves as a chemical linkage between two building blocks. In this context, Graph databases are possible software solutions for storing glycan structures and Graph query languages, such as SPARQL and Cypher, can be used to perform a substructure search. Glycan substructure searching is an important feature for querying structure and experimental glycan databases and retrieving biologically meaningful data. This applies for example to identifying a region of the glycan recognised by a glycan binding protein (GBP). In this study, 19,404 glycan structures were selected from GlycomeDB (www.glycome-db.org) and modelled for being stored into a RDF triple store and a Property Graph. We then performed two different sets of searches and compared the query response times and the results from both technologies to assess performance and accuracy. The two implementations produced the same results, but interestingly we noted a difference in the query response times. Qualitative measures such as portability were also used to define further criteria for choosing the technology adapted to solving glycan substructure search and other comparable issues.

摘要

资源描述框架(RDF)和属性图数据库是用于存储图结构数据的新兴技术。我们通过一个分子生物学用例——聚糖子结构搜索,来比较这些技术。聚糖是由通过化学键连接在一起的构建块组成的分支树状分子。聚糖的分子结构可以编码为一个有向无环图,其中每个节点代表一个构建块,每条边代表两个构建块之间的化学键。在这种情况下,图数据库是存储聚糖结构的可能软件解决方案,并且图查询语言,如SPARQL和Cypher,可用于执行子结构搜索。聚糖子结构搜索是查询结构和实验性聚糖数据库以及检索具有生物学意义的数据的一项重要功能。例如,这适用于识别聚糖结合蛋白(GBP)识别的聚糖区域。在本研究中,从GlycomeDB(www.glycome-db.org)中选择了19404个聚糖结构,并对其进行建模以便存储到RDF三元组存储和属性图中。然后我们执行了两组不同的搜索,并比较了两种技术的查询响应时间和结果,以评估性能和准确性。两种实现产生了相同的结果,但有趣的是,我们注意到查询响应时间存在差异。诸如可移植性等定性指标也被用于定义进一步的标准,以选择适合解决聚糖子结构搜索及其他类似问题的技术。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fc7/4684231/922eb02b288d/pone.0144578.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fc7/4684231/57e632c46912/pone.0144578.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fc7/4684231/de06642f7d37/pone.0144578.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fc7/4684231/7a8cf210034f/pone.0144578.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fc7/4684231/ffd26765449d/pone.0144578.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fc7/4684231/922eb02b288d/pone.0144578.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fc7/4684231/57e632c46912/pone.0144578.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fc7/4684231/de06642f7d37/pone.0144578.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fc7/4684231/7a8cf210034f/pone.0144578.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fc7/4684231/ffd26765449d/pone.0144578.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fc7/4684231/922eb02b288d/pone.0144578.g005.jpg

相似文献

1
Property Graph vs RDF Triple Store: A Comparison on Glycan Substructure Search.属性图与RDF三元组存储库:聚糖子结构搜索的比较
PLoS One. 2015 Dec 14;10(12):e0144578. doi: 10.1371/journal.pone.0144578. eCollection 2015.
2
Processing SPARQL queries with regular expressions in RDF databases.在 RDF 数据库中使用正则表达式处理 SPARQL 查询。
BMC Bioinformatics. 2011 Mar 29;12 Suppl 2(Suppl 2):S6. doi: 10.1186/1471-2105-12-S2-S6.
3
A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL.通过SPARQL跨多个数据源查询进化关系的实践指南。
F1000Res. 2019 Oct 29;8:1822. doi: 10.12688/f1000research.21027.2. eCollection 2019.
4
Query Chem: a Google-powered web search combining text and chemical structures.查询化学:一种由谷歌驱动的结合文本和化学结构的网络搜索工具。
Bioinformatics. 2006 Jul 1;22(13):1670-3. doi: 10.1093/bioinformatics/btl155. Epub 2006 May 3.
5
Glycome-DB.org: a portal for querying across the digital world of carbohydrate sequences.Glycome-DB.org:一个用于查询碳水化合物序列数字世界的门户。
Glycobiology. 2009 Dec;19(12):1563-7. doi: 10.1093/glycob/cwp137. Epub 2009 Sep 16.
6
GlycoRDF: an ontology to standardize glycomics data in RDF.糖基RDF:一种用于在RDF中标准化糖组学数据的本体。
Bioinformatics. 2015 Mar 15;31(6):919-25. doi: 10.1093/bioinformatics/btu732. Epub 2014 Nov 11.
7
Introducing glycomics data into the Semantic Web.将糖组学数据引入语义网。
J Biomed Semantics. 2013 Nov 26;4(1):39. doi: 10.1186/2041-1480-4-39.
8
YeastHub: a semantic web use case for integrating data in the life sciences domain.酵母中心:生命科学领域数据整合的语义网用例。
Bioinformatics. 2005 Jun;21 Suppl 1:i85-96. doi: 10.1093/bioinformatics/bti1026.
9
A Querying Method over RDF-ized Health Level Seven v2.5 Messages Using Life Science Knowledge Resources.基于生命科学知识库的 RDF 化 HL7 v2.5 消息查询方法
JMIR Med Inform. 2016 Apr 5;4(2):e12. doi: 10.2196/medinform.5275.
10
LinkedImm: a linked data graph database for integrating immunological data.LinkedImm:用于整合免疫学数据的关联数据图数据库。
BMC Bioinformatics. 2021 Aug 25;22(Suppl 9):105. doi: 10.1186/s12859-021-04031-9.

引用本文的文献

1
Microbiome knowledge graph as a tool to understand bacteria-host associations.微生物组知识图谱作为理解细菌与宿主关联的工具。
Arch Microbiol. 2025 Aug 12;207(9):222. doi: 10.1007/s00203-025-04413-0.
2
An epidemiological knowledge graph extracted from the World Health Organization's Disease Outbreak News.从世界卫生组织疾病暴发新闻中提取的流行病学知识图谱。
Sci Data. 2025 Jun 10;12(1):970. doi: 10.1038/s41597-025-05276-2.
3
An ontology-based knowledge graph for representing interactions involving RNA molecules.基于本体的知识图谱,用于表示涉及 RNA 分子的相互作用。

本文引用的文献

1
Carbohydrate structure database merged from bacterial, archaeal, plant and fungal parts.碳水化合物结构数据库,由细菌、古细菌、植物和真菌部分合并而成。
Nucleic Acids Res. 2016 Jan 4;44(D1):D1229-36. doi: 10.1093/nar/gkv840. Epub 2015 Aug 18.
2
MzJava: An open source library for mass spectrometry data processing.MzJava:一个用于质谱数据处理的开源库。
J Proteomics. 2015 Nov 3;129:63-70. doi: 10.1016/j.jprot.2015.06.013. Epub 2015 Jun 30.
3
GlycoRDF: an ontology to standardize glycomics data in RDF.糖基RDF:一种用于在RDF中标准化糖组学数据的本体。
Sci Data. 2024 Aug 22;11(1):906. doi: 10.1038/s41597-024-03673-7.
4
A semantics, energy-based approach to automate biomodel composition.一种基于语义和能量的方法,用于自动化生物模型组合。
PLoS One. 2022 Jun 3;17(6):e0269497. doi: 10.1371/journal.pone.0269497. eCollection 2022.
5
Dealing with the Ambiguity of Glycan Substructure Search.处理糖基结构搜索的模糊性。
Molecules. 2021 Dec 23;27(1):65. doi: 10.3390/molecules27010065.
6
The glycoconjugate ontology (GlycoCoO) for standardizing the annotation of glycoconjugate data and its application.糖缀合物本体(GlycoCoO)用于规范糖缀合物数据的注释及其应用。
Glycobiology. 2021 Aug 7;31(7):741-750. doi: 10.1093/glycob/cwab013.
7
A consensus-based and readable extension of near de for eaction ules (LiCoRR).基于共识且易于阅读的近反应规则扩展(LiCoRR)。
Beilstein J Org Chem. 2020 Oct 27;16:2645-2662. doi: 10.3762/bjoc.16.215. eCollection 2020.
8
glypy: An Open Source Glycoinformatics Library.glypy:一个开源的糖生物信息学库。
J Proteome Res. 2019 Sep 6;18(9):3532-3537. doi: 10.1021/acs.jproteome.9b00367. Epub 2019 Jul 30.
9
A Bioinformatics View of Glycan⁻Virus Interactions.聚糖-病毒相互作用的生物信息学视角
Viruses. 2019 Apr 23;11(4):374. doi: 10.3390/v11040374.
10
Glycomics@ExPASy: Bridging the Gap.糖组学@ExPASy:弥合差距。
Mol Cell Proteomics. 2018 Nov;17(11):2164-2176. doi: 10.1074/mcp.RA118.000799. Epub 2018 Aug 10.
Bioinformatics. 2015 Mar 15;31(6):919-25. doi: 10.1093/bioinformatics/btu732. Epub 2014 Nov 11.
4
UniProt: a hub for protein information.通用蛋白质数据库(UniProt):蛋白质信息中心。
Nucleic Acids Res. 2015 Jan;43(Database issue):D204-12. doi: 10.1093/nar/gku989. Epub 2014 Oct 27.
5
WURCS: the Web3 unique representation of carbohydrate structures.WURCS:碳水化合物结构的Web3唯一表示形式。
J Chem Inf Model. 2014 Jun 23;54(6):1558-66. doi: 10.1021/ci400571e. Epub 2014 Jun 4.
6
KCF-S: KEGG Chemical Function and Substructure for improved interpretability and prediction in chemical bioinformatics.KCF-S:用于改善化学生物信息学中可解释性和预测性的KEGG化学功能与子结构
BMC Syst Biol. 2013;7 Suppl 6(Suppl 6):S2. doi: 10.1186/1752-0509-7-S6-S2. Epub 2013 Dec 13.
7
Toolboxes for a standardised and systematic study of glycans.聚糖标准化和系统研究的工具包。
BMC Bioinformatics. 2014;15 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-15-S1-S9. Epub 2014 Jan 10.
8
Introducing glycomics data into the Semantic Web.将糖组学数据引入语义网。
J Biomed Semantics. 2013 Nov 26;4(1):39. doi: 10.1186/2041-1480-4-39.
9
UniCarbKB: building a knowledge platform for glycoproteomics.UniCarbKB:构建糖蛋白质组学知识库平台。
Nucleic Acids Res. 2014 Jan;42(Database issue):D215-21. doi: 10.1093/nar/gkt1128. Epub 2013 Nov 13.
10
EUROCarbDB: An open-access platform for glycoinformatics.EUROCarbDB:一个用于聚糖信息学的开放获取平台。
Glycobiology. 2011 Apr;21(4):493-502. doi: 10.1093/glycob/cwq188. Epub 2010 Nov 23.