
Implementation of linked data in the life sciences at BioHackathon 2011.

Author information

Aoki-Kinoshita Kiyoko F, Kinjo Akira R, Morita Mizuki, Igarashi Yoshinobu, Chen Yi-An, Shigemoto Yasumasa, Fujisawa Takatomo, Akune Yukie, Katoda Takeo, Kokubu Anna, Mori Takaaki, Nakao Mitsuteru, Kawashima Shuichi, Okamoto Shinobu, Katayama Toshiaki, Ogishima Soichi

Affiliations

Department of Bioinformatics, Faculty of Engineering, Soka University, 1-236 Tangi-machi, Hachioji, Tokyo, 192-8577 Japan.

Laboratory of Protein Informatics, Laboratory of Protein Databases, and Protein Data Bank Japan, Research Center for Structural and Functional Proteomics, Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871 Japan.

Publication information

J Biomed Semantics. 2015 Jan 7;6:3. doi: 10.1186/2041-1480-6-3. eCollection 2015.

DOI:10.1186/2041-1480-6-3
PMID:25973165
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4429360/

Abstract

BACKGROUND

Linked Data has gained some attention recently in the life sciences as an effective way to provide and share data. As a part of the Semantic Web, data are linked so that a person or machine can explore the web of data. Resource Description Framework (RDF) is the standard means of implementing Linked Data. In the process of generating RDF data, not only are data simply linked to one another, the links themselves are characterized by ontologies, thereby allowing the types of links to be distinguished. Although there is a high labor cost to define an ontology for data providers, the merit lies in the higher level of interoperability with data analysis and visualization software. This increase in interoperability facilitates the multi-faceted retrieval of data, and the appropriate data can be quickly extracted and visualized. Such retrieval is usually performed using the SPARQL (SPARQL Protocol and RDF Query Language) query language, which is used to query RDF data stores. For the database provider, such interoperability will surely lead to an increase in the number of users.

RESULTS

This manuscript describes the experiences and discussions shared among participants of the week-long BioHackathon 2011, who developed RDF representations of their own data along with specific RDF and SPARQL use cases. Advice on what to consider when developing RDF representations is provided for bioinformaticians who are considering making their data available and interoperable.

CONCLUSIONS

Participants of the BioHackathon 2011 were able to produce RDF representations of their data and gain a better understanding of the requirements for producing such data in a period of just five days. We summarize the work accomplished with the hope that it will be useful for researchers involved in developing laboratory databases or data analysis, and those who are considering such technologies as RDF and Linked Data.
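
The background above describes data linked by typed relations and retrieved through pattern-based queries. A minimal sketch of that idea follows. It uses plain Python tuples rather than a real RDF store or SPARQL engine, and every identifier in it (the `ex:` names, the predicates `ex:encodedBy` and `ex:hasStructure`, and the helper functions) is hypothetical and not taken from the paper.

```python
from typing import Iterator, List, Optional, Tuple

# A triple is (subject, predicate, object). The predicate names the type of link,
# which is the role an ontology term plays in RDF.
Triple = Tuple[str, str, str]

# Hypothetical example data; these identifiers are illustrative assumptions.
TRIPLES: List[Triple] = [
    ("ex:protein1", "rdf:type", "ex:Protein"),
    ("ex:protein1", "ex:encodedBy", "ex:gene1"),
    ("ex:protein1", "ex:hasStructure", "ex:structure1"),
    ("ex:protein2", "rdf:type", "ex:Protein"),
    ("ex:protein2", "ex:encodedBy", "ex:gene2"),
]


def match(
    triples: List[Triple],
    s: Optional[str] = None,
    p: Optional[str] = None,
    o: Optional[str] = None,
) -> Iterator[Triple]:
    """Yield triples matching a pattern; None acts as a wildcard, like a SPARQL variable."""
    for t in triples:
        if (s is None or t[0] == s) and (p is None or t[1] == p) and (o is None or t[2] == o):
            yield t


def genes_of_proteins_with_structure(triples: List[Triple]) -> List[Tuple[str, str]]:
    """Join two patterns on the shared subject, roughly like the SPARQL query:

    SELECT ?protein ?gene WHERE {
        ?protein ex:hasStructure ?structure .
        ?protein ex:encodedBy ?gene .
    }
    """
    results = []
    for protein, _, _ in match(triples, p="ex:hasStructure"):
        for _, _, gene in match(triples, s=protein, p="ex:encodedBy"):
            results.append((protein, gene))
    return results


if __name__ == "__main__":
    for protein, gene in genes_of_proteins_with_structure(TRIPLES):
        print(protein, gene)
```

Because each link carries its own type, the same data can be retrieved from several angles (by type, by gene, by structure) simply by changing the pattern, which is the multi-faceted retrieval the abstract refers to.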


Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9841/4429360/a8fde88854ce/13326_2013_215_Fig1_HTML.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9841/4429360/17b5d5769e50/13326_2013_215_Fig2_HTML.jpg
Figure 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9841/4429360/d6e3dd528db0/13326_2013_215_Fig3_HTML.jpg
Figure 4: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9841/4429360/8eb4bef313cf/13326_2013_215_Fig4_HTML.jpg
