FALDO: a semantic standard for describing the location of nucleotide and protein feature annotation.

Authors

Bolleman Jerven T, Mungall Christopher J, Strozzi Francesco, Baran Joachim, Dumontier Michel, Bonnal Raoul J P, Buels Robert, Hoehndorf Robert, Fujisawa Takatomo, Katayama Toshiaki, Cock Peter J A

Affiliations

Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, 1 rue Michel-Servet, Geneva 4, 1211, Switzerland.

Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, 94720, CA, US.

Publication information

J Biomed Semantics. 2016 Jun 13;7:39. doi: 10.1186/s13326-016-0067-z.

DOI:10.1186/s13326-016-0067-z
PMID:27296299
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4907002/
Abstract

BACKGROUND

Nucleotide and protein sequence feature annotations are essential to understanding biology at the genomic, transcriptomic, and proteomic levels. For querying biological annotations with Semantic Web technologies, however, there was no standard that described this potentially complex location information as subject-predicate-object triples.

DESCRIPTION

We have developed an ontology, the Feature Annotation Location Description Ontology (FALDO), to describe the positions of annotated features on linear and circular sequences. FALDO can be used to describe nucleotide features in sequence records, protein annotations, and glycan binding sites, among other features in the coordinate systems of the aforementioned "omics" areas. Representing sequence positions in a single data format that is independent of file formats allows us to integrate sequence data from multiple sources and data types. The genome browser JBrowse is used to demonstrate accessing multiple SPARQL endpoints to display genomic feature annotations, as well as protein annotations from UniProt mapped to genomic locations.
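
To make the idea of describing a location as subject-predicate-object triples concrete, here is a minimal Python sketch. It is not taken from the paper. The namespace and the term names (`Region`, `ExactPosition`, `ForwardStrandPosition`, `ReverseStrandPosition`, `location`, `begin`, `end`, `position`, `reference`) are those of the FALDO ontology as we understand it. The `example.org` IRIs, the `region_triples` helper, and the way the region and position IRIs are derived from the feature IRI are assumptions made purely for illustration.

```python
# Minimal sketch: express one feature location as FALDO-style triples.
# The example.org IRIs and the helper name are hypothetical; coordinates
# are assumed to be 1-based positions on the reference sequence.
FALDO = "http://biohackathon.org/resource/faldo#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def region_triples(feature, reference, begin, end, forward=True):
    """Return (subject, predicate, object) triples locating `feature`."""
    region = feature + "/region"
    strand = FALDO + ("ForwardStrandPosition" if forward else "ReverseStrandPosition")
    triples = [
        (feature, FALDO + "location", region),
        (region, RDF_TYPE, FALDO + "Region"),
    ]
    # Each end of the region is its own resource, so it can carry its own
    # type (exact, stranded) and its own reference sequence.
    for name, value in (("begin", begin), ("end", end)):
        position = f"{region}/{name}"
        triples += [
            (region, FALDO + name, position),
            (position, RDF_TYPE, FALDO + "ExactPosition"),
            (position, RDF_TYPE, strand),
            (position, FALDO + "position", value),
            (position, FALDO + "reference", reference),
        ]
    return triples


if __name__ == "__main__":
    for triple in region_triples(
        "http://example.org/feature/gene1",
        "http://example.org/sequence/chr1",
        begin=100,
        end=250,
    ):
        print(triple)
```

Because the begin and end positions are separate resources, the same pattern can in principle be applied to nucleotide and protein sequences alike, regardless of the file format the annotation originally came from.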

CONCLUSIONS

Our ontology allows users to uniformly describe, and potentially merge, sequence annotations from multiple sources. Data sources using FALDO can prospectively be retrieved using federated SPARQL queries against public SPARQL endpoints and/or local private triple stores.
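
As a rough illustration of what a federated query over FALDO-described data might look like, the Python sketch below assembles a SPARQL 1.1 query string with one `SERVICE` clause per endpoint. It is an assumption-laden example rather than a query from the paper: the endpoint URLs are placeholders, the `federated_location_query` helper is hypothetical, and the graph pattern assumes each source attaches locations with `faldo:location`, `faldo:begin`, `faldo:end`, and `faldo:position`.

```python
# Minimal sketch: build a federated SPARQL 1.1 query as a plain string.
# Endpoint URLs passed in are placeholders chosen by the caller; nothing
# here contacts a network service.
FALDO_PREFIX = "PREFIX faldo: <http://biohackathon.org/resource/faldo#>"


def federated_location_query(endpoints, limit=10):
    """Ask each endpoint for features with their begin and end positions."""
    pattern = (
        "    SERVICE <{url}> {{\n"
        "      ?feature faldo:location ?region .\n"
        "      ?region faldo:begin/faldo:position ?begin ;\n"
        "              faldo:end/faldo:position ?end .\n"
        "    }}"
    )
    # One group per endpoint, combined with UNION so results are merged.
    blocks = ["  {\n" + pattern.format(url=url) + "\n  }" for url in endpoints]
    body = "\n  UNION\n".join(blocks)
    return (
        f"{FALDO_PREFIX}\n"
        "SELECT ?feature ?begin ?end WHERE {\n"
        f"{body}\n"
        "}\n"
        f"LIMIT {int(limit)}"
    )


if __name__ == "__main__":
    print(federated_location_query([
        "http://example.org/public/sparql",   # placeholder public endpoint
        "http://localhost:8890/sparql",       # placeholder local triple store
    ]))
```

The resulting string would need to be sent to a query engine that supports SPARQL 1.1 federation; whether a given public endpoint permits `SERVICE` calls is something to check per service.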
