• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ODG:组学数据库生成器——一种用于生成、查询和分析多组学比较数据库以促进生物学理解的工具。

ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding.

作者信息

Guhlin Joseph, Silverstein Kevin A T, Zhou Peng, Tiffin Peter, Young Nevin D

机构信息

Department of Plant and Microbial Biology, 140 Gortner Laboratory, 1479 Gortner Avenue, University of Minnesota, St. Paul, MN, 55108, USA.

Minnesota Supercomputing Institute, 599 Walter Library, 117 Pleasant St. SE, Minneapolis, MN, 55455, USA.

出版信息

BMC Bioinformatics. 2017 Aug 10;18(1):367. doi: 10.1186/s12859-017-1777-7.

DOI:10.1186/s12859-017-1777-7
PMID:28797229
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5553995/
Abstract

BACKGROUND

Rapid generation of omics data in recent years have resulted in vast amounts of disconnected datasets without systemic integration and knowledge building, while individual groups have made customized, annotated datasets available on the web with few ways to link them to in-lab datasets. With so many research groups generating their own data, the ability to relate it to the larger genomic and comparative genomic context is becoming increasingly crucial to make full use of the data.

RESULTS

The Omics Database Generator (ODG) allows users to create customized databases that utilize published genomics data integrated with experimental data which can be queried using a flexible graph database. When provided with omics and experimental data, ODG will create a comparative, multi-dimensional graph database. ODG can import definitions and annotations from other sources such as InterProScan, the Gene Ontology, ENZYME, UniPathway, and others. This annotation data can be especially useful for studying new or understudied species for which transcripts have only been predicted, and rapidly give additional layers of annotation to predicted genes. In better studied species, ODG can perform syntenic annotation translations or rapidly identify characteristics of a set of genes or nucleotide locations, such as hits from an association study. ODG provides a web-based user-interface for configuring the data import and for querying the database. Queries can also be run from the command-line and the database can be queried directly through programming language hooks available for most languages. ODG supports most common genomic formats as well as generic, easy to use tab-separated value format for user-provided annotations.

CONCLUSIONS

ODG is a user-friendly database generation and query tool that adapts to the supplied data to produce a comparative genomic database or multi-layered annotation database. ODG provides rapid comparative genomic annotation and is therefore particularly useful for non-model or understudied species. For species for which more data are available, ODG can be used to conduct complex multi-omics, pattern-matching queries.

摘要

背景

近年来,组学数据的快速生成导致大量数据集相互孤立,缺乏系统整合和知识构建,而各个研究小组虽在网络上提供了定制的注释数据集,但将这些数据集与实验室内部数据集相链接的方式却很少。由于众多研究小组都在生成各自的数据,因此将这些数据与更大的基因组和比较基因组背景相关联的能力对于充分利用数据而言变得愈发关键。

结果

组学数据库生成器(ODG)允许用户创建定制数据库,该数据库利用已发表的基因组学数据与实验数据进行整合,并可通过灵活的图形数据库进行查询。当提供组学数据和实验数据时,ODG将创建一个比较性的多维图形数据库。ODG能够从其他来源(如InterProScan、基因本体论、ENZYME、UniPathway等)导入定义和注释。这些注释数据对于研究新的或研究较少的物种(其转录本仅为预测所得)特别有用,能够迅速为预测基因提供额外的注释层。在研究较为充分的物种中,ODG可以进行共线性注释翻译,或快速识别一组基因或核苷酸位置的特征,如关联研究中的命中结果。ODG提供了基于网络的用户界面,用于配置数据导入和查询数据库。查询也可以从命令行运行,并且可以通过适用于大多数语言的编程语言钩子直接查询数据库。ODG支持大多数常见的基因组格式以及通用的、易于使用的制表符分隔值格式,用于用户提供的注释。

结论

ODG是一个用户友好的数据库生成和查询工具,它能根据提供的数据生成比较基因组数据库或多层注释数据库。ODG提供快速的比较基因组注释,因此对于非模式物种或研究较少的物种特别有用。对于有更多数据可用的物种,ODG可用于进行复杂的多组学模式匹配查询。

相似文献

1
ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding.ODG:组学数据库生成器——一种用于生成、查询和分析多组学比较数据库以促进生物学理解的工具。
BMC Bioinformatics. 2017 Aug 10;18(1):367. doi: 10.1186/s12859-017-1777-7.
2
CGKB: an annotation knowledge base for cowpea (Vigna unguiculata L.) methylation filtered genomic genespace sequences.CGKB:豇豆(Vigna unguiculata L.)甲基化过滤基因组基因空间序列的注释知识库。
BMC Bioinformatics. 2007 Apr 19;8:129. doi: 10.1186/1471-2105-8-129.
3
TabSQL: a MySQL tool to facilitate mapping user data to public databases.TabSQL:一个 MySQL 工具,用于方便将用户数据映射到公共数据库。
BMC Bioinformatics. 2010 Jun 23;11:342. doi: 10.1186/1471-2105-11-342.
4
5
The MOLGENIS toolkit: rapid prototyping of biosoftware at the push of a button.MOLGENIS 工具包:一键快速原型生物软件。
BMC Bioinformatics. 2010 Dec 21;11 Suppl 12(Suppl 12):S12. doi: 10.1186/1471-2105-11-S12-S12.
6
The pear genomics database (PGDB): a comprehensive multi-omics research platform for Pyrus spp.梨基因组学数据库 (PGDB):梨属植物综合多组学研究平台
BMC Plant Biol. 2023 Sep 15;23(1):430. doi: 10.1186/s12870-023-04406-5.
7
Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission.基因组注释生成器:一个用于生成和纠正 WGS 注释表以便提交给 NCBI 的简单工具。
Gigascience. 2018 Apr 1;7(4):1-5. doi: 10.1093/gigascience/giy018.
8
ASGARD: an open-access database of annotated transcriptomes for emerging model arthropod species.ASGARD:新兴模式节肢动物物种注释转录组的开放获取数据库。
Database (Oxford). 2012 Nov 23;2012:bas048. doi: 10.1093/database/bas048. Print 2012.
9
MILANO--custom annotation of microarray results using automatic literature searches.米兰——使用自动文献检索对微阵列结果进行定制注释。
BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12.
10
Mitochondrial Disease Sequence Data Resource (MSeqDR): a global grass-roots consortium to facilitate deposition, curation, annotation, and integrated analysis of genomic data for the mitochondrial disease clinical and research communities.线粒体疾病序列数据资源(MSeqDR):一个全球基层联盟,旨在促进为线粒体疾病临床和研究群体进行基因组数据的提交、管理、注释及综合分析。
Mol Genet Metab. 2015 Mar;114(3):388-96. doi: 10.1016/j.ymgme.2014.11.016. Epub 2014 Dec 4.

引用本文的文献

1
The Sordariomycetes: an expanding resource with Big Data for mining in evolutionary genomics and transcriptomics.粪壳菌纲:一个在进化基因组学和转录组学中用于大数据挖掘的不断扩展的资源。
Front Fungal Biol. 2023 Jun 30;4:1214537. doi: 10.3389/ffunb.2023.1214537. eCollection 2023.
2
Development of a knowledge graph framework to ease and empower translational approaches in plant research: a use-case on grain legumes.开发一个知识图谱框架以简化并增强植物研究中的转化方法:以豆科作物为例
Front Artif Intell. 2023 Aug 3;6:1191122. doi: 10.3389/frai.2023.1191122. eCollection 2023.
3
Ecosystem-specific microbiota and microbiome databases in the era of big data.

本文引用的文献

1
Draft Genome Sequences of Four Novel Thermal- and Alkaline-Tolerant Egyptian Rhizobium Strains Nodulating Berseem Clover.四株能使埃及三叶草结瘤的新型耐热耐碱根瘤菌的基因组草图序列
Genome Announc. 2016 Sep 15;4(5):e00988-16. doi: 10.1128/genomeA.00988-16.
2
InterProScan 5: genome-scale protein function classification.InterProScan 5:基因组规模的蛋白质功能分类。
Bioinformatics. 2014 May 1;30(9):1236-40. doi: 10.1093/bioinformatics/btu031. Epub 2014 Jan 21.
3
Soybean knowledge base (SoyKB): a web resource for integration of soybean translational genomics and molecular breeding.
大数据时代特定生态系统的微生物群和微生物组数据库
Environ Microbiome. 2022 Jul 16;17(1):37. doi: 10.1186/s40793-022-00433-1.
4
Chinese Herbal Medicine Hepatotoxicity: The Evaluation and Recognization Based on Large-scale Evidence Database.中草药肝毒性:基于大规模证据数据库的评估与识别。
Curr Drug Metab. 2019;20(2):138-146. doi: 10.2174/1389200219666180813144114.
5
The complete replicons of 16 Ensifer meliloti strains offer insights into intra- and inter-replicon gene transfer, transposon-associated loci, and repeat elements.16 株苜蓿中华根瘤菌完整复制子的研究为研究复制子内和复制子间的基因转移、转座子相关基因座和重复元件提供了线索。
Microb Genom. 2018 May;4(5). doi: 10.1099/mgen.0.000174. Epub 2018 Apr 19.
大豆知识库 (SoyKB): 整合大豆转化基因组学和分子育种的网络资源。
Nucleic Acids Res. 2014 Jan;42(Database issue):D1245-52. doi: 10.1093/nar/gkt905. Epub 2013 Oct 16.
4
Comparative genomics of the core and accessory genomes of 48 Sinorhizobium strains comprising five genospecies.48株包含五个基因种的中华根瘤菌菌株核心基因组与辅助基因组的比较基因组学研究
Genome Biol. 2013 Feb 20;14(2):R17. doi: 10.1186/gb-2013-14-2-r17.
5
The UCSC genome browser and associated tools.UCSC 基因组浏览器及相关工具。
Brief Bioinform. 2013 Mar;14(2):144-61. doi: 10.1093/bib/bbs038. Epub 2012 Aug 20.
6
Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.RNA-seq 实验中使用 TopHat 和 Cufflinks 的差异基因和转录本表达分析。
Nat Protoc. 2012 Mar 1;7(3):562-78. doi: 10.1038/nprot.2012.016.
7
Prodigal: prokaryotic gene recognition and translation initiation site identification.普罗迪格:原核基因识别和翻译起始位点鉴定。
BMC Bioinformatics. 2010 Mar 8;11:119. doi: 10.1186/1471-2105-11-119.
8
SoyBase, the USDA-ARS soybean genetics and genomics database.大豆基础数据库,美国农业部农业研究服务部大豆遗传学和基因组学数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D843-6. doi: 10.1093/nar/gkp798. Epub 2009 Dec 14.
9
The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases.代谢途径与酶的MetaCyc数据库以及途径/基因组数据库的BioCyc集合。
Nucleic Acids Res. 2008 Jan;36(Database issue):D623-31. doi: 10.1093/nar/gkm900. Epub 2007 Oct 27.
10
BioGRID: a general repository for interaction datasets.生物通用互作数据集知识库(BioGRID):一个交互数据集的通用存储库。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D535-9. doi: 10.1093/nar/gkj109.