用于全基因组分析的Gigwa基因型研究工具

Gigwa-Genotype investigator for genome-wide analyses.

作者信息

Sempéré Guilhem, Philippe Florian, Dereeper Alexis, Ruiz Manuel, Sarah Gautier, Larmande Pierre

机构信息

UMR InterTryp (CIRAD), Campus International de Baillarguet, 34398, Montpellier, Cedex 5, France.

South Green Bioinformatics Platform, 1000 Avenue Agropolis, 34934, Montpellier, Cedex 5, France.

出版信息

Gigascience. 2016 Jun 6;5:25. doi: 10.1186/s13742-016-0131-8.

DOI:10.1186/s13742-016-0131-8

PMID:27267926

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4897896/

Abstract

BACKGROUND

Exploring the structure of genomes and analyzing their evolution is essential to understanding the ecological adaptation of organisms. However, with the large amounts of data being produced by next-generation sequencing, computational challenges arise in terms of storage, search, sharing, analysis and visualization. This is particularly true with regards to studies of genomic variation, which are currently lacking scalable and user-friendly data exploration solutions.

DESCRIPTION

Here we present Gigwa, a web-based tool that provides an easy and intuitive way to explore large amounts of genotyping data by filtering it not only on the basis of variant features, including functional annotations, but also on genotype patterns. The data storage relies on MongoDB, which offers good scalability properties. Gigwa can handle multiple databases and may be deployed in either single- or multi-user mode. In addition, it provides a wide range of popular export formats.

CONCLUSIONS

The Gigwa application is suitable for managing large amounts of genomic variation data. Its user-friendly web interface makes such processing widely accessible. It can either be simply deployed on a workstation or be used to provide a shared data portal for a given community of researchers.

摘要

背景

探索基因组结构并分析其进化对于理解生物体的生态适应性至关重要。然而，随着下一代测序产生大量数据，在存储、搜索、共享、分析和可视化方面出现了计算挑战。在基因组变异研究方面尤其如此，目前缺乏可扩展且用户友好的数据探索解决方案。

描述

在此，我们展示了Gigwa，这是一个基于网络的工具，它提供了一种简单直观的方式来通过不仅基于变异特征（包括功能注释）而且基于基因型模式对大量基因分型数据进行过滤，从而探索这些数据。数据存储依赖于MongoDB，它具有良好的可扩展性。Gigwa可以处理多个数据库，并且可以以单用户或多用户模式部署。此外，它提供了广泛的流行导出格式。

结论

Gigwa应用程序适用于管理大量基因组变异数据。其用户友好的网络界面使这种处理广泛可用。它既可以简单地部署在工作站上，也可以用于为特定的研究人员群体提供共享数据门户。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ad/4897896/51a7b7351a2d/13742_2016_131_Fig1_HTML.jpg

相似文献

Gigwa-Genotype investigator for genome-wide analyses.

Gigascience. 2016 Jun 6;5:25. doi: 10.1186/s13742-016-0131-8.

Gigwa v2-Extended and improved genotype investigator.

Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz051.

Managing High-Density Genotyping Data with Gigwa.

Methods Mol Biol. 2022;2443:415-427. doi: 10.1007/978-1-0716-2067-0_21.

Variant graph craft (VGC): a comprehensive tool for analyzing genetic variation and identifying disease-causing variants.

BMC Bioinformatics. 2024 Sep 3;25(1):288. doi: 10.1186/s12859-024-05875-7.

VCF-Server: A web-based visualization tool for high-throughput variant data mining and management.

Mol Genet Genomic Med. 2019 Jul;7(7):e00641. doi: 10.1002/mgg3.641. Epub 2019 May 24.

GeneTools--application for functional annotation and statistical hypothesis testing.

BMC Bioinformatics. 2006 Oct 24;7:470. doi: 10.1186/1471-2105-7-470.

V-MitoSNP: visualization of human mitochondrial SNPs.

BMC Bioinformatics. 2006 Aug 15;7:379. doi: 10.1186/1471-2105-7-379.

SnpHub: an easy-to-set-up web server framework for exploring large-scale genomic variation data in the post-genomic era with applications in wheat.

Gigascience. 2020 Jun 1;9(6). doi: 10.1093/gigascience/giaa060.

SNiPlay3: a web-based application for exploration and large scale analyses of genomic variations.

Nucleic Acids Res. 2015 Jul 1;43(W1):W295-300. doi: 10.1093/nar/gkv351. Epub 2015 Jun 3.

GLIDERS--a web-based search engine for genome-wide linkage disequilibrium between HapMap SNPs.

BMC Bioinformatics. 2009 Oct 31;10:367. doi: 10.1186/1471-2105-10-367.

引用本文的文献

The groundnut improvement network for Africa (GINA) germplasm collection: a unique genetic resource for breeding and gene discovery.

G3 (Bethesda). 2023 Dec 29;14(1). doi: 10.1093/g3journal/jkad244.

High density genotype storage for plant breeding in the Chado schema of Breedbase.

PLoS One. 2020 Nov 11;15(11):e0240059. doi: 10.1371/journal.pone.0240059. eCollection 2020.

Unravelling the complex story of intergenomic recombination in ABB allotriploid bananas.

Ann Bot. 2021 Jan 1;127(1):7-20. doi: 10.1093/aob/mcaa032.

Bioinformatics Workflows With NoSQL Database in Cloud Computing.

Evol Bioinform Online. 2019 Dec 5;15:1176934319889974. doi: 10.1177/1176934319889974. eCollection 2019.

Benchmarking database systems for Genomic Selection implementation.

Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz096.

Rice Galaxy: an open resource for plant science.

Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz028.

Gigwa v2-Extended and improved genotype investigator.

Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz051.

BrAPI-an application programming interface for plant breeding applications.

Bioinformatics. 2019 Oct 15;35(20):4147-4155. doi: 10.1093/bioinformatics/btz190.

MGIS: managing banana (Musa spp.) genetic resources information and high-throughput genotyping data.

Database (Oxford). 2017 Jan 1;2017. doi: 10.1093/database/bax046.

Erratum to: Gigwa-Genotype investigator for genome-wide analyses.

Gigascience. 2016 Nov 2;5(1):48. doi: 10.1186/s13742-016-0153-2.

本文引用的文献

Quantitative trait loci for rice blast resistance detected in a local rice breeding population by genome-wide association mapping.

Breed Sci. 2015 Dec;65(5):388-95. doi: 10.1270/jsbbs.65.388. Epub 2015 Dec 1.

OryzaGenome: Genome Diversity Database of Wild Oryza Species.

Plant Cell Physiol. 2016 Jan;57(1):e1. doi: 10.1093/pcp/pcv171. Epub 2015 Nov 16.

Joint genome-wide association study for milk fatty acid traits in Chinese and Danish Holstein populations.

J Dairy Sci. 2015 Nov;98(11):8152-63. doi: 10.3168/jds.2015-9383. Epub 2015 Sep 9.

Genome Wide Association Mapping for Arabinoxylan Content in a Collection of Tetraploid Wheats.

PLoS One. 2015 Jul 15;10(7):e0132787. doi: 10.1371/journal.pone.0132787. eCollection 2015.

Functional classification of 15 million SNPs detected from diverse chicken populations.

DNA Res. 2015 Jun;22(3):205-17. doi: 10.1093/dnares/dsv005. Epub 2015 Apr 29.

High dimensional biological data retrieval optimization with NoSQL technology.

BMC Genomics. 2014;15 Suppl 8(Suppl 8):S3. doi: 10.1186/1471-2164-15-S8-S3. Epub 2014 Nov 13.

SNP-Seek database of SNPs derived from 3000 rice genomes.

Nucleic Acids Res. 2015 Jan;43(Database issue):D1023-7. doi: 10.1093/nar/gku1039. Epub 2014 Nov 27.

bam.iobio: a web-based, real-time, sequence alignment file inspector.

Nat Methods. 2014 Dec;11(12):1189. doi: 10.1038/nmeth.3174.

WhopGenome: high-speed access to whole-genome variation and sequence data in R.

Bioinformatics. 2015 Feb 1;31(3):413-5. doi: 10.1093/bioinformatics/btu636. Epub 2014 Oct 1.

VariantAnnotation: a Bioconductor package for exploration and annotation of genetic variants.

Bioinformatics. 2014 Jul 15;30(14):2076-8. doi: 10.1093/bioinformatics/btu168. Epub 2014 Mar 28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于全基因组分析的Gigwa基因型研究工具

Gigwa-Genotype investigator for genome-wide analyses.

作者信息

Sempéré Guilhem, Philippe Florian, Dereeper Alexis, Ruiz Manuel, Sarah Gautier, Larmande Pierre

机构信息

UMR InterTryp (CIRAD), Campus International de Baillarguet, 34398, Montpellier, Cedex 5, France.

South Green Bioinformatics Platform, 1000 Avenue Agropolis, 34934, Montpellier, Cedex 5, France.