Fast-GBS: a new pipeline for the efficient and highly accurate calling of SNPs from genotyping-by-sequencing data.

Author information

Torkamaneh Davoud, Laroche Jérôme, Bastien Maxime, Abed Amina, Belzile François

Affiliations

Département de Phytologie, Université Laval, Quebec City, QC, Canada.

Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec City, QC, Canada.

Publication information

BMC Bioinformatics. 2017 Jan 3;18(1):5. doi: 10.1186/s12859-016-1431-9.

DOI:10.1186/s12859-016-1431-9

PMID:28049422

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5210301/

Abstract

BACKGROUND

Next-generation sequencing (NGS) technologies have considerably accelerated the investigation of genome composition and function. Genotyping-by-sequencing (GBS) is a genotyping approach that makes use of NGS to rapidly and economically scan a genome. It has been shown to allow the simultaneous discovery and genotyping of thousands to millions of SNPs across a wide range of species. For most users, the main challenge in GBS is the bioinformatics analysis of the large amount of sequence data produced by sequencing GBS libraries in order to call alleles at SNP loci. Herein we describe a new GBS bioinformatics pipeline, Fast-GBS, designed to provide highly accurate genotyping, to require modest computing resources, and to offer ease of use.

RESULTS

Fast-GBS is built upon standard bioinformatics language and file formats, can handle data from different sequencing platforms, and can detect different kinds of variants (SNPs, MNPs, and indels). To illustrate its performance, we called variants in three collections of samples (soybean, barley, and potato) that cover a range of genome sizes, levels of genome complexity, and ploidy. Within these small sets of samples, we called 35 k, 32 k, and 38 k SNPs for soybean, barley, and potato, respectively. To assess genotype accuracy, we compared these GBS-derived SNP genotypes with independent data sets obtained from whole-genome sequencing or SNP arrays. This analysis yielded estimated accuracies of 98.7%, 95.2%, and 94% for soybean, barley, and potato, respectively.

CONCLUSIONS

We conclude that Fast-GBS provides a highly efficient and reliable tool for calling SNPs from GBS data.
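
The accuracy estimates in the Results come from comparing GBS-derived genotypes with an independent data set. The abstract does not spell out the exact calculation, so the following is only a minimal sketch under one plausible assumption: accuracy is taken as the fraction of genotypes called in both data sets that agree. The function names, the `(sample, locus)` keying, and the toy genotypes are all hypothetical and are not part of Fast-GBS.

```python
from typing import Dict, Optional, Tuple

# Genotype tables keyed by (sample, locus); None marks a missing call.
# All names and the toy data below are hypothetical, for illustration only.
Genotypes = Dict[Tuple[str, str], Optional[str]]


def normalize(call: str) -> str:
    """Order alleles so that 'A/G' and 'G/A' compare equal."""
    return "/".join(sorted(call.replace("|", "/").split("/")))


def concordance(gbs: Genotypes, reference: Genotypes) -> Tuple[int, int, float]:
    """Return (matches, compared, rate) over calls present in both sets."""
    matches = 0
    compared = 0
    for key, gbs_call in gbs.items():
        ref_call = reference.get(key)
        if gbs_call is None or ref_call is None:
            continue  # skip genotypes missing in either data set
        compared += 1
        if normalize(gbs_call) == normalize(ref_call):
            matches += 1
    rate = matches / compared if compared else float("nan")
    return matches, compared, rate


# Toy GBS calls and an independent reference (e.g. array or WGS genotypes).
gbs_calls = {
    ("s1", "chr1:100"): "A/A",
    ("s1", "chr1:250"): "A/G",
    ("s2", "chr1:100"): "G/G",
    ("s2", "chr1:250"): None,
}
reference_calls = {
    ("s1", "chr1:100"): "A/A",
    ("s1", "chr1:250"): "G/A",
    ("s2", "chr1:100"): "A/G",
    ("s2", "chr1:250"): "G/G",
}

matches, compared, rate = concordance(gbs_calls, reference_calls)
print(f"{matches}/{compared} concordant ({rate:.1%})")
```

Missing calls are excluded from the denominator here, so the rate reflects agreement among genotypes that were actually called in both sets; a study could reasonably choose a different convention.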
