TASSEL-GBS：一种用于测序分析流程的高容量基因分型方法。

TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline.

作者信息

Glaubitz Jeffrey C, Casstevens Terry M, Lu Fei, Harriman James, Elshire Robert J, Sun Qi, Buckler Edward S

机构信息

Institute for Genomic Diversity, Cornell University, Ithaca, New York, United States of America.

Biotechnology Resource Center Bioinformatics Facility, Cornell University, Ithaca, New York, United States of America.

出版信息

PLoS One. 2014 Feb 28;9(2):e90346. doi: 10.1371/journal.pone.0090346. eCollection 2014.

DOI:10.1371/journal.pone.0090346

PMID:24587335

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3938676/

Abstract

Genotyping by sequencing (GBS) is a next generation sequencing based method that takes advantage of reduced representation to enable high throughput genotyping of large numbers of individuals at a large number of SNP markers. The relatively straightforward, robust, and cost-effective GBS protocol is currently being applied in numerous species by a large number of researchers. Herein we describe a bioinformatics pipeline, TASSEL-GBS, designed for the efficient processing of raw GBS sequence data into SNP genotypes. The TASSEL-GBS pipeline successfully fulfills the following key design criteria: (1) Ability to run on the modest computing resources that are typically available to small breeding or ecological research programs, including desktop or laptop machines with only 8-16 GB of RAM, (2) Scalability from small to extremely large studies, where hundreds of thousands or even millions of SNPs can be scored in up to 100,000 individuals (e.g., for large breeding programs or genetic surveys), and (3) Applicability in an accelerated breeding context, requiring rapid turnover from tissue collection to genotypes. Although a reference genome is required, the pipeline can also be run with an unfinished "pseudo-reference" consisting of numerous contigs. We describe the TASSEL-GBS pipeline in detail and benchmark it based upon a large scale, species wide analysis in maize (Zea mays), where the average error rate was reduced to 0.0042 through application of population genetic-based SNP filters. Overall, the GBS assay and the TASSEL-GBS pipeline provide robust tools for studying genomic diversity.

摘要

测序基因分型（GBS）是一种基于新一代测序的方法，它利用简化基因组表征来实现对大量个体在大量单核苷酸多态性（SNP）标记上的高通量基因分型。相对简单、稳健且经济高效的GBS方案目前正被大量研究人员应用于众多物种。在此，我们描述了一个生物信息学流程TASSEL - GBS，其设计用于将原始GBS序列数据高效处理为SNP基因型。TASSEL - GBS流程成功满足了以下关键设计标准：（1）能够在小型育种或生态研究项目通常可用的适度计算资源上运行，包括仅有8 - 16GB随机存取存储器的台式机或笔记本电脑；（2）从小规模研究到超大规模研究的可扩展性，在超大规模研究中，可对数以十万计甚至数百万计的个体中的数十万个甚至数百万个SNP进行评分（例如，用于大型育种项目或遗传调查）；（3）适用于加速育种环境，要求从组织采集到获得基因型的周转迅速。尽管需要一个参考基因组，但该流程也可以使用由众多重叠群组成的未完成“伪参考”来运行。我们详细描述了TASSEL - GBS流程，并基于在玉米（Zea mays）中进行的大规模全物种分析对其进行了基准测试，通过应用基于群体遗传学的SNP筛选，平均错误率降至0.0042。总体而言，GBS分析和TASSEL - GBS流程为研究基因组多样性提供了强大的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a64/3938676/7035fc65ade4/pone.0090346.g001.jpg

相似文献

TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline.TASSEL-GBS：一种用于测序分析流程的高容量基因分型方法。

PLoS One. 2014 Feb 28;9(2):e90346. doi: 10.1371/journal.pone.0090346. eCollection 2014.

Genome-Wide SNP Calling from Genotyping by Sequencing (GBS) Data: A Comparison of Seven Pipelines and Two Sequencing Technologies.基于测序基因分型（GBS）数据的全基因组单核苷酸多态性（SNP）检测：七种流程和两种测序技术的比较

PLoS One. 2016 Aug 22;11(8):e0161333. doi: 10.1371/journal.pone.0161333. eCollection 2016.

GBS-SNP-CROP: a reference-optional pipeline for SNP discovery and plant germplasm characterization using variable length, paired-end genotyping-by-sequencing data.GBS-SNP-CROP：一种用于单核苷酸多态性（SNP）发现和植物种质特征分析的无参考序列流程，使用可变长度的双端测序基因分型数据。

BMC Bioinformatics. 2016 Jan 12;17:29. doi: 10.1186/s12859-016-0879-y.

UGbS-Flex, a novel bioinformatics pipeline for imputation-free SNP discovery in polyploids without a reference genome: finger millet as a case study.UGbS-Flex，一种新型的生物信息学管道，用于在没有参考基因组的情况下对多倍体进行无插补 SNP 发现：以手指小米为例。

BMC Plant Biol. 2018 Jun 15;18(1):117. doi: 10.1186/s12870-018-1316-3.

Fast-GBS: a new pipeline for the efficient and highly accurate calling of SNPs from genotyping-by-sequencing data.Fast-GBS：一种用于从测序基因分型数据中高效且高精度地调用单核苷酸多态性（SNP）的新流程。

BMC Bioinformatics. 2017 Jan 3;18(1):5. doi: 10.1186/s12859-016-1431-9.

Applications of genotyping-by-sequencing (GBS) in maize genetics and breeding.基于测序的基因分型（GBS）在玉米遗传学和育种中的应用。

Sci Rep. 2020 Oct 1;10(1):16308. doi: 10.1038/s41598-020-73321-8.

Bridging the genotyping gap: using genotyping by sequencing (GBS) to add high-density SNP markers and new value to traditional bi-parental mapping and breeding populations.弥合基因分型差距：利用测序基因分型 (GBS) 为传统的双亲作图和育种群体添加高密度 SNP 标记和新价值。

Theor Appl Genet. 2013 Nov;126(11):2699-716. doi: 10.1007/s00122-013-2166-x. Epub 2013 Aug 6.

Integrating targeted genetic markers to genotyping-by-sequencing for an ultimate genotyping tool.整合靶向遗传标记与测序基因分型，以获得最终的基因分型工具。

Theor Appl Genet. 2024 Oct 4;137(10):247. doi: 10.1007/s00122-024-04750-6.

Application of genotyping-by-sequencing on semiconductor sequencing platforms: a comparison of genetic and reference-based marker ordering in barley.基于测序的基因分型在半导体测序平台上的应用：大麦中遗传标记和参考标记排序的比较。

PLoS One. 2013 Oct 3;8(10):e76925. doi: 10.1371/journal.pone.0076925. eCollection 2013.

A comparison of genotyping-by-sequencing analysis methods on low-coverage crop datasets shows advantages of a new workflow, GB-eaSy.对低覆盖作物数据集的测序分析方法的比较表明，一种新的工作流程 GB-eaSy 具有优势。

BMC Bioinformatics. 2017 Dec 28;18(1):586. doi: 10.1186/s12859-017-2000-6.

引用本文的文献

Insights into the genetic and biochemical basis of Gibberella ear rot resistance in maize.对玉米赤霉穗腐病抗性的遗传和生化基础的见解。

Plant Genome. 2025 Sep;18(3):e70099. doi: 10.1002/tpg2.70099.

Exploring the biochemical and molecular mechanisms that contribute to Huanglongbing (HLB) tolerance in Citrus australis hybrids.探索澳洲柑橘杂交种中有助于黄龙病（HLB）耐受性的生化和分子机制。

BMC Genomics. 2025 Aug 19;26(1):761. doi: 10.1186/s12864-025-11942-x.

The genetic basis of microbiome recruitment in grapevine and its association with fermentative and pathogenic taxa.葡萄藤中微生物群落募集的遗传基础及其与发酵和致病类群的关联。

New Phytol. 2025 Oct;248(1):178-192. doi: 10.1111/nph.70387. Epub 2025 Jul 22.

Genetic and environmental influences on the distributions of three chromosomal drive haplotypes in maize.遗传和环境对玉米中三种染色体驱动单倍型分布的影响。

PLoS Genet. 2025 Jul 16;21(7):e1011742. doi: 10.1371/journal.pgen.1011742. eCollection 2025 Jul.

Genome-wide association study reveals the QTLs and candidate genes associated with seed longevity in soybean (Glycine max (L.) Merrill).全基因组关联研究揭示了与大豆（Glycine max (L.) Merrill）种子寿命相关的数量性状位点和候选基因。

BMC Plant Biol. 2025 Jul 2;25(1):829. doi: 10.1186/s12870-025-06822-1.

Genetic diversity, population structure, and cannabinoid variation in feral Cannabis sativa germplasm from the United States.美国野生大麻种质的遗传多样性、种群结构和大麻素变异

Sci Rep. 2025 Jul 1;15(1):20423. doi: 10.1038/s41598-025-07912-8.

The genetic basis of chloride exclusion in grapevines.葡萄中氯离子排斥的遗传基础。

G3 (Bethesda). 2025 Sep 3;15(9). doi: 10.1093/g3journal/jkaf149.

SNP Analysis Reveals Novel Insights into the Genetic Diversity of Colombian .单核苷酸多态性分析揭示了对哥伦比亚人遗传多样性的新见解。

Genes (Basel). 2025 May 30;16(6):675. doi: 10.3390/genes16060675.

Genetic and environmental influences on the distributions of three chromosomal drive haplotypes in maize.遗传和环境对玉米中三种染色体驱动单倍型分布的影响。

bioRxiv. 2025 May 27:2025.05.22.655462. doi: 10.1101/2025.05.22.655462.

Wheat yellow rust in Uruguay: understanding the genetic resistance in a panel of breeding and commercial germplasm.乌拉圭的小麦条锈病：了解一组育种和商业种质中的遗传抗性

Theor Appl Genet. 2025 Jun 11;138(7):145. doi: 10.1007/s00122-025-04937-5.

本文引用的文献

PLoS One. 2013 Oct 3;8(10):e76925. doi: 10.1371/journal.pone.0076925. eCollection 2013.

Genotyping by genome reducing and sequencing for outbred animals.全基因组降维测序在近交系动物中的基因分型

PLoS One. 2013 Jul 18;8(7):e67500. doi: 10.1371/journal.pone.0067500. Print 2013.

Digital genotyping of sorghum - a diverse plant species with a large repeat-rich genome.高粱的数字基因分型——一种具有丰富重复序列的大型基因组的多样化植物物种。

BMC Genomics. 2013 Jul 5;14:448. doi: 10.1186/1471-2164-14-448.

A filtering method to generate high quality short reads using illumina paired-end technology.一种使用 Illumina 配对末端技术生成高质量短读段的过滤方法。

PLoS One. 2013 Jun 17;8(6):e66643. doi: 10.1371/journal.pone.0066643. Print 2013.

Comprehensive genotyping of the USA national maize inbred seed bank.美国国家玉米自交系种子库的全面基因分型

Genome Biol. 2013 Jun 11;14(6):R55. doi: 10.1186/gb-2013-14-6-r55.

Discovering motifs that induce sequencing errors.发现诱导测序错误的模体。

BMC Bioinformatics. 2013;14 Suppl 5(Suppl 5):S1. doi: 10.1186/1471-2105-14-S5-S1. Epub 2013 Apr 10.

The population structure and recent colonization history of Oregon threespine stickleback determined using restriction-site associated DNA-sequencing.利用限制位点相关 DNA 测序确定俄勒冈州三刺鱼的种群结构和近期的殖民历史。

Mol Ecol. 2013 Jun;22(11):2864-83. doi: 10.1111/mec.12330.

Genotyping-by-sequencing in ecological and conservation genomics.生态与保护基因组学中的简化基因组测序

Mol Ecol. 2013 Jun;22(11):2841-7. doi: 10.1111/mec.12350. Epub 2013 May 25.

Adaptive evolution during an ongoing range expansion: the invasive bank vole (Myodes glareolus) in Ireland.持续扩张过程中的适应性进化：爱尔兰的入侵田鼠（Myodes glareolus）。

Mol Ecol. 2013 Jun;22(11):2971-85. doi: 10.1111/mec.12343. Epub 2013 May 24.

RESTseq--efficient benchtop population genomics with RESTriction Fragment SEQuencing.RESTseq——基于限制性片段测序的高效台式群体基因组学。

PLoS One. 2013 May 17;8(5):e63960. doi: 10.1371/journal.pone.0063960. Print 2013.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

TASSEL-GBS：一种用于测序分析流程的高容量基因分型方法。

TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献