Suppr超能文献

基于定制扩增子测序的低成本单核苷酸多态性(SNP)基因分型方法:测序数千样本基因分型法(GT-seq)

Genotyping-in-Thousands by sequencing (GT-seq): A cost effective SNP genotyping method based on custom amplicon sequencing.

作者信息

Campbell Nathan R, Harmon Stephanie A, Narum Shawn R

机构信息

Columbia River Inter-Tribal Fish Commission, 3059F National Fish Hatchery Road, Hagerman, ID, 83332, USA.

出版信息

Mol Ecol Resour. 2015 Jul;15(4):855-67. doi: 10.1111/1755-0998.12357. Epub 2014 Dec 25.

Abstract

Genotyping-in-Thousands by sequencing (GT-seq) is a method that uses next-generation sequencing of multiplexed PCR products to generate genotypes from relatively small panels (50-500) of targeted single-nucleotide polymorphisms (SNPs) for thousands of individuals in a single Illumina HiSeq lane. This method uses only unlabelled oligos and PCR master mix in two thermal cycling steps for amplification of targeted SNP loci. During this process, sequencing adapters and dual barcode sequence tags are incorporated into the amplicons enabling thousands of individuals to be pooled into a single sequencing library. Post sequencing, reads from individual samples are split into individual files using their unique combination of barcode sequences. Genotyping is performed with a simple perl script which counts amplicon-specific sequences for each allele, and allele ratios are used to determine the genotypes. We demonstrate this technique by genotyping 2068 individual steelhead trout (Oncorhynchus mykiss) samples with a set of 192 SNP markers in a single library sequenced in a single Illumina HiSeq lane. Genotype data were 99.9% concordant to previously collected TaqMan(™) genotypes at the same 192 loci, but call rates were slightly lower with GT-seq (96.4%) relative to Taqman (99.0%). Of the 192 SNPs, 187 were genotyped in ≥90% of the individual samples and only 3 SNPs were genotyped in <70% of samples. This study demonstrates amplicon sequencing with GT-seq greatly reduces the cost of genotyping hundreds of targeted SNPs relative to existing methods by utilizing a simple library preparation method and massive efficiency of scale.

摘要

测序数千样本基因分型法(GT-seq)是一种利用多重PCR产物的二代测序技术,从相对较小的目标单核苷酸多态性(SNP)面板(50-500个)中为数千个个体生成基因型的方法,可在单个Illumina HiSeq泳道中完成。该方法在两个热循环步骤中仅使用未标记的寡核苷酸和PCR预混液来扩增目标SNP位点。在此过程中,测序接头和双条形码序列标签被掺入扩增子中,使得数千个个体能够汇集到一个单一的测序文库中。测序后,利用条形码序列的独特组合将来自各个样本的读数拆分成单独的文件。通过一个简单的perl脚本进行基因分型,该脚本对每个等位基因的扩增子特异性序列进行计数,并使用等位基因比例来确定基因型。我们通过在单个Illumina HiSeq泳道中测序的一个文库中,用一组192个SNP标记对2068个虹鳟鱼(Oncorhynchus mykiss)个体样本进行基因分型,展示了这项技术。在相同的192个位点上,基因型数据与先前收集的TaqMan(™)基因型的一致性为99.9%,但GT-seq的检出率(96.4%)相对于Taqman(99.0%)略低。在192个SNP中,187个在≥90%的个体样本中进行了基因分型,只有3个SNP在<70%的样本中进行了基因分型。这项研究表明,相对于现有方法,利用GT-seq进行扩增子测序通过采用简单的文库制备方法和大规模的规模效率,大大降低了对数百个目标SNP进行基因分型的成本。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验