Soybean Genomics and Improvement Laboratory, US Department of Agriculture, Agricultural Research Service, Beltsville, Maryland 20705, USA.
BMC Genomics. 2010 Aug 16;11:475. doi: 10.1186/1471-2164-11-475.
Next generation sequencing has significantly increased the speed at which single nucleotide polymorphisms (SNPs) can be discovered and subsequently used as molecular markers for research. Unfortunately, for species such as common bean (Phaseolus vulgaris L.) which do not have a whole genome sequence available, the use of next generation sequencing for SNP discovery is much more difficult and costly. To this end we developed a method which couples sequences obtained from the Roche 454-FLX system (454) with the Illumina Genome Analyzer (GA) for high-throughput SNP discovery.
Using a multi-tier reduced representation library we discovered a total of 3,487 SNPs of which 2,795 contained sufficient flanking genomic sequence for SNP assay development. Using Sanger sequencing to determine the validation rate of these SNPs, we found that 86% are likely to be true SNPs. Furthermore, we designed a GoldenGate assay which contained 1,050 of the 3,487 predicted SNPs. A total of 827 of the 1,050 SNPs produced a working GoldenGate assay (79%).
Through combining two next generation sequencing techniques we have developed a method that allows high-throughput SNP discovery in any diploid organism without the need of a whole genome sequence or the creation of normalized cDNA libraries. The need to only perform one 454 run and one GA sequencer run allows high-throughput SNP discovery with sufficient sequence for assay development to be performed in organisms, such as common bean, which have limited genomic resources.
下一代测序技术大大提高了单核苷酸多态性(SNP)的发现速度,随后可将其用作研究的分子标记。不幸的是,对于没有全基因组序列的物种,如普通菜豆(Phaseolus vulgaris L.),使用下一代测序进行 SNP 发现更加困难且昂贵。为此,我们开发了一种方法,该方法结合了 Roche 454-FLX 系统(454)和 Illumina Genome Analyzer(GA)的序列,用于高通量 SNP 发现。
使用多层次简化代表文库,我们总共发现了 3487 个 SNP,其中 2795 个 SNP 包含足够的侧翼基因组序列用于 SNP 检测开发。使用 Sanger 测序确定这些 SNP 的验证率,我们发现 86%的 SNP 可能是真实的 SNP。此外,我们设计了一种 GoldenGate 检测,其中包含 3487 个预测 SNP 中的 1050 个。总共 1050 个 SNP 中有 827 个产生了有效的 GoldenGate 检测(79%)。
通过结合两种下一代测序技术,我们开发了一种方法,可以在任何二倍体生物中进行高通量 SNP 发现,而无需全基因组序列或标准化 cDNA 文库的创建。仅需要进行一次 454 运行和一次 GA 测序器运行,就可以在普通菜豆等基因组资源有限的生物中进行高通量 SNP 发现,并提供足够的序列用于检测开发。