Department of Natural Resources & Environmental Sciences, University of Illinois, Urbana, IL 61801, USA.
Gene. 2012 Feb 25;494(2):196-201. doi: 10.1016/j.gene.2011.12.001. Epub 2011 Dec 24.
EST data generated from 14 apple genotypes were downloaded from NCBI and mapped against a reference EST assembly to identify Single Nucleotide Polymorphisms (SNPs). Mapping of these SNPs was undertaken using 90% of sequence similarity and minimum coverage of four reads at each SNP position. In total, 37,807 SNPs were identified with an average of one SNP every 187 bp from a total of 6888 unique EST contigs. Identified SNPs were checked for flanking sequences of ≥ 60 bp along both sides of SNP alleles for reliable design of a custom high-throughput genotyping assay. A total of 12,299 SNPs, representing 6525 contigs, fit the selected criterion of ≥ 60 bp sequences flanking a SNP position. Of these, 1411 SNPs were validated using four apple genotypes. Based on genotyping assays, it was estimated that 60% of SNPs were valid SNPs, while 26% of SNPs might be derived from paralogous regions.
从 NCBI 下载了来自 14 个苹果基因型的表达序列标签 (EST) 数据,并将其映射到参考 EST 组装上,以鉴定单核苷酸多态性 (SNP)。使用 90%的序列相似度和每个 SNP 位置至少 4 次读取的最小覆盖度来进行这些 SNP 的映射。总共鉴定出 37807 个 SNP,平均每 6888 个独特的 EST 连续体中有一个 SNP,每 187bp 出现一个 SNP。鉴定出的 SNP 被检查了 SNP 等位基因两侧至少 60bp 的侧翼序列,以可靠地设计定制的高通量基因分型测定。共有 12299 个 SNP,代表 6525 个连续体,符合 SNP 位置两侧至少 60bp 序列的选择标准。其中,使用四个苹果基因型对 1411 个 SNP 进行了验证。基于基因分型测定,估计有 60%的 SNP 是有效的 SNP,而 26%的 SNP 可能来自于同源区域。