Genetic Strains Research Center, National Institute of Genetics, Mishima, Shizuoka, Japan.
BMC Genomics. 2010 May 20;11:315. doi: 10.1186/1471-2164-11-315.
High-density oligonucleotide arrays are effective tools for genotyping numerous loci simultaneously. In small genome species (genome size: < approximately 300 Mb), whole-genome DNA hybridization to expression arrays has been used for various applications. In large genome species, transcript hybridization to expression arrays has been used for genotyping. Although rice is a fully sequenced model plant of medium genome size (approximately 400 Mb), there are a few examples of the use of rice oligonucleotide array as a genotyping tool.
We compared the single feature polymorphism (SFP) detection performance of whole-genome and transcript hybridizations using the Affymetrix GeneChip Rice Genome Array, using the rice cultivars with full genome sequence, japonica cultivar Nipponbare and indica cultivar 93-11. Both genomes were surveyed for all probe target sequences. Only completely matched 25-mer single copy probes of the Nipponbare genome were extracted, and SFPs between them and 93-11 sequences were predicted. We investigated optimum conditions for SFP detection in both whole genome and transcript hybridization using differences between perfect match and mismatch probe intensities of non-polymorphic targets, assuming that these differences are representative of those between mismatch and perfect targets. Several statistical methods of SFP detection by whole-genome hybridization were compared under the optimized conditions. Causes of false positives and negatives in SFP detection in both types of hybridization were investigated.
The optimizations allowed a more than 20% increase in true SFP detection in whole-genome hybridization and a large improvement of SFP detection performance in transcript hybridization. Significance analysis of the microarray for log-transformed raw intensities of PM probes gave the best performance in whole genome hybridization, and 22,936 true SFPs were detected with 23.58% false positives by whole genome hybridization. For transcript hybridization, stable SFP detection was achieved for highly expressed genes, and about 3,500 SFPs were detected at a high sensitivity (> 50%) in both shoot and young panicle transcripts. High SFP detection performances of both genome and transcript hybridizations indicated that microarrays of a complex genome (e.g., of Oryza sativa) can be effectively utilized for whole genome genotyping to conduct mutant mapping and analysis of quantitative traits such as gene expression levels.
高密度寡核苷酸阵列是同时对多个基因座进行基因分型的有效工具。在小基因组物种(基因组大小:<约 300Mb)中,全基因组 DNA 杂交到表达阵列已被用于各种应用。在大基因组物种中,转录物杂交到表达阵列已被用于基因分型。尽管水稻是一种中等大小基因组(约 400Mb)的全测序模式植物,但利用水稻寡核苷酸阵列作为基因分型工具的例子很少。
我们使用 Affymetrix GeneChip Rice Genome Array 比较了全基因组和转录物杂交的单特征多态性(SFP)检测性能,使用具有全基因组序列的水稻品种粳稻品种 Nipponbare 和籼稻品种 93-11。对两个基因组的所有探针靶序列进行了调查。仅提取了 Nipponbare 基因组中完全匹配的 25 -mer 单拷贝探针,并预测了它们与 93-11 序列之间的 SFP。我们研究了全基因组和转录物杂交中 SFP 检测的最佳条件,假设这些差异代表了错配和完全匹配目标之间的差异,通过非多态性目标的完美匹配和错配探针强度之间的差异。在优化条件下比较了全基因组杂交中几种 SFP 检测的统计方法。研究了两种类型杂交中 SFP 检测的假阳性和假阴性的原因。
优化允许全基因组杂交中 SFP 检测的真实阳性率提高 20%以上,转录物杂交中 SFP 检测性能有了很大提高。对 PM 探针对数转换原始强度的微阵列进行了显着性分析,在全基因组杂交中表现出最佳性能,全基因组杂交检测到 22936 个真实 SFP,假阳性率为 23.58%。对于转录物杂交,实现了高表达基因的稳定 SFP 检测,在芽和幼穗转录物中均以高灵敏度(>50%)检测到约 3500 个 SFP。全基因组和转录物杂交的高 SFP 检测性能表明,复杂基因组(例如,Oryza sativa)的微阵列可以有效地用于全基因组基因分型,以进行突变图谱绘制和基因表达水平等数量性状的分析。