Department of Horticulture and Crop Science, The Ohio State University, Ohio Agricultural Research and Development Center, Wooster, OH, USA.
PLoS One. 2012;7(9):e45520. doi: 10.1371/journal.pone.0045520. Epub 2012 Sep 20.
The effects of selection on genome variation were investigated and visualized in tomato using a high-density single nucleotide polymorphism (SNP) array. 7,720 SNPs were genotyped on a collection of 426 tomato accessions (410 inbreds and 16 hybrids) and over 97% of the markers were polymorphic in the entire collection. Principal component analysis (PCA) and pairwise estimates of F(st) supported that the inbred accessions represented seven sub-populations including processing, large-fruited fresh market, large-fruited vintage, cultivated cherry, landrace, wild cherry, and S. pimpinellifolium. Further divisions were found within both the contemporary processing and fresh market sub-populations. These sub-populations showed higher levels of genetic diversity relative to the vintage sub-population. The array provided a large number of polymorphic SNP markers across each sub-population, ranging from 3,159 in the vintage accessions to 6,234 in the cultivated cherry accessions. Visualization of minor allele frequency revealed regions of the genome that distinguished three representative sub-populations of cultivated tomato (processing, fresh market, and vintage), particularly on chromosomes 2, 4, 5, 6, and 11. The PCA loadings and F(st) outlier analysis between these three sub-populations identified a large number of candidate loci under positive selection on chromosomes 4, 5, and 11. The extent of linkage disequilibrium (LD) was examined within each chromosome for these sub-populations. LD decay varied between chromosomes and sub-populations, with large differences reflective of breeding history. For example, on chromosome 11, decay occurred over 0.8 cM for processing accessions and over 19.7 cM for fresh market accessions. The observed SNP variation and LD decay suggest that different patterns of genetic variation in cultivated tomato are due to introgression from wild species and selection for market specialization.
利用高密度单核苷酸多态性(SNP)芯片研究并可视化了选择对番茄基因组变异的影响。在 426 个番茄品种(410 个自交系和 16 个杂交种)的集合上对 7720 个 SNPs 进行了基因型分析,整个集合中超过 97%的标记都是多态性的。主成分分析(PCA)和 Fst 的成对估计支持 426 个自交系品种代表 7 个亚群,包括加工、大果鲜食、大果老品种、栽培樱桃、地方品种、野生樱桃和 S. pimpinellifolium。在现代加工和鲜食亚群中都发现了进一步的细分。与老品种亚群相比,这些亚群表现出更高水平的遗传多样性。该芯片在每个亚群中提供了大量多态性 SNP 标记,从老品种品种中的 3159 个到栽培樱桃品种中的 6234 个不等。对次要等位基因频率的可视化揭示了区分栽培番茄三个代表性亚群(加工、鲜食和老品种)的基因组区域,特别是在染色体 2、4、5、6 和 11 上。这三个亚群之间的 PCA 载荷和 Fst 异常值分析确定了染色体 4、5 和 11 上多个受正向选择的候选基因座。还检查了这三个亚群中每个染色体上的连锁不平衡(LD)程度。LD 衰减在不同染色体和亚群之间存在差异,反映了不同的选育历史。例如,在染色体 11 上,加工品种的 LD 衰减发生在 0.8 cM 以内,而鲜食品种的 LD 衰减发生在 19.7 cM 以上。观察到的 SNP 变异和 LD 衰减表明,栽培番茄中不同的遗传变异模式是由于与野生种的基因渗入和对市场专业化的选择。