Federal University of Viçosa, Department of General Biology, Viçosa, MG, Brazil.
Federal University of Viçosa, Department of Animal Science, Viçosa, MG, Brazil.
PLoS One. 2019 Sep 25;14(9):e0219417. doi: 10.1371/journal.pone.0219417. eCollection 2019.
Linkage disequilibrium (LD) analysis provides information on the evolutionary aspects of populations. Recently, haplotype blocks have been used to increase the power of quantitative trait loci detection in genome-wide association studies and the prediction accuracy of genomic selection. Our objectives were as follows: to compare the degree of LD, LD decay, and LD decay extent in popcorn populations; to characterize the number and length of haplotype blocks in the populations; and to determine whether maize chromosomes also have a pattern of interspaced regions of high and low rates of recombination. We used a biparental population, a synthetic, and a breeding population, genotyped for approximately 75,000 single nucleotide polymorphisms (SNPs). The sample size ranged from 190 to 192 plants. For the whole-genome LD and haplotype block analyses, we assumed a window of 500 kb. To characterize the block and step patterns of LD in the populations, we constructed LD maps by chromosome, defining a cold spot as a chromosome segment including SNPs with the same LDU position. The LD and haplotype block analyses were also performed at the intragenic level, selecting 12 genes related to zein, starch, cellulose, and fatty acid biosynthesis. The populations with the higher and lower frequencies of |D'| values greater than 0.75 were the biparental (65-74%) and the breeding population (26-58%), respectively. There were slight differences between the populations regarding the average distance for SNPs with |D'| values greater than 0.75 (in the range of approximately 207 to 229 kb). The level of LD expressed by the r2 values was low in the populations (0.02, 0.04, and 0.04, on average) but comparable to some non-isolated human populations. The frequency of r2 values greater than 0.75 was lower in the biparental population (0.2-0.5%) and higher in the other populations (0.2-1.6%). The average distance for SNPs with r2 values greater than 0.75 was much higher in the biparental population (approximately 80 to 126 kb). In the other populations, the ranges were approximately 6 to 19 and 6 to 35 kb. The heatmaps for the regions covered by the first 100 SNPs in each chromosome, in each population (1 to 3.3 Mb, approximately), provided evidence that the comparatively few high r2 values (close to 1.0) occurred only for SNPs in close proximity, especially in the synthetic and breeding populations. Due to the reduced number of SNPs in the haplotype blocks (2 to 3) in the populations, it is not expected advantage of a haplotype-based association study as well as genomic selection along generations. The results concerning LD decay (rapid decay after 5-10 kb) and LD decay extent (along up to 300 kb) are in the range observed with maize inbred line panels. The LD maps indicate that maize chromosomes had a pattern of regions of extensive LD interspaced with regions of low LD. However, our simulated LD map provides evidence that this pattern can reflect regions with differences in allele frequencies and LD levels (expressed by |D'|) and not regions with high and low rates of recombination.
连锁不平衡(LD)分析提供了有关种群进化方面的信息。最近,单倍型块已被用于增加全基因组关联研究中数量性状位点的检测能力和基因组选择的预测准确性。我们的目标如下:比较爆米花种群的 LD 程度、LD 衰减和 LD 衰减程度;描述种群中单倍型块的数量和长度;并确定玉米染色体是否也存在间隔高重组率和低重组率区域的模式。我们使用了一个双亲和一个合成群体以及一个育种群体,对大约 75000 个单核苷酸多态性(SNP)进行了基因型分析。样本量范围为 190 到 192 株。对于全基因组 LD 和单倍型块分析,我们假设窗口大小为 500kb。为了描述群体中 LD 和单倍型块的模式,我们通过染色体构建 LD 图谱,将冷点定义为包含相同 LDU 位置 SNP 的染色体片段。还在基因内水平上进行了 LD 和单倍型块分析,选择了 12 个与 zein、淀粉、纤维素和脂肪酸生物合成相关的基因。|D'|值大于 0.75 的双亲和育种群体(65-74%和 26-58%)的频率更高。种群之间的 SNP 的平均距离略有差异,|D'|值大于 0.75(约为 207 到 229kb)。种群中 LD 由 r2 值表示的水平较低(平均为 0.02、0.04 和 0.04),但与一些非隔离的人类种群相当。双亲和群体中 r2 值大于 0.75 的频率较低(0.2-0.5%),其他群体中较高(0.2-1.6%)。双亲和群体中 r2 值大于 0.75 的 SNP 的平均距离高得多(约为 80 到 126kb)。在其他群体中,范围约为 6 到 19 和 6 到 35kb。每个种群中每个染色体上前 100 个 SNP 覆盖区域的热图(1 到 3.3Mb,约)提供了证据,表明相对较少的高 r2 值(接近 1.0)仅发生在近距离的 SNP 中,尤其是在合成和育种群体中。由于群体中单倍型块中的 SNP 数量较少(2 到 3),因此在几代中基于单倍型的关联研究和基因组选择不会具有优势。LD 衰减(5-10kb 后快速衰减)和 LD 衰减程度(可达 300kb)的结果在玉米自交系面板中观察到的范围内。LD 图谱表明,玉米染色体存在广泛 LD 区域与低 LD 区域相间的模式。然而,我们的模拟 LD 图谱提供了证据,表明这种模式可以反映等位基因频率和 LD 水平(由 |D'| 表示)不同的区域,而不是高重组率和低重组率的区域。