Memon Shabana, Jia Xianqing, Gu Longjiang, Zhang Xiaohui
School of life Sciences, Nanjing University, Nanjing, 210093, China.
Lecturer, Department of Plant Breeding and Genetics, Sindh Agriculture University, Tando Jam, Hyderabad, 70060, Pakistan.
BMC Evol Biol. 2016 Jan 27;16:25. doi: 10.1186/s12862-016-0590-7.
The variation rate in genomic regions associated with different alleles, impacts to distinct evolutionary patterns involving rare alleles. The rare alleles bias towards genome-wide association studies (GWASs), aim to detect different variants at genomic loci associated with single-nucleotide polymorphisms (SNPs) inclined to produce different haplotypes. Here, we sequenced Arabidopsis thaliana and compared its coding and non-coding genomic regions with its closest outgroup relative, Arabidopsis lyrta, which accounted for the ancestral misinference. The use of genome-wide SNPs interpret the genetic architecture of rare alleles in Arabidopsis thaliana, elucidating a significant departure from a neutral evolutionary model and the pattern of polymorphisms around a selected locus will exclusively influence natural selection.
We found 23.4% of the rare alleles existing randomly in the genome. Notably, in our results significant differences (P < 0.01) were estimated in the relative rates between rare versus intermediate alleles, between fixed versus non-fixed mutations, and between type I versus type II rare-mutations by using the χ (2)-test. However, the rare alleles generating negative values of Tajima's D suggest that they generated under selective sweeps. Relative to polymorphic sites including SNPs, 67.5% of the fixed mutations were attributed, indicating major contributors to speciation. Substantially, an evolution occurred in the rare allele that was 1.42-times faster than that in a major haplotype.
Our results interpret that rare alleles fits a random occurrence model, indicating that rare alleles occur at any locus in a genome and in any accession in a species. Based on the higher relative rate of derived to ancient mutations and higher average D xy, we conclude that rare alleles evolve faster than the higher frequency alleles. The rapid evolution of rare alleles indicates that they must have been newly generated with fixed mutations, compared with the other alleles. Eventually, PCR and sequencing results, in the flanking regions of rare allele loci confirm that they are of short extension, indicating the absence of a genome-wide pattern for a rare haplotype. The indel-associated model for rare alleles assumes that indel-associated mutations only occur in an indel heterozygote.
与不同等位基因相关的基因组区域的变异率,影响涉及稀有等位基因的不同进化模式。稀有等位基因偏向于全基因组关联研究(GWAS),旨在检测与倾向于产生不同单倍型的单核苷酸多态性(SNP)相关的基因组位点处的不同变体。在此,我们对拟南芥进行了测序,并将其编码和非编码基因组区域与其最亲近的外类群亲属琴叶拟南芥进行了比较,这解释了祖先错误推断的原因。使用全基因组SNP来解读拟南芥中稀有等位基因的遗传结构,阐明了与中性进化模型的显著偏离,并且选定基因座周围的多态性模式将专门影响自然选择。
我们发现23.4%的稀有等位基因随机存在于基因组中。值得注意的是,在我们的结果中,通过使用χ²检验估计了稀有等位基因与中间等位基因之间、固定突变与非固定突变之间以及I型与II型稀有突变之间的相对率存在显著差异(P < 0.01)。然而,产生负Tajima's D值的稀有等位基因表明它们是在选择性清除下产生的。相对于包括SNP在内的多态性位点,67.5%的固定突变被归因,表明其对物种形成的主要贡献。实质上,稀有等位基因的进化速度比主要单倍型快1.42倍。
我们的结果表明稀有等位基因符合随机发生模型,表明稀有等位基因在基因组的任何位点和物种的任何 accession 中出现。基于衍生突变与古老突变的相对率较高以及平均D xy较高,我们得出结论,稀有等位基因的进化速度比高频等位基因快。稀有等位基因的快速进化表明,与其他等位基因相比,它们一定是通过固定突变新产生的。最终,稀有等位基因座侧翼区域的PCR和测序结果证实它们的延伸较短,表明不存在稀有单倍型的全基因组模式。稀有等位基因的插入缺失相关模型假设插入缺失相关突变仅发生在插入缺失杂合子中。