Department of Biology and Institute for Genome Sciences & Policy, Duke University, Box 90338, Durham, NC 27708, USA.
Evol Dev. 2012 Mar-Apr;14(2):152-67. doi: 10.1111/j.1525-142X.2012.00532.x.
Although noncoding sequences comprise a substantial fraction of functional sites within all genomes, the evolutionary mechanisms that operate on genetic variation within regulatory elements remain poorly understood. In this study, we examine the population genetics of the core, upstream cis-regulatory regions of eight genes (AN, CyIIa, CyIIIa, Endo16, FoxB, HE, SM30a, and SM50) that function during the early development of the purple sea urchin, Strongylocentrotus purpuratus. Quantitative and qualitative measures of segregating variation are not conspicuously different between cis-regulatory and closely linked "proxy neutral" noncoding regions containing no known functional sites. Length and compound mutations are common in noncoding sequences; conventional descriptive statistics ignore such mutations, under-representing true genetic variation by approximately 28% for these loci in this population. Patterns of variation in the cis-regulatory regions of six of the genes examined (CyIIa, CyIIIa, Endo16, FoxB, AN, and HE) are consistent with directional selection. Genetic variation within annotated transcription factor binding sites is comparable to, and frequently greater than, that of surrounding sequences. Comparisons of two paralog pairs (CyIIa/CyIIIa and AN/HE) suggest that distinct evolutionary processes have operated on their cis-regulatory regions following gene duplication. Together, these analyses provide a detailed view of the evolutionary mechanisms operating on noncoding sequences within a natural population, and underscore how little is known about how these processes operate on cis-regulatory sequences.