Department of Biology, Hopkins Marine Station, Stanford University, Oceanview Boulevard, Pacific Grove, CA 93950, USA.
Proc Biol Sci. 2012 Apr 7;279(1732):1412-20. doi: 10.1098/rspb.2011.1823. Epub 2011 Oct 12.
Natural selection can act on all the expressed genes of an individual, leaving signatures of genetic differentiation or diversity at many loci across the genome. New power to assay these genome-wide effects of selection comes from associating multi-locus patterns of polymorphism with gene expression and function. Here, we performed one of the first genome-wide surveys in a marine species, comparing purple sea urchins, Strongylocentrotus purpuratus, from two distant locations along the species' wide latitudinal range. We examined 9112 polymorphic loci from upstream non-coding and coding regions of genes for signatures of selection with respect to gene function and tissue- and ontogenetic gene expression. We found that genetic differentiation (F(ST)) varied significantly across functional gene classes. The strongest enrichment occurred in the upstream regions of E3 ligase genes, enzymes known to regulate protein abundance during development and environmental stress. We found enrichment for high heterozygosity in genes directly involved in immune response, particularly NALP genes, which mediate pro-inflammatory signals during bacterial infection. We also found higher heterozygosity in immune genes in the southern population, where disease incidence and pathogen diversity are greater. Similar to the major histocompatibility complex in mammals, balancing selection may enhance genetic diversity in the innate immune system genes of this invertebrate. Overall, our results show that how genome-wide polymorphism data coupled with growing databases on gene function and expression can combine to detect otherwise hidden signals of selection in natural populations.
自然选择可以作用于个体表达的所有基因,在基因组的许多位点留下遗传分化或多样性的特征。从多基因座的多态性与基因表达和功能相关联的角度来检测这些选择的全基因组效应,这为我们提供了新的能力。在这里,我们在一种海洋物种中进行了首次全基因组调查之一,比较了来自该物种广泛的纬度范围的两个遥远地点的紫色海胆(Strongylocentrotus purpuratus)。我们研究了 9112 个多态性基因座,这些基因座位于基因的上游非编码区和编码区,用于检测与基因功能以及组织和个体发育基因表达有关的选择信号。我们发现遗传分化(FST)在功能基因类别之间存在显著差异。最强的富集发生在 E3 连接酶基因的上游区域,这些酶已知在发育和环境压力期间调节蛋白质丰度。我们发现直接参与免疫反应的基因中的高杂合性富集,特别是 NALP 基因,这些基因在细菌感染期间介导促炎信号。我们还发现南部种群中免疫基因的杂合度更高,那里的疾病发病率和病原体多样性更高。与哺乳动物的主要组织相容性复合体类似,平衡选择可能会增强这种无脊椎动物先天免疫系统基因的遗传多样性。总的来说,我们的研究结果表明,全基因组多态性数据与不断增长的基因功能和表达数据库相结合,如何在自然种群中检测到选择的隐藏信号。