Evolutionary Bioinformatics Group, Institute for Evolution and Biodiversity, University of Muenster, Germany.
Mol Biol Evol. 2012 Feb;29(2):787-96. doi: 10.1093/molbev/msr250. Epub 2011 Oct 19.
The wealth of available genomic data presents an unrivaled opportunity to study the molecular basis of evolution. Studies on gene family expansions and site-dependent analyses have already helped establish important insights into how proteins facilitate adaptation. However, efforts to conduct full-scale cross-genomic comparisons between species are challenged by both growing amounts of data and the inherent difficulty in accurately inferring homology between deeply rooted species. Proteins, in comparison, evolve by means of domain rearrangements, a process more amenable to study given the strength of profile-based homology inference and the lower rates with which rearrangements occur. However, adapting to a constantly changing environment can require molecular modulations beyond reach of rearrangement alone. Here, we explore rates and functional implications of novel domain emergence in contrast to domain gain and loss in 20 arthropod species of the pancrustacean clade. Emerging domains are more likely disordered in structure and spread more rapidly within their genomes than established domains. Furthermore, although domain turnover occurs at lower rates than gene family turnover, we find strong evidence that the emergence of novel domains is foremost associated with environmental adaptation such as abiotic stress response. The results presented here illustrate the simplicity with which domain-based analyses can unravel key players of nature's adaptational machinery, complementing the classical site-based analyses of adaptation.