Evolutionary Genomics Group, Research Programme in Biomedical Informatics, Universitat Pompeu Fabra (UPF)-Institute Municipal d'Investigació Mèdica (IMIM), Barcelona, Spain.
Mol Biol Evol. 2012 Mar;29(3):883-6. doi: 10.1093/molbev/msr263. Epub 2011 Oct 31.
Low-complexity sequences are extremely abundant in eukaryotic proteins for reasons that remain unclear. One hypothesis is that they contribute to the formation of novel coding sequences, facilitating the generation of novel protein functions. Here, we test this hypothesis by examining the content of low-complexity sequences in proteins of different age. We show that recently emerged proteins contain more low-complexity sequences than older proteins and that these sequences often form functional domains. These data are consistent with the idea that low-complexity sequences may play a key role in the emergence of novel genes.
低复杂度序列在真核蛋白质中极为丰富,但原因尚不清楚。一种假设是,它们有助于形成新的编码序列,从而促进新蛋白质功能的产生。在这里,我们通过检查不同年龄蛋白质中的低复杂度序列含量来检验这一假说。我们发现,新出现的蛋白质比旧蛋白质含有更多的低复杂度序列,而且这些序列通常形成功能域。这些数据与低复杂度序列可能在新基因的出现中发挥关键作用的观点一致。