Huntley Melanie A, Golding G Brian
Department of Biology, McMaster University, Hamilton, Ontario L8S 4K1, Canada.
Genetics. 2004 Mar;166(3):1141-54. doi: 10.1534/genetics.166.3.1141.
Proteins associated with disease and development of the nervous system are thought to contain repetitive, simple sequences. However, genome-wide surveys for simple sequences within proteins have revealed that repetitive peptide sequences are the most frequent shared peptide segments among eukaryotic proteins, including those of Saccharomyces cerevisiae, which has few to no specialized developmental and neurological proteins. It is therefore of interest to determine if these specialized proteins have an excess of simple sequences when compared to other sets of compositionally similar proteins. We have determined the relative abundance of simple sequences within neurological proteins and find no excess of repetitive simple sequence within this class. In fact, polyglutamine repeats that are associated with many neurodegenerative diseases are no more abundant within neurological specialized proteins than within nonneurological collections of proteins. We also examined the codon composition of serine homopolymers to determine what forces may play a role in the evolution of extended homopolymers. Codon type homogeneity tends to be favored, suggesting replicative slippage instead of selection as the main force responsible for producing these homopolymers.
与神经系统疾病和发育相关的蛋白质被认为含有重复的简单序列。然而,对蛋白质中简单序列的全基因组调查显示,重复肽序列是真核生物蛋白质中最常见的共享肽段,包括酿酒酵母的蛋白质,而酿酒酵母几乎没有或根本没有专门的发育和神经学蛋白质。因此,确定与其他组成相似的蛋白质组相比,这些专门的蛋白质是否具有过量的简单序列是很有意义的。我们已经确定了神经学蛋白质中简单序列的相对丰度,发现在这一类蛋白质中不存在过量的重复简单序列。事实上,与许多神经退行性疾病相关的聚谷氨酰胺重复序列在神经学专门蛋白质中的丰度并不比非神经学蛋白质集合中的丰度高。我们还研究了丝氨酸同聚物的密码子组成,以确定哪些力量可能在延伸同聚物的进化中起作用。密码子类型同质性往往更受青睐,这表明复制滑移而非选择是产生这些同聚物的主要力量。