Weiner January, Beaussart Francois, Bornberg-Bauer Erich
Division of Bioinformatics, School of Biological Sciences, The Westfalian Wilhelms University of Münster, Germany.
FEBS J. 2006 May;273(9):2037-47. doi: 10.1111/j.1742-4658.2006.05220.x.
The main mechanisms shaping the modular evolution of proteins are gene duplication, fusion and fission, recombination and loss of fragments. While a large body of research has focused on duplications and fusions, we concentrated, in this study, on how domains are lost. We investigated motif databases and introduced a measure of protein similarity that is based on domain arrangements. Proteins are represented as strings of domains and comparison was based on the classic dynamic alignment scheme. We found that domain losses and duplications were more frequent at the ends of proteins. We showed that losses can be explained by the introduction of start and stop codons which render the terminal domains nonfunctional, such that further shortening, until the whole domain is lost, is not evolutionarily selected against. We demonstrated that domains which also occur as single-domain proteins are less likely to be lost at the N terminus and in the middle, than at the C terminus. We conclude that fission/fusion events with single-domain proteins occur mostly at the C terminus. We found that domain substitutions are rare, in particular in the middle of proteins. We also showed that many cases of substitutions or losses result from erroneous annotations, but we were also able to find courses of evolutionary events where domains vanish over time. This is explained by a case study on the bacterial formate dehydrogenases.
塑造蛋白质模块化进化的主要机制包括基因复制、融合与分裂、重组以及片段丢失。虽然大量研究聚焦于复制和融合,但在本研究中,我们关注的是结构域如何丢失。我们研究了基序数据库,并引入了一种基于结构域排列的蛋白质相似性度量方法。蛋白质被表示为结构域的字符串,比较基于经典的动态比对方案。我们发现结构域丢失和复制在蛋白质末端更为频繁。我们表明,丢失可以通过起始密码子和终止密码子的引入来解释,这使得末端结构域失去功能,以至于进一步缩短直至整个结构域丢失,在进化上不会被选择淘汰。我们证明,那些也以单结构域蛋白质形式存在的结构域,在N端和中间比在C端更不容易丢失。我们得出结论,单结构域蛋白质的分裂/融合事件大多发生在C端。我们发现结构域替换很少见,尤其是在蛋白质中间。我们还表明,许多替换或丢失的情况是由错误注释导致的,但我们也能够找到结构域随时间消失的进化事件过程。这通过对细菌甲酸脱氢酶的案例研究得到了解释。