a Faculdade de Gama , Universidade de Brasilia , Gama , 72405-610 , Brazil .
J Biomol Struct Dyn. 2013;31(11):1293-300. doi: 10.1080/07391102.2012.735623. Epub 2012 Nov 12.
The conserved protein sequence motifs present in all prokaryotic proteomes, "omnipresent motifs," presumably, correspond to the earliest proteins of the Last Universal Cellular Ancestor, from which all the proteomes have descended. Fifteen proteomes, each representing one of the total 15 diverse phyla of 131 Eubacteria and Archea, from which the omnipresent elements have been originally derived, are exhaustively screened. All those proteins which harbor the omnipresent motifs are identified. Six "omnipresent" protein types are revealed which are located in all 15 proteomes: ABC cassettes, FtsH proteases, translation initiation factors, translation elongation factors, isoleucyl-tRNA synthases, and RNA polymerases β'. In addition to the omnipresent motifs, these proteins also contain other highly conserved motifs, standing for additional modules of the proteins. Remarkably, the identified tentative earliest proteins are responsible for only three basic functions: supply of monomers (ABC transporters and proteases), protein synthesis (initiation and elongation factors, aminoacyl-tRNA synthases), and RNA synthesis (polymerases). No enzymes involved in metabolic activities are present in the list of the earliest proteins derived by this approach. Some of the omnipresent sequence motifs are found, indeed, in the metabolic enzymes (e.g. NTP binding motifs), but these enzymes do not make a sequence matching collection of 15 sequences, i.e. they are not omnipresent. Future analysis of less conserved sequence motifs may reveal at what degree of conservation (stage of evolution) the metabolic enzymes could have entered the scene.
所有原核生物蛋白质组中都存在保守的蛋白质序列基序,即“普遍存在的基序”,这些基序可能对应于最后一个普遍存在的细胞祖先的最早蛋白质,所有的蛋白质组都是从这个祖先中衍生出来的。从最初衍生出普遍存在的元素的 15 个蛋白质组中,每个蛋白质组都代表着 131 个细菌和古细菌的 15 个不同门中的一个。对所有这些蛋白质组进行了详尽的筛选,以识别出所有具有普遍存在基序的蛋白质。结果发现了六种位于所有 15 种蛋白质组中的“普遍存在”蛋白质类型:ABC 盒、FtsH 蛋白酶、翻译起始因子、翻译延伸因子、异亮氨酸 tRNA 合成酶和 RNA 聚合酶 β'。除了普遍存在的基序外,这些蛋白质还包含其他高度保守的基序,代表蛋白质的其他模块。值得注意的是,鉴定出的暂定最早的蛋白质仅负责三种基本功能:提供单体(ABC 转运蛋白和蛋白酶)、蛋白质合成(起始和延伸因子、氨酰-tRNA 合成酶)和 RNA 合成(聚合酶)。在这种方法中,从最早的蛋白质中没有发现参与代谢活动的酶。一些普遍存在的序列基序确实存在于代谢酶中(例如 NTP 结合基序),但这些酶没有形成一个由 15 个序列组成的序列匹配集合,也就是说,它们不是普遍存在的。未来对不太保守的序列基序的分析可能会揭示代谢酶在何种程度的保守(进化阶段)下进入了这一领域。