Pietrokovski S
Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA.
Protein Sci. 1998 Jan;7(1):64-71. doi: 10.1002/pro.5560070106.
Analysis of the conserved sequence features of inteins (protein "introns") reveals that they are composed of three distinct modular domains. The N-terminal (N) and C-terminal (C) domains are predicted to perform different parts of the autocatalytic protein splicing reaction. An optional endonuclease domain (EN) is shown to correspond to different types of homing endonucleases in different inteins. The N domain contains motifs predicted to catalyze the first steps of protein splicing, leading to the cleavage of the intein N terminus from its protein host. Intein N domain motifs are also found in C-terminal autocatalytic domains (CADs) present in hedgehog and other protein families. Specific residues in the N domain of intein and CADs are proposed to form a charge relay system involved in cleaving their N-termini. The intein C domain is apparently unique to inteins and contains motifs that catalyze the final protein splicing steps: ligation of the intein flanks and cleavage of its C terminus to release the free intein and spliced host protein. All intein EN domains known thus far have dodecapeptide (DOD, LAGLI-DADG) type homing endonuclease motifs. This work identifies an EN domain with an HNH homing-endonuclease motif and two new small inteins with no EN domains. One of these small inteins might be inactive or a "pseudo intein." The results suggest a modular architecture for inteins, clarify their origin and relationship to other protein families, and extend recent experimental findings on the functional roles of intein N, C, and EN motifs.
对内含肽(蛋白质“内含子”)保守序列特征的分析表明,它们由三个不同的模块化结构域组成。预测N端(N)和C端(C)结构域在自催化蛋白质剪接反应中执行不同的部分。一个可选的内切核酸酶结构域(EN)显示与不同内含肽中的不同类型归巢内切核酸酶相对应。N结构域包含预测可催化蛋白质剪接第一步的基序,导致内含肽N端从其蛋白质宿主上裂解下来。内含肽N结构域基序也存在于刺猬蛋白和其他蛋白质家族中的C端自催化结构域(CAD)中。内含肽和CAD的N结构域中的特定残基被认为形成了一个参与裂解其N端的电荷中继系统。内含肽C结构域显然是内含肽所特有的,并且包含催化蛋白质剪接最后步骤的基序:内含肽侧翼的连接以及其C端的裂解以释放游离的内含肽和剪接后的宿主蛋白质。迄今为止已知的所有内含肽EN结构域都具有十二肽(DOD,LAGLI-DADG)型归巢内切核酸酶基序。这项工作鉴定出一个具有HNH归巢内切核酸酶基序的EN结构域和两个没有EN结构域的新的小内含肽。其中一个小内含肽可能是无活性的或“假内含肽 ”。结果表明内含肽具有模块化结构,阐明了它们的起源以及与其他蛋白质家族的关系,并扩展了最近关于内含肽N、C和EN基序功能作用的实验发现。