Lee Dong Wook, Kim Jong Kyoung, Lee Sumin, Choi Seungjin, Kim Sanguk, Hwang Inhwan
Laboratory of Cellular Systems Biology, Division of Molecular and Life Sciences, POSTECH, Pohang 790-784, Korea.
Plant Cell. 2008 Jun;20(6):1603-22. doi: 10.1105/tpc.108.060541. Epub 2008 Jun 13.
The N-terminal transit peptides of nuclear-encoded plastid proteins are necessary and sufficient for their import into plastids, but the information encoded by these transit peptides remains elusive, as they have a high sequence diversity and lack consensus sequences or common sequence motifs. Here, we investigated the sequence information contained in transit peptides. Hierarchical clustering on transit peptides of 208 plastid proteins showed that the transit peptide sequences are grouped to multiple sequence subgroups. We selected representative proteins from seven of these multiple subgroups and confirmed that their transit peptide sequences are highly dissimilar. Protein import experiments revealed that each protein contained transit peptide-specific sequence motifs critical for protein import into chloroplasts. Bioinformatics analysis identified sequence motifs that were conserved among members of the identified subgroups. The sequence motifs identified by the two independent approaches were nearly identical or significantly overlapped. Furthermore, the accuracy of predicting a chloroplast protein was greatly increased by grouping the transit peptides into multiple sequence subgroups. Based on these data, we propose that the transit peptides are composed of multiple sequence subgroups that contain distinctive sequence motifs for chloroplast targeting.
核编码质体蛋白的N端转运肽对于其导入质体是必要且充分的,但其编码的信息仍然难以捉摸,因为它们具有高度的序列多样性,并且缺乏共有序列或常见序列基序。在此,我们研究了转运肽中包含的序列信息。对208种质体蛋白的转运肽进行层次聚类分析表明,转运肽序列被分为多个序列亚组。我们从这些多个亚组中的七个中选择了代表性蛋白,并证实它们的转运肽序列高度不同。蛋白质导入实验表明,每种蛋白质都含有对蛋白质导入叶绿体至关重要的转运肽特异性序列基序。生物信息学分析确定了在已鉴定亚组成员中保守的序列基序。通过两种独立方法鉴定的序列基序几乎相同或显著重叠。此外,通过将转运肽分为多个序列亚组,预测叶绿体蛋白的准确性大大提高。基于这些数据,我们提出转运肽由多个序列亚组组成,这些亚组包含用于叶绿体靶向的独特序列基序。