Hiss Jan A, Schneider Gisbert
Johann Wolfgang Goethe-University, D-60323 Frankfurt am Main, Germany.
Brief Bioinform. 2009 Sep;10(5):569-78. doi: 10.1093/bib/bbp030. Epub 2009 Jun 17.
Protein targeting in eukaryotic cells is vital for cell survival and development. N-terminal signal peptides guide proteins to the membrane of the endoplasmic reticulum (ER) and initiate translocation into the ER lumen. Here, we review the status of signal peptide architecture and prediction with an emphasis on exceptionally long signal peptides, which often escape the notion of the currently available prediction methods. We benchmark publicly available prediction methods for their ability to correctly identify exceptionally long signal peptides. A set of 136 annotated eukaryotic signals served as reference data. The best prediction tool detected only 63%. A potential reason for the poor performance is the domain architecture of long signal peptides, whose structural peculiarities are insufficiently considered by current prediction algorithms. To overcome this limitation, we motivate a general domain view of long signal peptides, which becomes detectable when both the overall length and secondary structure of long signal peptides are taken into consideration. This concept provides a structural framework for identifying and understanding multiple targeting and post-targeting functions.
真核细胞中的蛋白质靶向对于细胞存活和发育至关重要。N端信号肽引导蛋白质到达内质网(ER)膜,并启动转运进入内质网腔。在此,我们综述信号肽结构和预测的现状,重点关注特别长的信号肽,它们常常超出当前可用预测方法的范畴。我们对公开可用的预测方法识别特别长信号肽的能力进行基准测试。一组136个注释的真核信号用作参考数据。最佳预测工具仅检测到63%。性能不佳的一个潜在原因是长信号肽的结构域结构,当前预测算法对其结构特性考虑不足。为克服这一限制,我们倡导对长信号肽采用一种总体结构域观点,当同时考虑长信号肽的整体长度和二级结构时,这种观点就变得可检测到。这一概念为识别和理解多种靶向及靶向后功能提供了一个结构框架。