Martinez Aude, Traverso José A, Valot Benoît, Ferro Myriam, Espagne Christelle, Ephritikhine Geneviève, Zivy Michel, Giglione Carmela, Meinnel Thierry
Institut des Sciences du Végétal, UPR2355, Centre National de la Recherche Scientifique, Gif-sur-Yvette, France.
Proteomics. 2008 Jul;8(14):2809-31. doi: 10.1002/pmic.200701191.
Most proteins in all organisms undergo crucial N-terminal modifications involving N-terminal methionine excision, N-alpha-acetylation or N-myristoylation (N-Myr), or S-palmitoylation. We investigated the occurrence of these poorly annotated but essential modifications in proteomes, focusing on eukaryotes. Experimental data for the N-terminal sequences of animal, fungi, and archaeal proteins, were used to build dedicated predictive modules in a new software. In vitro N-Myr experiments were performed with both plant and animal N-myristoyltransferases, for accurate prediction of the modification. N-terminal modifications from the fully sequenced genome of Arabidopsis thaliana were determined by MS. We identified 105 new modified protein N-termini, which were used to check the accuracy of predictive data. An accuracy of more than 95% was achieved, demonstrating (i) overall conservation of the specificity of the modification machinery in higher eukaryotes and (ii) robustness of the prediction tool. Predictions were made for various proteomes. Proteins that had undergone both N-terminal methionine (Met) cleavage and N-acetylation were found to be strongly overrepresented among the most abundant proteins, in contrast to those retaining their genuine unblocked Met. Here we propose that the nature of the second residue of an ORF is a key marker of the abundance of the mature protein in eukaryotes.
所有生物体中的大多数蛋白质都会经历关键的N端修饰,包括N端甲硫氨酸切除、N-α-乙酰化或N-肉豆蔻酰化(N-Myr),或S-棕榈酰化。我们研究了蛋白质组中这些注释不完善但必不可少的修饰的发生情况,重点关注真核生物。利用动物、真菌和古细菌蛋白质N端序列的实验数据,在一个新软件中构建了专门的预测模块。用植物和动物N-肉豆蔻酰转移酶进行了体外N-Myr实验,以准确预测这种修饰。通过质谱法确定了拟南芥全基因组中N端的修饰。我们鉴定出105个新的修饰蛋白N端,用于检验预测数据的准确性。预测准确率超过95%,这表明(i)高等真核生物中修饰机制特异性的总体保守性,以及(ii)预测工具的稳健性。对各种蛋白质组进行了预测。与保留其真正未封闭甲硫氨酸的蛋白质相比,那些经历了N端甲硫氨酸(Met)切割和N-乙酰化的蛋白质在最丰富的蛋白质中被发现严重过度表达。在此我们提出,开放阅读框第二个残基的性质是真核生物中成熟蛋白质丰度的关键标志。