Vaca Jacome Alvaro Sebastian, Rabilloud Thierry, Schaeffer-Reiss Christine, Rompais Magali, Ayoub Daniel, Lane Lydie, Bairoch Amos, Van Dorsselaer Alain, Carapito Christine
BioOrganic Mass Spectrometry Laboratory (LSMBO), Université de Strasbourg, IPHC, Strasbourg, France.
IPHC, CNRS, UMR7178, Strasbourg, France.
Proteomics. 2015 Jul;15(14):2519-24. doi: 10.1002/pmic.201400617. Epub 2015 Jun 8.
The high throughput characterization of protein N-termini is becoming an emerging challenge in the proteomics and proteogenomics fields. The present study describes the free N-terminome analysis of human mitochondria-enriched samples using trimethoxyphenyl phosphonium (TMPP) labelling approaches. Owing to the extent of protein import and cleavage for mitochondrial proteins, determining the new N-termini generated after translocation/processing events for mitochondrial proteins is crucial to understand the transformation of precursors to mature proteins. The doublet N-terminal oriented proteomics (dN-TOP) strategy based on a double light/heavy TMPP labelling has been optimized in order to improve and automate the workflow for efficient, fast and reliable high throughput N-terminome analysis. A total of 2714 proteins were identified and 897 N-terminal peptides were characterized (424 N-α-acetylated and 473 TMPP-labelled peptides). These results allowed the precise identification of the N-terminus of 693 unique proteins corresponding to 26% of all identified proteins. Overall, 120 already annotated processing cleavage sites were confirmed while 302 new cleavage sites were characterized. The accumulation of experimental evidence of mature N-termini should allow increasing the knowledge of processing mechanisms and consequently also enhance cleavage sites prediction algorithms. Complete datasets have been deposited to the ProteomeXchange Consortium with identifiers PXD001521, PXD001522 and PXD001523 (http://proteomecentral.proteomexchange.org/dataset/PXD001521, http://proteomecentral.proteomexchange.org/dataset/PXD0001522 and http://proteomecentral.proteomexchange.org/dataset/PXD001523, respectively).
蛋白质N端的高通量表征正成为蛋白质组学和蛋白质基因组学领域中一个新出现的挑战。本研究描述了使用三甲氧基苯基鏻(TMPP)标记方法对富含人线粒体的样品进行游离N端蛋白质组分析。由于线粒体蛋白质的导入和切割程度,确定线粒体蛋白质在转运/加工事件后产生的新N端对于理解前体向成熟蛋白质的转变至关重要。基于双轻/重TMPP标记的双重N端导向蛋白质组学(dN-TOP)策略已得到优化,以改进和自动化工作流程,实现高效、快速且可靠的高通量N端蛋白质组分析。总共鉴定出2714种蛋白质,并对897个N端肽段进行了表征(424个N-α-乙酰化肽段和473个TMPP标记肽段)。这些结果使得能够精确鉴定693种独特蛋白质的N端,占所有鉴定蛋白质的26%。总体而言,确认了120个已注释的加工切割位点,同时表征了302个新的切割位点。成熟N端实验证据的积累应有助于增加对加工机制的了解,从而也能增强切割位点预测算法。完整的数据集已存入蛋白质组交换联盟,标识符分别为PXD001521、PXD001522和PXD001523(分别为http://proteomecentral.proteomexchange.org/dataset/PXD001521、http://proteomecentral.proteomexchange.org/dataset/PXD0001522和http://proteomecentral.proteomexchange.org/dataset/PXD001523)。