Zybailov Boris, Sun Qi, van Wijk Klaas J
Department of Plant Biology, Cornell University, Ithaca, New York 14853, USA.
Anal Chem. 2009 Oct 1;81(19):8015-24. doi: 10.1021/ac9011792.
Post-translational modifications (PTMs) of proteins add to the complexity of proteomes, thereby complicating the task of proteome characterization. An efficient strategy to identify this peptide heterogeneity is important for determination of protein function, as well as for mass spectrometry-based protein quantification. Furthermore, studies of allelic variation or single nucleotide polymorphisms (SNPs) at the proteome level, as well as mRNA editing, are increasingly relevant, but validation and determination of false positive rates are challenging. Here we describe an effective workflow for large scale PTM and amino acid substitution identification based on high resolution and high mass accuracy RPLC-MS data sets. A systematic validation strategy of PTMs using RPLC retention time shifts was implemented, and a decision tree for validation is presented. This workflow was applied to Arabidopsis proteome preparations; 1.5 million MS/MS spectra were processed resulting in 20% sequence assignments, with 5% from modified sequences and matching to 2904 proteins; this high assignment rate is in part due to the high quality spectral data. A searchable modified peptide library for Arabidopsis is available online at http://ppdb.tc.cornell.edu/. We discuss confidence in peptide and PTM assignment based on the acquired data set, as well as implications for quantitative analysis of physiologically induced and preparation-related modifications.
蛋白质的翻译后修饰(PTMs)增加了蛋白质组的复杂性,从而使蛋白质组表征任务变得复杂。识别这种肽异质性的有效策略对于确定蛋白质功能以及基于质谱的蛋白质定量都很重要。此外,在蛋白质组水平上对等位基因变异或单核苷酸多态性(SNP)以及mRNA编辑的研究越来越相关,但验证和假阳性率的确定具有挑战性。在这里,我们描述了一种基于高分辨率和高质量精度反相液相色谱-质谱(RPLC-MS)数据集进行大规模PTM和氨基酸取代鉴定的有效工作流程。实施了一种使用RPLC保留时间偏移对PTM进行系统验证的策略,并给出了一个验证决策树。该工作流程应用于拟南芥蛋白质组制备;处理了150万个MS/MS谱图,序列分配率为20%,其中5%来自修饰序列,匹配到2904种蛋白质;如此高的分配率部分归因于高质量的光谱数据。一个可搜索的拟南芥修饰肽库可在http://ppdb.tc.cornell.edu/在线获取。我们基于所获得的数据集讨论了肽和PTM分配的可信度,以及对生理诱导和制备相关修饰定量分析的影响。