Department of Biomedical Informatics, Vanderbilt University Medical School, Nashville, TN 37232-8575, USA.
Anal Bioanal Chem. 2012 Sep;404(4):1115-25. doi: 10.1007/s00216-012-6011-x. Epub 2012 May 3.
Spectral counting has become a widely used approach for measuring and comparing protein abundance in label-free shotgun proteomics. However, when analyzing complex samples, the ambiguity of matching between peptides and proteins greatly affects the assessment of peptide and protein inventories, differentiation, and quantification. Meanwhile, the configuration of database searching algorithms that assign peptides to MS/MS spectra may produce different results in comparative proteomic analysis. Here, we present three strategies to improve comparative proteomics through spectral counting. We show that comparing spectral counts for peptide groups rather than for protein groups forestalls problems introduced by shared peptides. We demonstrate the advantage and flexibility of this new method in two datasets. We present four models to combine four popular search engines that lead to significant gains in spectral counting differentiation. Among these models, we demonstrate a powerful vote counting model that scales well for multiple search engines. We also show that semi-tryptic searching outperforms tryptic searching for comparative proteomics. Overall, these techniques considerably improve protein differentiation on the basis of spectral count tables.
谱计数已成为一种广泛用于测量和比较无标记鸟枪法蛋白质组学中蛋白质丰度的方法。然而,在分析复杂样品时,肽与蛋白质之间的匹配模糊性极大地影响了肽和蛋白质目录、区分和定量的评估。同时,分配肽到 MS/MS 谱的数据库搜索算法的配置可能会在比较蛋白质组学分析中产生不同的结果。在这里,我们提出了三种通过谱计数改进比较蛋白质组学的策略。我们表明,比较肽组的谱计数而不是蛋白质组的谱计数可以防止由共享肽引起的问题。我们在两个数据集上展示了这种新方法的优势和灵活性。我们提出了四种组合四种流行的搜索引擎的模型,这导致了在光谱计数区分方面的显著收益。在这些模型中,我们展示了一种强大的投票计数模型,它可以很好地扩展到多个搜索引擎。我们还表明,半酶切搜索比酶切搜索更适合于比较蛋白质组学。总的来说,这些技术在基于光谱计数表的基础上大大提高了蛋白质的区分度。