Hartmann Erica M, Armengaud Jean
Biology and the Built Environment Center, Institute of Ecology and Evolution, University of Oregon, Eugene, OR, USA.
Proteomics. 2014 Dec;14(23-24):2637-46. doi: 10.1002/pmic.201400157. Epub 2014 Sep 23.
Proteogenomics consists of the annotation or reannotation of protein-coding nucleic acid sequences based on the empirical observation of their gene products. While functional annotation of predicted genes is increasingly feasible given the multiplicity of genomes available for many branches of the tree of life, the accurate annotation of the translational start sites is still a point of contention. Extensive coverage of the proteome, including specifically the N-termini, is now possible, thanks to next-generation mass spectrometers able to record data from thousands of proteins at once. Efforts to increase the peptide coverage of protein sequences and to detect low abundance proteins are important to make proteomic and proteogenomic studies more comprehensive. In this review, we present the panoply of N-terminus-oriented strategies that have been developed over the last decade.
蛋白质基因组学包括基于对其基因产物的实证观察对蛋白质编码核酸序列进行注释或重新注释。鉴于生命之树许多分支都有多种可用基因组,预测基因的功能注释越来越可行,但翻译起始位点的准确注释仍是一个有争议的问题。由于新一代质谱仪能够一次性记录数千种蛋白质的数据,现在已经能够对蛋白质组进行广泛覆盖,特别是包括N端。提高蛋白质序列的肽段覆盖率以及检测低丰度蛋白质的努力对于使蛋白质组学和蛋白质基因组学研究更全面非常重要。在本综述中,我们介绍了过去十年中开发的一系列以N端为导向的策略。