Department of Computer Science and Engineering, University of California San Diego, La Jolla, California 92093, United States.
J Proteome Res. 2013 Jun 7;12(6):2846-57. doi: 10.1021/pr400173d. Epub 2013 May 30.
Full-length de novo sequencing of unknown proteins remains a challenging open problem. Traditional methods that sequence spectra individually are limited by short peptide length, incomplete peptide fragmentation, and ambiguous de novo interpretations. We address these issues by determining consensus sequences for assembled tandem mass (MS/MS) spectra from overlapping peptides (e.g., by using multiple enzymatic digests). We have combined electron-transfer dissociation (ETD) with collision-induced dissociation (CID) and higher-energy collision-induced dissociation (HCD) fragmentation methods to boost interpretation of long, highly charged peptides and take advantage of corroborating b/y/c/z ions in CID/HCD/ETD. Using these strategies, we show that triplet CID/HCD/ETD MS/MS spectra from overlapping peptides yield de novo sequences of average length 70 AA and as long as 200 AA at up to 99% sequencing accuracy.
从头开始对未知蛋白质进行全长测序仍然是一个具有挑战性的开放问题。传统的逐个测序谱的方法受到短肽长度、不完全肽片段化和从头解释的歧义性的限制。我们通过确定来自重叠肽的串联质谱 (MS/MS) 谱的一致序列来解决这些问题(例如,使用多种酶消化)。我们将电子转移解离 (ETD) 与碰撞诱导解离 (CID) 和更高能量碰撞诱导解离 (HCD) 片段化方法相结合,以增强对长的、高电荷肽的解释,并利用 CID/HCD/ETD 中的佐证 b/y/c/z 离子。使用这些策略,我们表明来自重叠肽的三重 CID/HCD/ETD MS/MS 谱可产生平均长度为 70AA 的从头序列,最长可达 200AA,测序准确率高达 99%。