From the ‡Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS; University of Chinese Academy of Sciences; Institute of Computing Technology, CAS, Beijing 100190, China.
§State Key Laboratory of Proteomics; Beijing Proteome Research Center; National Center for Protein Sciences Beijing; Beijing Institute of Lifeomics, Beijing 102206, China.
Mol Cell Proteomics. 2019 Apr;18(4):773-785. doi: 10.1074/mcp.TIR118.000918. Epub 2019 Jan 8.
peptide sequencing for large-scale proteomics remains challenging because of the lack of full coverage of ion series in tandem mass spectra. We developed a mirror protease of trypsin, acetylated LysargiNase (Ac-LysargiNase), with superior activity and stability. The mirror spectrum pairs derived from the Ac-LysargiNase and trypsin treated samples can generate full and ion series, which provide mutual complementarity of each other, and allow us to develop a novel algorithm, pNovoM, for sequencing. Using pNovoM to sequence peptides of purified proteins, the accuracy of the sequence was close to 100%. More importantly, from a large-scale yeast proteome sample digested with trypsin and Ac-LysargiNase individually, 48% of all tandem mass spectra formed mirror spectrum pairs, 97% of which contained full coverage of ion series, resulting in precision sequencing of full-length peptides by pNovoM. This enabled pNovoM to successfully sequence 21,249 peptides from 3,753 proteins and interpreted 44-152% more spectra than pNovo+ and PEAKS at a 5% FDR at the spectrum level. Moreover, the mirror protease strategy had an obvious advantage in sequencing long peptides. We believe that the combination of mirror protease strategy and pNovoM will be an effective approach for precision sequencing on both single proteins and proteome samples.
用于大规模蛋白质组学的肽测序仍然具有挑战性,因为串联质谱中离子系列的覆盖度不足。我们开发了一种胰蛋白酶的镜像蛋白酶,即乙酰化赖氨酰蛋白酶(Ac-LysargiNase),它具有更高的活性和稳定性。来自 Ac-LysargiNase 和胰蛋白酶处理样品的镜像谱对可以产生完整的和离子系列,彼此提供互补性,使我们能够开发一种新的算法 pNovoM 进行测序。使用 pNovoM 对纯化蛋白的肽进行测序,序列的准确性接近 100%。更重要的是,从用胰蛋白酶和 Ac-LysargiNase 单独消化的大规模酵母蛋白质组样品中,48%的所有串联质谱形成镜像谱对,其中 97%包含完整的离子系列覆盖度,从而通过 pNovoM 实现全长肽的精确测序。这使得 pNovoM 能够成功地从 3753 个蛋白质中测序 21249 个肽,并在谱级别的 5% FDR 下比 pNovo+和 PEAKS 分别解释更多的谱,达到 44-152%。此外,镜像蛋白酶策略在测序长肽方面具有明显的优势。我们相信,镜像蛋白酶策略和 pNovoM 的结合将是对单个蛋白质和蛋白质组样品进行精确测序的有效方法。