Fälth Maria, Sköld Karl, Svensson Marcus, Nilsson Anna, Fenyö David, Andren Per E
Laboratory for Biological and Medical Mass Spectrometry, Biomedical Centre, Box 583, Uppsala University, SE-75123 Uppsala, Sweden.
Mol Cell Proteomics. 2007 Jul;6(7):1188-97. doi: 10.1074/mcp.M700016-MCP200. Epub 2007 Mar 30.
A new approach using targeted sequence collections has been developed for identifying endogenous peptides. This approach enables a fast, specific, and sensitive identification of endogenous peptides. Three different sequence collections were constituted in this study to mimic the peptidomic samples: SwePep precursors, SwePep peptides, and SwePep predicted. The searches for neuropeptides performed against these three sequence collections were compared with searches performed against the entire mouse proteome, which is commonly used to identify neuropeptides. These four sequence collections were searched with both Mascot and X! Tandem. Evaluation of the sequence collections was achieved using a set of manually identified and previously verified peptides. By using the three new sequence collections, which more accurately mimic the sample, 3 times as many peptides were significantly identified, with a false-positive rate below 1%, in comparison with the mouse proteome. The new sequence collections were also used to identify previously uncharacterized peptides from brain tissue; 27 previously uncharacterized peptides and potentially bioactive neuropeptides were identified. These novel peptides are cleaved from the peptide precursors at sites that are characteristic for prohormone convertases, and some of them have post-translational modifications that are characteristic for neuropeptides. The targeted protein sequence collections for different species are publicly available for download from SwePep.
一种利用靶向序列集的新方法已被开发用于鉴定内源性肽段。这种方法能够快速、特异性且灵敏地鉴定内源性肽段。在本研究中构建了三种不同的序列集来模拟肽组学样本:SwePep前体、SwePep肽段以及预测的SwePep。将针对这三种序列集进行的神经肽搜索与针对常用于鉴定神经肽的整个小鼠蛋白质组进行的搜索进行了比较。使用Mascot和X! Tandem对这四个序列集进行了搜索。使用一组经人工鉴定且先前已验证的肽段对序列集进行评估。与小鼠蛋白质组相比,通过使用更准确模拟样本的这三种新序列集,显著鉴定出的肽段数量增加了3倍,假阳性率低于1%。这些新序列集还被用于鉴定来自脑组织的先前未表征的肽段;鉴定出了27种先前未表征的肽段以及潜在的生物活性神经肽。这些新型肽段是从前体肽段在激素原转化酶特有的位点切割而来,并且其中一些具有神经肽特有的翻译后修饰。不同物种的靶向蛋白质序列集可从SwePep公开下载。