He Lin, Xin Lei, Shan Baozhen, Lajoie Gilles A, Ma Bin
David R. Cheriton School of Computer Science, University of Waterloo , Waterloo, Ontario N2L 3G1, Canada.
J Proteome Res. 2014 Sep 5;13(9):3881-95. doi: 10.1021/pr401115y. Epub 2014 Aug 25.
Glycosylation is one of the most commonly observed post-translational modifications (PTMs) in eukaryotes. It is believed that more than 50% eukaryotic proteins are glycosylated. To reveal the biological functions of protein-linked glycans involved in numerous biological processes, the high-throughput identification of both glycoproteins and the attached glycan structures becomes fundamentally important. Tandem mass spectrometry (MS/MS) is an effective method for glycoproteomic analysis because of its high sensitivity and selectivity. Two experimental approaches exist to obtain MS/MS spectral data of glycopeptides. One consists of isolating glycans from glycopeptides and generating MS/MS spectra of the glycans and peptides separately. The other approach produces spectra directly from intact glycopeptides. The latter approach has the advantage of retaining the glycosylation site information. However, the spectral data cannot be readily analyzed because of the lack of software specifically designed for the identification of intact glycopeptides. To address this need, we developed a novel software tool, GlycoMaster DB, to assist the automated and high-throughput identification of intact N-linked glycopeptides from MS/MS spectra. The software simultaneously searches a protein sequence database and a glycan structure database to find the best pair of peptide and glycan for each input spectrum. GlycoMaster DB can analyze mass spectral data produced with HCD/ETD mixed fragmentation, where HCD spectra are used to identify glycans and ETD spectra are used to determine peptide sequences. When only HCD spectra are available, GlycoMaster DB can still help to identify the glycans, and a list of possible peptide sequences are reported according to the accurate precursor mass and the N-linked glycopeptide sequon. GlycoMaster DB is freely accessible at http://www-novo.cs.uwaterloo.ca:8080/GlycoMasterDB .
糖基化是真核生物中最常见的翻译后修饰(PTM)之一。据信,超过50%的真核生物蛋白质都进行了糖基化。为了揭示参与众多生物过程的蛋白质连接聚糖的生物学功能,高通量鉴定糖蛋白及其连接的聚糖结构变得至关重要。串联质谱(MS/MS)因其高灵敏度和选择性,是糖蛋白质组分析的有效方法。存在两种实验方法来获取糖肽的MS/MS光谱数据。一种方法是从糖肽中分离聚糖,并分别生成聚糖和肽的MS/MS光谱。另一种方法是直接从完整的糖肽产生光谱。后一种方法的优点是保留了糖基化位点信息。然而,由于缺乏专门设计用于鉴定完整糖肽的软件,光谱数据难以分析。为满足这一需求,我们开发了一种新型软件工具GlycoMaster DB,以协助从MS/MS光谱中自动高通量鉴定完整的N-连接糖肽。该软件同时搜索蛋白质序列数据库和聚糖结构数据库,为每个输入光谱找到最佳的肽和聚糖对。GlycoMaster DB可以分析用HCD/ETD混合碎裂产生的质谱数据,其中HCD光谱用于鉴定聚糖,ETD光谱用于确定肽序列。当只有HCD光谱可用时,GlycoMaster DB仍然可以帮助鉴定聚糖,并根据精确的前体质量和N-连接糖肽序列基序报告可能的肽序列列表。可通过http://www-novo.cs.uwaterloo.ca:8080/GlycoMasterDB免费访问GlycoMaster DB。