Laboratory of Protein and Peptide Pharmaceuticals & Proteomics Laboratory, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; National Center for Mathematics and Interdisciplinary Sciences, Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; Center for Cellular and Molecular Diagnostics, Department of Biochemistry and Molecular Biology, School of Medicine, Tulane University, New Orleans, Louisiana 70112.
Computer Network Information Center, Chinese Academy of Sciences, Beijing 100101, China; Center for Cellular and Molecular Diagnostics, Department of Biochemistry and Molecular Biology, School of Medicine, Tulane University, New Orleans, Louisiana 70112.
Mol Cell Proteomics. 2020 Apr;19(4):672-689. doi: 10.1074/mcp.RA119.001791. Epub 2020 Feb 26.
Large-scale identification of -linked intact glycopeptides by liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) in human serum is challenging because of the wide dynamic range of serum protein abundances, the lack of a complete serum N-glycan database and the existence of proteoforms. In this regard, a spectral library search method was presented for the identification of -linked intact glycopeptides from -linked glycoproteins in human serum with target-decoy and motif-specific false discovery rate (FDR) control. Serum proteins were firstly separated into low-abundance and high-abundance proteins by acetonitrile (ACN) precipitation. After digestion, the -linked intact glycopeptides were enriched by hydrophilic interaction liquid chromatography (HILIC) and a portion of the enriched -linked intact glycopeptides were processed by Peptide-N-Glycosidase F (PNGase F) to generate -linked deglycopeptides. Both -linked intact glycopeptides and deglycopeptides were analyzed by LC-MS/MS. From -linked deglycopeptides data sets, 764 -linked glycoproteins, 1699 -linked glycosites and 3328 unique -linked deglycopeptides were identified. Four types of -linked glycosylation motifs (NXS/T/C/V, X≠P) were used to recognize the -linked deglycopeptides. The spectra of these -linked deglycopeptides were utilized for -linked deglycopeptides library construction and identification of -linked intact glycopeptides. A database containing 739 N-glycan masses was constructed and utilized during spectral library search for the identification of -linked intact glycopeptides. In total, 526 -linked glycoproteins, 1036 -linked glycosites, 22,677 -linked intact glycopeptides and 738 N-glycan masses were identified under 1% FDR, representing the most in-depth serum N-glycoproteome identified by LC-MS/MS at -linked intact glycopeptide level.
在人类血清中,通过液相色谱-串联质谱(LC-MS/MS)对 -连接的完整糖肽进行大规模鉴定具有挑战性,这是因为血清蛋白丰度的动态范围很宽,缺乏完整的血清 N-糖基数据库,并且存在蛋白质异构体。在这方面,提出了一种谱库搜索方法,用于通过靶标 -诱饵和基于模体的特异性错误发现率(FDR)控制,从人类血清中的 -连接糖蛋白中鉴定 -连接的完整糖肽。首先通过乙腈(ACN)沉淀将血清蛋白分为低丰度和高丰度蛋白。消化后,通过亲水相互作用液相色谱(HILIC)富集 -连接的完整糖肽,并对部分富集的 -连接的完整糖肽进行肽-N-糖苷酶 F(PNGase F)处理,以生成 -连接的去糖肽。通过 LC-MS/MS 分析 -连接的完整糖肽和去糖肽。从 -连接的去糖肽数据集鉴定到 764 个 -连接糖蛋白、1699 个 -连接糖基位点和 3328 个独特的 -连接去糖肽。使用四种 -连接糖基化模体(NXS/T/C/V,X≠P)来识别 -连接的去糖肽。这些 -连接的去糖肽的谱用于构建 -连接的去糖肽文库,并鉴定 -连接的完整糖肽。构建了一个包含 739 个 N-聚糖质量的数据库,并在谱库搜索过程中用于鉴定 -连接的完整糖肽。在 1% FDR 下,共鉴定到 526 个 -连接糖蛋白、1036 个 -连接糖基位点、22677 个 -连接的完整糖肽和 738 个 N-聚糖质量,代表了在 -连接的完整糖肽水平上通过 LC-MS/MS 鉴定的最深入的血清 N-糖蛋白组。