Sun Shisheng, Hu Yingwei, Ao Minghui, Shah Punit, Chen Jing, Yang Weiming, Jia Xingwang, Tian Yuan, Thomas Stefani, Zhang Hui
1Department of Pathology, Johns Hopkins University, Baltimore, MD 21287 USA.
2College of Life Science, Northwest University, Xi'an, 710069 Shaanxi China.
Clin Proteomics. 2019 Sep 7;16:35. doi: 10.1186/s12014-019-9254-0. eCollection 2019.
N-linked glycoprotein is a highly interesting class of proteins for clinical and biological research. The large-scale characterization of N-linked glycoproteins accomplished by mass spectrometry-based glycoproteomics has provided valuable insights into the interdependence of glycoprotein structure and protein function. However, these studies focused mainly on the analysis of specific sample type, and lack the integration of glycoproteomic data from different tissues, body fluids or cell types.
In this study, we collected the human glycosite-containing peptides identified through their de-glycosylated forms by mass spectrometry from over 100 publications and unpublished datasets generated from our laboratory. A database resource termed -GlycositeAtlas was created and further used for the distribution analyses of glycoproteins among different human cells, tissues and body fluids. Finally, a web interface of -GlycositeAtlas was created to maximize the utility and value of the database.
The -GlycositeAtlas database contains more than 30,000 glycosite-containing peptides (representing > 14,000 N-glycosylation sites) from more than 7200 -glycoproteins from different biological sources including human-derived tissues, body fluids and cell lines from over 100 studies.
The entire human -glycoproteome database as well as 22 sub-databases associated with individual tissues or body fluids can be downloaded from the -GlycositeAtlas website at http://nglycositeatlas.biomarkercenter.org.
N-连接糖蛋白是临床和生物学研究中一类极具吸引力的蛋白质。基于质谱的糖蛋白质组学对N-连接糖蛋白进行的大规模表征,为深入了解糖蛋白结构与蛋白质功能之间的相互依存关系提供了有价值的见解。然而,这些研究主要集中于特定样本类型的分析,缺乏对来自不同组织、体液或细胞类型的糖蛋白质组数据的整合。
在本研究中,我们从100多篇出版物以及我们实验室生成的未发表数据集中,收集了通过质谱法以去糖基化形式鉴定出的含人类糖基化位点的肽段。创建了一个名为-GlycositeAtlas的数据库资源,并进一步用于分析糖蛋白在不同人类细胞、组织和体液中的分布情况。最后,创建了-GlycositeAtlas的网络界面,以最大限度地提高该数据库的实用性和价值。
-GlycositeAtlas数据库包含来自7200多种不同生物来源(包括来自100多项研究的人类组织、体液和细胞系)的糖蛋白的30000多个含糖基化位点的肽段(代表超过14000个N-糖基化位点)。
整个人类糖蛋白质组数据库以及与各个组织或体液相关的22个子数据库可从-GlycositeAtlas网站(http://nglycositeatlas.biomarkercenter.org)下载。