Gupta Manoj Kumar, Jayaram Savita, Madugundu Anil K, Chavan Sandip, Advani Jayshree, Pandey Akhilesh, Thongboonkerd Visith, Sirdeshmukh Ravi
Institute of Bioinformatics, International Tech Park, Bangalore 560066, India.
J Proteome Res. 2014 Jul 3;13(7):3178-90. doi: 10.1021/pr500023p. Epub 2014 Jun 24.
In line with the aims of the Chromosome-centric Human Proteome Project (C-HPP) to completely annotate proteins of each chromosome and biology/disease driven HPP (B/D-HPP) to decipher their relation to diseases, we have generated a nonredundant catalogue of protein-coding genes for Chromosome 12 (Chr. 12) and further annotated proteins associated with major neurological disorders. Integrating high level proteomic evidence from four major databases (neXtProt, Global Proteome Machine (GPMdb), PeptideAtlas, and Human Protein Atlas (HPA)) along with Ensembl data resource resulted in the identification of 1066 protein coding genes, of which 171 were defined as "missing proteins" based on the weak or complete absence of experimental evidence. With functional annotations using DAVID and GAD, about 40% of the proteins could be grouped as brain related with implications in cancer or neurological disorders. We used published and unpublished high confidence mass spectrometry data from our group and other literature consisting of more than 5000 proteins derived from clinical specimens from patients with human gliomas, Alzheimer's disease, and Parkinson's disease and mapped it onto Chr. 12. We observed a total of 202 proteins mapping to human Chr. 12, 136 of which were differentially expressed in these disease conditions as compared to the normal. Functional grouping indicated their association with cell cycle, cell-to-cell signaling, and other important processes and networks, whereas their disease association analysis confirmed neurological diseases and cancer as the major group along with psycological disorders, with several overexpressed genes/proteins mapping to 12q13-15 amplicon region. Using multiple strategies and bioinformatics tools, we identified 103 differentially expressed proteins to have secretory potential, 17 of which have already been reported in direct analysis of the plasma or cerebrospinal fluid (CSF) from the patients and 21 of them mapped to cancer associated protein (CAPs) database that are amenable to selective reaction monitoring (SRM) assays for targeted proteomic analysis. Our analysis also reveals, for the first time, mass spectrometric evidence for two "missing proteins" from Chr. 12, namely, synaptic vesicle 2-related protein (SVOP) and IQ motif containing D (IQCD). The analysis provides a snapshot of Chr. 12 encoded proteins associated with gliomas and major neurological conditions and their secretability which can be used to drive efforts for clinical applications.
为了实现以染色体为中心的人类蛋白质组计划(C-HPP)对每条染色体上的蛋白质进行全面注释以及生物学/疾病驱动的人类蛋白质组计划(B/D-HPP)来解读其与疾病关系的目标,我们生成了12号染色体(Chr. 12)的蛋白质编码基因非冗余目录,并进一步注释了与主要神经系统疾病相关的蛋白质。整合来自四个主要数据库(neXtProt、全球蛋白质组机器数据库(GPMdb)、肽图集和人类蛋白质图谱(HPA))的高级蛋白质组学证据以及Ensembl数据资源,共鉴定出1066个蛋白质编码基因,其中171个基于实验证据薄弱或完全缺失被定义为“缺失蛋白质”。通过使用DAVID和GAD进行功能注释,约40%的蛋白质可归类为与大脑相关,与癌症或神经系统疾病有关。我们使用了来自我们团队已发表和未发表的高可信度质谱数据以及其他文献,这些数据包含来自人类胶质瘤、阿尔茨海默病和帕金森病患者临床标本的5000多种蛋白质,并将其映射到Chr. 12上。我们共观察到202种蛋白质映射到人类Chr. 12,其中136种在这些疾病状态下与正常情况相比存在差异表达。功能分组表明它们与细胞周期、细胞间信号传导以及其他重要过程和网络相关,而它们的疾病关联分析证实神经系统疾病和癌症是主要类别,同时还有心理障碍,有几个过表达的基因/蛋白质映射到12q13 - 15扩增区域。通过多种策略和生物信息学工具,我们鉴定出103种具有分泌潜力的差异表达蛋白质,其中17种已在对患者血浆或脑脊液(CSF)的直接分析中被报道,21种映射到癌症相关蛋白质(CAPs)数据库,可用于靶向蛋白质组分析的选择性反应监测(SRM)检测。我们的分析还首次揭示了Chr. 12上两种“缺失蛋白质”的质谱证据,即突触小泡2相关蛋白(SVOP)和含IQ基序的D蛋白(IQCD)。该分析提供了与胶质瘤和主要神经系统疾病相关的Chr. 12编码蛋白质及其分泌能力的概况,可用于推动临床应用研究。