School of Basic Medical Science, Key Laboratory of Combinational Biosynthesis and Drug Discovery of Ministry of Education, School of Basic Medical Sciences, Wuhan University, Wuhan 430072, China.
State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China.
J Proteome Res. 2020 Dec 4;19(12):4808-4814. doi: 10.1021/acs.jproteome.0c00370. Epub 2020 Nov 10.
The Chromosome-Centric Human Proteome Project (C-HPP) was launched in 2012 to perfect the annotation of human protein existence by identifying stronger evidence of the expression of missing proteins (MPs) at the protein level. After an 8 year effort all over the world, the number of MPs in the neXtProt database significantly decreased from 5511 (2012-02-24) to 1899 (2020-01-17). It is now more difficult to provide confident evidence of the remaining MPs because of their specific characteristics, including low abundance, low molecular weight, unexpected modifications, transmembrane structure, tissue-expression specificity, and so on. A higher resolution mass spectrometry (MS) interpretation engine might provide an opportunity to identify these buried MPs in complex samples by the combination with multi-tissue large-scale proteomics. In this study, open-pFind was used to dig MPs from 20 pairs of healthy human tissues by Wang et al. ( 2019, 15 (2), e8503) combined with our large-scale testis data set digested by three enzymes (Glu-C, Lys-C, and trypsin) with specificity for different amino acid residues ( 2019, 18 (12), 4189-4196). A total of 1 535 536 peptides with 17 283 477 peptide-spectrum matches (PSMs) were mapped to 14 279 protein entries at a false discovery rate of <1% at the PSM, peptide, and protein levels. A total of 103 MP candidates were identified, among which 86 candidates had more unique peptide numbers compared with our single testis tissue. After rigorous screening, manual checks, peptide synthesis, and matching with documented peptides from PeptideAtlas, we validated four MPs, P0C7T8 (duodenum and small intestine), Q8WWZ4 (stomach and rectum), Q8IV35 (fallopian tube), and O14921 (tonsil), at the protein level. All MS raw files have been deposited to the ProteomeXchange with identifier PXD021391.
染色体为中心的人类蛋白质组计划(C-HPP)于 2012 年启动,旨在通过鉴定缺失蛋白(MPs)在蛋白质水平上表达的更强证据来完善人类蛋白质存在的注释。经过 8 年的全球努力,neXtProt 数据库中的 MPs 数量从 5511 个(2012-02-24)显著减少到 1899 个(2020-01-17)。由于其特殊特征,包括低丰度、低分子量、意外修饰、跨膜结构、组织表达特异性等,现在更难提供对剩余 MPs 的有信心的证据。更高分辨率的质谱(MS)解释引擎可能通过与多组织大规模蛋白质组学相结合,为识别这些复杂样品中的埋藏 MPs 提供机会。在这项研究中,Wang 等人(2019 年,15(2),e8503)结合我们使用三种对不同氨基酸残基具有特异性的酶(Glu-C、Lys-C 和胰蛋白酶)消化的大规模睾丸数据集,使用 open-pFind 从 20 对健康人体组织中挖掘 MPs。总共映射到 14279 个蛋白质条目,假发现率(FDR)<1%,PSM、肽和蛋白质水平。共鉴定出 103 个 MP 候选物,其中 86 个候选物与我们的单个睾丸组织相比具有更多的独特肽数。经过严格筛选、手动检查、肽合成以及与 PeptideAtlas 中的已有肽匹配,我们在蛋白质水平上验证了 4 个 MPs,分别是 P0C7T8(十二指肠和小肠)、Q8WWZ4(胃和直肠)、Q8IV35(输卵管)和 O14921(扁桃体)。所有 MS 原始文件都已存入 ProteomeXchange,标识符为 PXD021391。