ProteinExplorer: A Repository-Scale Resource for Exploration of Protein Detection in Public Mass Spectrometry Data Sets.

Publication information

J Proteome Res. 2018 Dec 7;17(12):4227-4234. doi: 10.1021/acs.jproteome.8b00496. Epub 2018 Oct 15.

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC6709584/

Abstract

High-throughput tandem mass spectrometry has enabled the detection and identification of over 75% of all proteins predicted to result in translated gene products in the human genome. In fact, the galloping rate of data acquisition and sharing of mass spectrometry data has led to the current availability of many tens of terabytes of public data in thousands of human data sets. The systematic reanalysis of these public data sets has been used to build a community-scale spectral library of 2.1 million precursors for over 1 million unique sequences from over 19,000 proteins (including spectra of synthetic peptides). However, it has remained challenging to find and inspect spectra of peptides covering functional protein regions or matching novel proteins. ProteinExplorer addresses these challenges with an intuitive interface mapping tens of millions of identifications to functional sites on nearly all human proteins while maintaining provenance for every identification back to the original data set and data file. Additionally, ProteinExplorer facilitates the selection and inspection of HPP-compliant peptides whose spectra can be matched to spectra of synthetic peptides and already includes HPP-compliant evidence for 107 missing (PE2, PE3, and PE4) and 23 dubious (PE5) proteins. Finally, ProteinExplorer allows users to rate spectra and to contribute to PrEdict (Protein Existance dictionary), a community library of peptides that map to novel proteins but whose preliminary identities have not yet been fully established with community-scale false discovery rates and synthetic peptide spectra. ProteinExplorer can now be accessed at https://massive.ucsd.edu/ProteoSAFe/protein_explorer_splash.jsp.
