• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质组生成器:基于从头转录组组装和高精度肽质量谱匹配的综合蛋白质组学框架。

ProteomeGenerator: A Framework for Comprehensive Proteomics Based on de Novo Transcriptome Assembly and High-Accuracy Peptide Mass Spectral Matching.

机构信息

Molecular Pharmacology Program , Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center , New York City , New York 10065 , United States.

Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology , New York University Langone Health , New York City , New York 10016 , United States.

出版信息

J Proteome Res. 2018 Nov 2;17(11):3681-3692. doi: 10.1021/acs.jproteome.8b00295. Epub 2018 Oct 19.

DOI:10.1021/acs.jproteome.8b00295
PMID:30295032
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6727203/
Abstract

Modern mass spectrometry now permits genome-scale and quantitative measurements of biological proteomes. However, analysis of specific specimens is currently hindered by the incomplete representation of biological variability of protein sequences in canonical reference proteomes and the technical demands for their construction. Here, we report ProteomeGenerator, a framework for de novo and reference-assisted proteogenomic database construction and analysis based on sample-specific transcriptome sequencing and high-accuracy mass spectrometry proteomics. This enables the assembly of proteomes encoded by actively transcribed genes, including sample-specific protein isoforms resulting from non-canonical mRNA transcription, splicing, or editing. To improve the accuracy of protein isoform identification in non-canonical proteomes, ProteomeGenerator relies on statistical target-decoy database matching calibrated using sample-specific controls. Its current implementation includes automatic integration with MaxQuant mass spectrometry proteomics algorithms. We applied this method for the proteogenomic analysis of splicing factor SRSF2 mutant leukemia cells, demonstrating high-confidence identification of non-canonical protein isoforms arising from alternative transcriptional start sites, intron retention, and cryptic exon splicing as well as improved accuracy of genome-scale proteome discovery. Additionally, we report proteogenomic performance metrics for current state-of-the-art implementations of SEQUEST HT, MaxQuant, Byonic, and PEAKS mass spectral analysis algorithms. Finally, ProteomeGenerator is implemented as a Snakemake workflow within a Singularity container for one-step installation in diverse computing environments, thereby enabling open, scalable, and facile discovery of sample-specific, non-canonical, and neomorphic biological proteomes.

摘要

现代质谱技术现在允许对生物蛋白质组进行基因组规模和定量测量。然而,特定样本的分析目前受到以下因素的限制:在规范参考蛋白质组中蛋白质序列的生物变异性的不完全代表性,以及构建这些蛋白质组的技术要求。在这里,我们报告了 ProteomeGenerator,这是一种基于样本特异性转录组测序和高精度质谱蛋白质组学的从头开始和参考辅助蛋白质基因组数据库构建和分析的框架。这使得能够组装由活跃转录基因编码的蛋白质组,包括由于非规范 mRNA 转录、剪接或编辑而产生的样本特异性蛋白质同工型。为了提高非规范蛋白质组中蛋白质同工型鉴定的准确性, ProteomeGenerator 依赖于使用样本特异性对照校准的统计目标诱饵数据库匹配。它的当前实现包括与 MaxQuant 质谱蛋白质组学算法的自动集成。我们将该方法应用于剪接因子 SRSF2 突变白血病细胞的蛋白质基因组分析,证明了从替代转录起始位点、内含子保留和隐蔽外显子剪接产生的非规范蛋白质同工型的高可信度鉴定,以及对基因组规模蛋白质组发现的准确性的提高。此外,我们报告了当前 SEQUEST HT、MaxQuant、Byonic 和 PEAKS 质谱分析算法的最先进实现的蛋白质基因组性能指标。最后, ProteomeGenerator 作为 Singularity 容器中的 Snakemake 工作流实现,可在各种计算环境中一步安装,从而实现开放、可扩展和简便的样本特异性、非规范和新生生物蛋白质组的发现。

相似文献

1
ProteomeGenerator: A Framework for Comprehensive Proteomics Based on de Novo Transcriptome Assembly and High-Accuracy Peptide Mass Spectral Matching.蛋白质组生成器:基于从头转录组组装和高精度肽质量谱匹配的综合蛋白质组学框架。
J Proteome Res. 2018 Nov 2;17(11):3681-3692. doi: 10.1021/acs.jproteome.8b00295. Epub 2018 Oct 19.
2
Identification of Differentially Expressed Splice Variants by the Proteogenomic Pipeline Splicify.通过 Proteogenomic 管道 Splicify 鉴定差异表达的剪接变体。
Mol Cell Proteomics. 2017 Oct;16(10):1850-1863. doi: 10.1074/mcp.TIR117.000056. Epub 2017 Jul 26.
3
Integrative Proteogenomics Using ProteomeGenerator2.基于 ProteomeGenerator2 的整合蛋白质基因组学
J Proteome Res. 2023 Aug 4;22(8):2750-2764. doi: 10.1021/acs.jproteome.3c00005. Epub 2023 Jul 7.
4
Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?评估蛋白质组学中的从头测序:是否已经成为数据库驱动肽鉴定的准确替代方法?
Brief Bioinform. 2018 Sep 28;19(5):954-970. doi: 10.1093/bib/bbx033.
5
Proteogenomics: From next-generation sequencing (NGS) and mass spectrometry-based proteomics to precision medicine.蛋白质基因组学:从下一代测序(NGS)和基于质谱的蛋白质组学到精准医学。
Clin Chim Acta. 2019 Nov;498:38-46. doi: 10.1016/j.cca.2019.08.010. Epub 2019 Aug 14.
6
Combination of Proteogenomics with Peptide Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications.蛋白质基因组学与肽测序相结合,可鉴定新基因和隐藏的转录后修饰。
mBio. 2019 Oct 15;10(5):e02367-19. doi: 10.1128/mBio.02367-19.
7
Identification of new protein coding sequences and signal peptidase cleavage sites of Helicobacter pylori strain 26695 by proteogenomics.通过蛋白质组学鉴定幽门螺杆菌 26695 株的新蛋白编码序列和信号肽切割位点。
J Proteomics. 2013 Jun 28;86:27-42. doi: 10.1016/j.jprot.2013.04.036. Epub 2013 May 9.
8
A tool for integrating genetic and mass spectrometry-based peptide data: Proteogenomics Viewer: PV: A genome browser-like tool, which includes MS data visualization and peptide identification parameters.一种整合基于基因和质谱的肽段数据的工具:蛋白质基因组学查看器:PV:一种类似基因组浏览器的工具,包括质谱数据可视化和肽段鉴定参数。
Bioessays. 2017 Jul;39(7). doi: 10.1002/bies.201700015. Epub 2017 Jun 5.
9
Methods for Optimization of Protein Extraction and Proteogenomic Mapping in Sweet Potato.优化甘薯中蛋白质提取和蛋白质基因组图谱的方法。
Methods Mol Biol. 2020;2139:309-324. doi: 10.1007/978-1-0716-0528-8_23.
10
Identification of novel alternative splicing biomarkers for breast cancer with LC/MS/MS and RNA-Seq.利用 LC/MS/MS 和 RNA-Seq 鉴定乳腺癌新型可变剪接生物标志物。
BMC Bioinformatics. 2020 Dec 3;21(Suppl 9):541. doi: 10.1186/s12859-020-03824-8.

引用本文的文献

1
An Automated Workflow to Address Proteome Complexity and the Large Search Space Problem in Proteomics and HLA-I Immunopeptidomics.一种用于解决蛋白质组复杂性以及蛋白质组学和HLA-I免疫肽组学中大型搜索空间问题的自动化工作流程。
Mol Cell Proteomics. 2025 Jul 21;24(9):101039. doi: 10.1016/j.mcpro.2025.101039.
2
Identification of non-canonical peptides with moPepGen.使用moPepGen鉴定非经典肽段。
Nat Biotechnol. 2025 Jun 16. doi: 10.1038/s41587-025-02701-0.
3
Integration of proteomics profiling data to facilitate discovery of cancer neoantigens: a survey.

本文引用的文献

1
MaxQuant goes Linux.MaxQuant 支持 Linux 系统。
Nat Methods. 2018 Jun;15(6):401. doi: 10.1038/s41592-018-0018-y.
2
Detection of Proteome Diversity Resulted from Alternative Splicing is Limited by Trypsin Cleavage Specificity.检测由可变剪接导致的蛋白质组多样性受到胰蛋白酶切割特异性的限制。
Mol Cell Proteomics. 2018 Mar;17(3):422-430. doi: 10.1074/mcp.RA117.000155. Epub 2017 Dec 8.
3
Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins.深度转录组注释可实现隐匿性小蛋白的发现和功能特征分析。
整合蛋白质组学分析数据以促进癌症新抗原的发现:一项综述。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf087.
4
The role and application of bioinformatics techniques and tools in drug discovery.生物信息学技术和工具在药物发现中的作用与应用。
Front Pharmacol. 2025 Feb 13;16:1547131. doi: 10.3389/fphar.2025.1547131. eCollection 2025.
5
moPepGen: Rapid and Comprehensive Identification of Non-canonical Peptides.moPepGen:非经典肽段的快速全面鉴定
bioRxiv. 2024 Nov 5:2024.03.28.587261. doi: 10.1101/2024.03.28.587261.
6
Integrative Proteogenomics Using ProteomeGenerator2.基于 ProteomeGenerator2 的整合蛋白质基因组学
J Proteome Res. 2023 Aug 4;22(8):2750-2764. doi: 10.1021/acs.jproteome.3c00005. Epub 2023 Jul 7.
7
A Drug Repurposing Pipeline Based on Bladder Cancer Integrated Proteotranscriptomics Signatures.基于膀胱癌综合蛋白质转录组学特征的药物重新利用流程
Methods Mol Biol. 2023;2684:59-99. doi: 10.1007/978-1-0716-3291-8_4.
8
Proteotranscriptomics - A facilitator in omics research.蛋白质转录组学——组学研究的促进者
Comput Struct Biotechnol J. 2022 Jul 9;20:3667-3675. doi: 10.1016/j.csbj.2022.07.007. eCollection 2022.
9
Proteogenomic Analysis of Breast Cancer Transcriptomic and Proteomic Data, Using De Novo Transcript Assembly: Genome-Wide Identification of Novel Peptides and Clinical Implications.基于从头转录组组装的乳腺癌转录组学和蛋白质组学数据的蛋白质基因组分析:新型肽的全基因组鉴定及其临床意义。
Mol Cell Proteomics. 2022 Apr;21(4):100220. doi: 10.1016/j.mcpro.2022.100220. Epub 2022 Feb 26.
10
Immunopeptidogenomics: Harnessing RNA-Seq to Illuminate the Dark Immunopeptidome.免疫肽组学:利用 RNA-Seq 照亮黑暗免疫肽组。
Mol Cell Proteomics. 2021;20:100143. doi: 10.1016/j.mcpro.2021.100143. Epub 2021 Sep 10.
Elife. 2017 Oct 30;6:e27860. doi: 10.7554/eLife.27860.
4
High Sensitivity Quantitative Proteomics Using Automated Multidimensional Nano-flow Chromatography and Accumulated Ion Monitoring on Quadrupole-Orbitrap-Linear Ion Trap Mass Spectrometer.基于自动化多维纳流色谱和四极杆-轨道阱-线性离子阱质谱联用仪的高灵敏度定量蛋白质组学研究
Mol Cell Proteomics. 2017 Nov;16(11):2006-2016. doi: 10.1074/mcp.RA117.000023. Epub 2017 Aug 18.
5
Impact of Alternative Splicing on the Human Proteome.可变剪接对人类蛋白质组的影响。
Cell Rep. 2017 Aug 1;20(5):1229-1241. doi: 10.1016/j.celrep.2017.07.025.
6
Identification of Differentially Expressed Splice Variants by the Proteogenomic Pipeline Splicify.通过 Proteogenomic 管道 Splicify 鉴定差异表达的剪接变体。
Mol Cell Proteomics. 2017 Oct;16(10):1850-1863. doi: 10.1074/mcp.TIR117.000056. Epub 2017 Jul 26.
7
Detecting protein variants by mass spectrometry: a comprehensive study in cancer cell-lines.通过质谱法检测蛋白质变体:在癌细胞系中的综合研究
Genome Med. 2017 Jul 18;9(1):62. doi: 10.1186/s13073-017-0454-9.
8
Classification and function of small open reading frames.小开放阅读框的分类与功能。
Nat Rev Mol Cell Biol. 2017 Sep;18(9):575-589. doi: 10.1038/nrm.2017.58. Epub 2017 Jul 12.
9
Functional proteogenomics reveals biomarkers and therapeutic targets in lymphomas.功能蛋白质组学揭示了淋巴瘤的生物标志物和治疗靶点。
Proc Natl Acad Sci U S A. 2017 Jun 20;114(25):6581-6586. doi: 10.1073/pnas.1701263114. Epub 2017 Jun 12.
10
Singularity: Scientific containers for mobility of compute.奇点:用于计算移动性的科学容器。
PLoS One. 2017 May 11;12(5):e0177459. doi: 10.1371/journal.pone.0177459. eCollection 2017.