Enhancing Open Modification Searches via a Combined Approach Facilitated by Ursgal.

Affiliations

Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States.

Department of Chemistry and Biochemistry, University of Bern, 3012 Bern, Switzerland.

Publication information

J Proteome Res. 2021 Apr 2;20(4):1986-1996. doi: 10.1021/acs.jproteome.0c00799. Epub 2021 Jan 29.

DOI:10.1021/acs.jproteome.0c00799

PMID:33514075

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8259620/

Abstract

The identification of peptide sequences and their post-translational modifications (PTMs) is a crucial step in the analysis of bottom-up proteomics data. The recent development of open modification search (OMS) engines allows virtually all PTMs to be searched for. This not only increases the number of spectra that can be matched to peptides but also greatly advances the understanding of the biological roles of PTMs through the identification, and the thereby facilitated quantification, of peptidoforms (peptide sequences and their potential PTMs). Whereas the benefits of combining results from multiple protein database search engines have been previously established, similar approaches for OMS results have been missing so far. Here we compare and combine results from three different OMS engines, demonstrating an increase in peptide spectrum matches of 8-18%. The unification of search results furthermore allows for the combined downstream processing of search results, including the mapping to potential PTMs. Finally, we test for the ability of OMS engines to identify glycosylated peptides. The implementation of these engines in the Python framework Ursgal facilitates the straightforward application of the OMS with unified parameters and results files, thereby enabling yet unmatched high-throughput, large-scale data analysis.
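As a rough illustration of why combining engines can raise the number of matched spectra, the sketch below takes the union of spectrum identifiers accepted by several engines and reports the gain over the best single engine. This is not the authors' combination procedure and does not use the Ursgal API; the engine names, spectrum identifiers, the function `combined_psm_gain`, and the choice of the best single engine as the baseline are all assumptions made for this example.

```python
from typing import Dict, Set


def combined_psm_gain(psms_by_engine: Dict[str, Set[str]]) -> float:
    """Percent increase of the union of matched spectra over the best single engine."""
    # Baseline (assumed): the engine that matched the most spectra on its own.
    best_single = max(len(ids) for ids in psms_by_engine.values())
    # Combined result: every spectrum matched by at least one engine.
    combined = set().union(*psms_by_engine.values())
    return 100.0 * (len(combined) - best_single) / best_single


# Hypothetical toy data: spectrum identifiers accepted by three placeholder engines.
toy_results = {
    "engine_a": {"s1", "s2", "s3", "s4", "s5", "s6", "s7", "s8", "s9", "s10"},
    "engine_b": {"s1", "s2", "s3", "s4", "s5", "s11"},
    "engine_c": {"s2", "s3", "s6", "s7", "s12"},
}

gain = combined_psm_gain(toy_results)
print(f"{gain:.1f}% more spectra matched than the best single engine")
```

In practice, a combined analysis would also need to control the false discovery rate across engines, which a plain union does not do.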

Similar articles

Ursgal, Universal Python Module Combining Common Bottom-Up Proteomics Tools for Large-Scale Analysis.

J Proteome Res. 2016 Mar 4;15(3):788-94. doi: 10.1021/acs.jproteome.5b00860. Epub 2016 Jan 13.

Hunting for unexpected post-translational modifications by spectral library searching with tier-wise scoring.

J Proteome Res. 2014 May 2;13(5):2262-71. doi: 10.1021/pr401006g. Epub 2014 Apr 2.

PIPI2: Sensitive Tag-Based Database Search to Identify Peptides with Multiple Post-translational Modifications.

J Proteome Res. 2024 Jun 7;23(6):1960-1969. doi: 10.1021/acs.jproteome.3c00819. Epub 2024 May 21.

Comparative database search engine analysis on massive tandem mass spectra of pork-based food products for halal proteomics.

J Proteomics. 2021 Jun 15;241:104240. doi: 10.1016/j.jprot.2021.104240. Epub 2021 Apr 21.

PTMiner: Localization and Quality Control of Protein Modifications Detected in an Open Search and Its Application to Comprehensive Post-translational Modification Characterization in Human Proteome.

Mol Cell Proteomics. 2019 Feb;18(2):391-405. doi: 10.1074/mcp.RA118.000812. Epub 2018 Nov 12.

Fast Open Modification Spectral Library Searching through Approximate Nearest Neighbor Indexing.

J Proteome Res. 2018 Oct 5;17(10):3463-3474. doi: 10.1021/acs.jproteome.8b00359. Epub 2018 Sep 13.

In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.

J Proteomics. 2017 Jan 6;150:170-182. doi: 10.1016/j.jprot.2016.08.002. Epub 2016 Aug 4.

Evaluation of proteomic search engines for the analysis of histone modifications.

J Proteome Res. 2014 Oct 3;13(10):4470-8. doi: 10.1021/pr5008015. Epub 2014 Sep 7.

Accelerating open modification spectral library searching on tensor core in high-dimensional space.

Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad404.

Cited by

Quorum sensing mediates morphology and motility transitions in the model archaeon .

mBio. 2025 Jun 18:e0090625. doi: 10.1128/mbio.00906-25.

Extracting informative glycan-specific ions from glycopeptide MS/MS spectra with GlyCounter.

bioRxiv. 2025 Mar 25:2025.03.24.645139. doi: 10.1101/2025.03.24.645139.

Quorum sensing mediates morphology and motility transitions in the model archaeon .

bioRxiv. 2025 Jan 14:2025.01.14.633064. doi: 10.1101/2025.01.14.633064.

Proteome-wide non-cleavable crosslink identification with MS Annika 3.0 reveals the structure of the C. elegans Box C/D complex.

Commun Chem. 2024 Dec 19;7(1):300. doi: 10.1038/s42004-024-01386-x.

Identification of structural and regulatory cell-shape determinants in Haloferax volcanii.

Nat Commun. 2024 Feb 15;15(1):1414. doi: 10.1038/s41467-024-45196-0.

Multienzyme deep learning models improve peptide de novo sequencing by mass spectrometry proteomics.

PLoS Comput Biol. 2023 Jan 20;19(1):e1010457. doi: 10.1371/journal.pcbi.1010457. eCollection 2023 Jan.

Proteomic Sample Preparation and Data Analysis in Line with the Archaeal Proteome Project.

Methods Mol Biol. 2022;2522:287-300. doi: 10.1007/978-1-0716-2445-6_18.

A peptidoform based proteomic strategy for studying functions of post-translational modifications.

Proteomics. 2022 Feb;22(4):e2100316. doi: 10.1002/pmic.202100316. Epub 2021 Dec 23.

Comprehensive glycoproteomics shines new light on the complexity and extent of glycosylation in archaea.

PLoS Biol. 2021 Jun 17;19(6):e3001277. doi: 10.1371/journal.pbio.3001277. eCollection 2021 Jun.

References

PTM-Shepherd: Analysis and Summarization of Post-Translational and Chemical Modifications From Open Search Results.

Mol Cell Proteomics. 2021;20:100018. doi: 10.1074/mcp.TIR120.002216. Epub 2020 Dec 11.

O-Pair Search with MetaMorpheus for O-glycopeptide characterization.

Nat Methods. 2020 Nov;17(11):1133-1138. doi: 10.1038/s41592-020-00985-5. Epub 2020 Oct 26.

Fast and comprehensive N- and O-glycoproteomics analysis with MSFragger-Glyco.

Nat Methods. 2020 Nov;17(11):1125-1132. doi: 10.1038/s41592-020-0967-9. Epub 2020 Oct 5.

Identification of modified peptides using localization-aware open search.

Nat Commun. 2020 Aug 13;11(1):4065. doi: 10.1038/s41467-020-17921-y.

Open Database Searching Enables the Identification and Comparison of Bacterial Glycoproteomes without Defining Glycan Compositions Prior to Searching.

Mol Cell Proteomics. 2020 Sep;19(9):1561-1574. doi: 10.1074/mcp.TIR120.002100. Epub 2020 Jun 23.

The Archaeal Proteome Project advances knowledge about archaeal cell biology through comprehensive proteomics.

Nat Commun. 2020 Jun 19;11(1):3145. doi: 10.1038/s41467-020-16784-7.

Crystal-C: A Computational Tool for Refinement of Open Search Results.

J Proteome Res. 2020 Jun 5;19(6):2511-2515. doi: 10.1021/acs.jproteome.0c00119. Epub 2020 May 8.

Proteomic and interactomic insights into the molecular basis of cell functional diversity.

Nat Rev Mol Cell Biol. 2020 Jun;21(6):327-340. doi: 10.1038/s41580-020-0231-2. Epub 2020 Mar 31.

TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry datasets.

Nat Biotechnol. 2019 Apr;37(4):469-479. doi: 10.1038/s41587-019-0067-5. Epub 2019 Apr 1.

Glycosylation in health and disease.

Nat Rev Nephrol. 2019 Jun;15(6):346-366. doi: 10.1038/s41581-019-0129-4.
