使用真实微生物组数据集对宏蛋白质组学的光谱库和数据库搜索方法进行基准测试。

Benchmarking Spectral Library and Database Search Approaches for Metaproteomics Using a Ground-Truth Microbiome Dataset.

作者信息

Rajczewski Andrew T, Mehta Subina, Wagner Reid, Gabriel Wassim, Johnson James, Do Katherine, Vintila Simina, Wilhelm Mathias, Kleiner Manuel, Searle Brian C, Griffin Timothy J, Jagtap Pratik D

机构信息

University of Minnesota, Minneapolis, MN.

Computational Mass Spectrometry, Technical University of Munich, Freising, Germany.

出版信息

bioRxiv. 2025 May 20:2025.05.15.654320. doi: 10.1101/2025.05.15.654320.

DOI:10.1101/2025.05.15.654320

PMID:40475569

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12139738/

Abstract

Mass spectrometry-based metaproteomics, the identification and quantification of thousands of proteins expressed by complex microbial communities, has become pivotal for unraveling functional interactions within microbiomes. However, metaproteomics data analysis encounters many challenges, including the search of tandem mass spectra against a protein sequence database using proteomics database search algorithms. We used a ground-truth dataset to assess a spectral library searching method against established database searching approaches. Mass spectrometry data collected by data-dependent acquisition (DDA-MS) was analyzed using database searching approaches (MaxQuant and FragPipe), as well as using Scribe with Prosit predicted spectral libraries. We used FASTA databases that included protein sequences from microbial species present in the ground-truth dataset along with background protein sequences, to estimate error rates and assess the effects on detection, peptide-spectral match quality, and quantification. Using the Scribe search engine resulted in more proteins detected at a 1% false discovery rate (FDR) compared to MaxQuant or FragPipe, while FragPipe detected more peptides verified by PepQuery. Scribe was able to detect more low-abundance proteins in the microbiome dataset and was more accurate in quantifying the microbial community composition. This research provides insights and guidance for metaproteomics researchers aiming to optimize results in their analysis of DDA-MS data.

摘要

基于质谱的宏蛋白质组学，即对复杂微生物群落表达的数千种蛋白质进行鉴定和定量，已成为揭示微生物组内功能相互作用的关键。然而，宏蛋白质组学数据分析面临许多挑战，包括使用蛋白质组学数据库搜索算法在蛋白质序列数据库中搜索串联质谱。我们使用了一个真实数据集，以评估一种光谱库搜索方法与既定的数据库搜索方法。通过数据依赖采集（DDA-MS）收集的质谱数据使用数据库搜索方法（MaxQuant和FragPipe）进行分析，以及使用带有Prosit预测光谱库的Scribe进行分析。我们使用了FASTA数据库，其中包括真实数据集中存在的微生物物种的蛋白质序列以及背景蛋白质序列，以估计错误率并评估对检测、肽-光谱匹配质量和定量的影响。与MaxQuant或FragPipe相比，使用Scribe搜索引擎在1%的错误发现率（FDR）下检测到更多蛋白质，而FragPipe检测到更多经PepQuery验证的肽段。Scribe能够在微生物组数据集中检测到更多低丰度蛋白质，并且在定量微生物群落组成方面更准确。这项研究为旨在优化DDA-MS数据分析结果的宏蛋白质组学研究人员提供了见解和指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d79/12139738/475fdee9a2be/nihpp-2025.05.15.654320v1-f0001.jpg

相似文献

Benchmarking Spectral Library and Database Search Approaches for Metaproteomics Using a Ground-Truth Microbiome Dataset.使用真实微生物组数据集对宏蛋白质组学的光谱库和数据库搜索方法进行基准测试。

bioRxiv. 2025 May 20:2025.05.15.654320. doi: 10.1101/2025.05.15.654320.

Scribe: Next Generation Library Searching for DDA Experiments.记录员：下一代库搜索 DDA 实验。

J Proteome Res. 2023 Feb 3;22(2):482-490. doi: 10.1021/acs.jproteome.2c00672. Epub 2023 Jan 25.

MetaPep: A core peptide database for faster human gut metaproteomics database searches.MetaPep：用于更快地搜索人类肠道宏蛋白质组学数据库的核心肽段数据库。

Comput Struct Biotechnol J. 2023 Aug 29;21:4228-4237. doi: 10.1016/j.csbj.2023.08.025. eCollection 2023.

metaSpectraST: an unsupervised and database-independent analysis workflow for metaproteomic MS/MS data using spectrum clustering.metaSpectraST：一种使用谱聚类的无监督且与数据库无关的代谢组学 MS/MS 数据分析工作流程。

Microbiome. 2023 Aug 7;11(1):176. doi: 10.1186/s40168-023-01602-1.

Data-Independent Acquisition Mass Spectrometry as a Tool for Metaproteomics: Interlaboratory Comparison Using a Model Microbiome.数据非依赖型采集质谱技术作为宏蛋白质组学的工具：使用模型微生物群落进行实验室间比较

bioRxiv. 2025 Jan 2:2024.09.18.613707. doi: 10.1101/2024.09.18.613707.

Comparative database search engine analysis on massive tandem mass spectra of pork-based food products for halal proteomics.基于猪肉的食品清真蛋白质组学大规模串联质谱的比较数据库搜索引擎分析

J Proteomics. 2021 Jun 15;241:104240. doi: 10.1016/j.jprot.2021.104240. Epub 2021 Apr 21.

Mistle: bringing spectral library predictions to metaproteomics with an efficient search index.Mistle：利用高效搜索索引将光谱库预测引入宏蛋白质组学。

Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad376.

Proteomics. 2025 May;25(9-10):e202400187. doi: 10.1002/pmic.202400187. Epub 2025 Apr 10.

Increasing taxonomic and functional characterization of host-microbiome interactions by DIA-PASEF metaproteomics.通过数据独立采集-并行累积连续碎裂（DIA-PASEF）宏蛋白质组学增强宿主-微生物组相互作用的分类学和功能表征。

Front Microbiol. 2023 Oct 16;14:1258703. doi: 10.3389/fmicb.2023.1258703. eCollection 2023.

A comprehensive and scalable database search system for metaproteomics.一种用于宏蛋白质组学的全面且可扩展的数据库搜索系统。

BMC Genomics. 2016 Aug 16;17(1):642. doi: 10.1186/s12864-016-2855-3.

本文引用的文献

The microbiologist's guide to metaproteomics.微生物学家的宏蛋白质组学指南。

Imeta. 2025 May 6;4(3):e70031. doi: 10.1002/imt2.70031. eCollection 2025 Jun.

Proteomics. 2025 May;25(9-10):e202400187. doi: 10.1002/pmic.202400187. Epub 2025 Apr 10.

Clinical Microbiome Analysis by Mass Spectrometry-Based Metaproteomics.基于质谱的宏蛋白质组学进行临床微生物组分析

Annu Rev Anal Chem (Palo Alto Calif). 2025 May;18(1):149-172. doi: 10.1146/annurev-anchem-071124-113819. Epub 2025 Jan 15.

Oktoberfest: Open-source spectral library generation and rescoring pipeline based on Prosit.慕尼黑啤酒节：基于 Prosit 的开源光谱库生成和重评分管道。

Proteomics. 2024 Apr;24(8):e2300112. doi: 10.1002/pmic.202300112. Epub 2023 Sep 6.

MetaNovo: An open-source pipeline for probabilistic peptide discovery in complex metaproteomic datasets.MetaNovo：用于复杂宏蛋白质组学数据中概率肽发现的开源管道。

PLoS Comput Biol. 2023 Jun 16;19(6):e1011163. doi: 10.1371/journal.pcbi.1011163. eCollection 2023 Jun.

Mistle: bringing spectral library predictions to metaproteomics with an efficient search index.Mistle：利用高效搜索索引将光谱库预测引入宏蛋白质组学。

Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad376.

PepQuery2 democratizes public MS proteomics data for rapid peptide searching. PepQuery2 使公共 MS 蛋白质组学数据民主化，便于快速进行肽搜索。

Nat Commun. 2023 Apr 18;14(1):2213. doi: 10.1038/s41467-023-37462-4.

Scribe: Next Generation Library Searching for DDA Experiments.记录员：下一代库搜索 DDA 实验。

J Proteome Res. 2023 Feb 3;22(2):482-490. doi: 10.1021/acs.jproteome.2c00672. Epub 2023 Jan 25.

Data-independent acquisition boosts quantitative metaproteomics for deep characterization of gut microbiota.数据非依赖采集提高定量宏蛋白质组学深度分析肠道微生物组的能力。

NPJ Biofilms Microbiomes. 2023 Jan 24;9(1):4. doi: 10.1038/s41522-023-00373-9.

Gut microbiome dysregulation drives bone damage in broiler tibial dyschondroplasia by disrupting glucose homeostasis.肠道微生物组失调通过破坏葡萄糖内环境稳态导致肉鸡胫骨软骨发育不良的骨损伤。

NPJ Biofilms Microbiomes. 2023 Jan 3;9(1):1. doi: 10.1038/s41522-022-00360-6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用真实微生物组数据集对宏蛋白质组学的光谱库和数据库搜索方法进行基准测试。

Benchmarking Spectral Library and Database Search Approaches for Metaproteomics Using a Ground-Truth Microbiome Dataset.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献