• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于宏蛋白质组学中蛋白质数据库构建和高分辨率分类注释的人工培育基因组参考。

Cultivated genome references for protein database construction and high-resolution taxonomic annotation in metaproteomics.

作者信息

Gao Xiaowei, Liang Hewei, Hu Tongyuan, Zou Yuanqiang, Xiao Liang

机构信息

BGI Research, Shenzhen, China.

BGI Research, Wuhan, China.

出版信息

Microbiol Spectr. 2025 Feb 4;13(2):e0175524. doi: 10.1128/spectrum.01755-24. Epub 2024 Dec 12.

DOI:10.1128/spectrum.01755-24
PMID:39665565
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11792528/
Abstract

Metaproteomics offers a profound understanding of the functional dynamics of the gut microbiome, which is crucial for personalized healthcare strategies. The selection of an appropriate database is a critical step for the identification of peptides and proteins, as well as for the provision of accurate taxonomic and functional annotations. The matched metagenomic-derived database is considered to be the best, but its limitations include the identification of low-abundance organisms and taxonomic resolution. Herein, we constructed a protein database (DBCGR2) based on Cultivated Genome Reference 2 (CGR2) and developed a complete peptide-centric analysis workflow for database searching and for the annotation of taxonomy and function. This workflow was subsequently appraised in comparison with metagenomics-derived databases for the analysis of metaproteomic data. Our findings suggested that the performance of DBCGR2 in identification was comparable with metagenomics-derived databases with improvement in identification rates of peptides from low-abundance species. The database searching results could be fully annotated using the pepTaxa taxonomic annotation approach developed in this study, and the taxonomic resolution was enhanced to strain level. Additionally, the results demonstrated that the sensitivity of functional annotation could be enhanced by employing DBCGR2. Overall, the DBCGR2 combined with pepTaxa can be considered an alternative for metaproteomic data analysis with superior analysis performances.IMPORTANCEMass spectrometry-based metaproteomics offers a profound understanding of the gut microbial taxonomy and functionality. The databases utilized in the analysis of metaproteomic data are crucial, as they determine the identification of proteins that can be recognized and linked to overall human health, in addition to the quality of taxonomic and functional annotation. Among the most effective approaches for constructing protein databases is the utilization of metagenomic sequencing to create matched databases. However, the database, derived from isolated genomes, has yet to undergo rigorous testing for their efficacy and accuracy in protein identification and taxonomic and functional annotation. Here, we constructed a protein database DBCGR2 derived from Cultivated Genome Reference 2 (CGR2) and a complete workflow for data analysis. We compared the performances of DBCGR2 and metagenomics-derived databases. Our results indicated that DBCGR2 can be regarded as an alternative to metagenomics-derived databases, which contribute to metaproteomic data analysis.

摘要

宏蛋白质组学有助于深入了解肠道微生物群的功能动态,这对个性化医疗保健策略至关重要。选择合适的数据库是鉴定肽和蛋白质以及提供准确的分类学和功能注释的关键步骤。匹配的宏基因组衍生数据库被认为是最佳选择,但其局限性包括低丰度生物的鉴定和分类分辨率。在此,我们基于培养基因组参考2(CGR2)构建了一个蛋白质数据库(DBCGR2),并开发了一个完整的以肽为中心的分析工作流程,用于数据库搜索以及分类学和功能注释。随后,将该工作流程与宏基因组衍生数据库进行比较,以评估宏蛋白质组学数据。我们的研究结果表明,DBCGR2在鉴定方面的性能与宏基因组衍生数据库相当,且低丰度物种肽的鉴定率有所提高。使用本研究中开发的pepTaxa分类注释方法可以对数据库搜索结果进行全面注释,分类分辨率提高到菌株水平。此外,结果表明使用DBCGR2可以提高功能注释的灵敏度。总体而言,DBCGR2与pepTaxa相结合可被视为宏蛋白质组学数据分析的替代方案,具有卓越的分析性能。

重要性

基于质谱的宏蛋白质组学有助于深入了解肠道微生物分类学和功能。用于宏蛋白质组学数据分析的数据库至关重要,因为它们不仅决定了可识别的蛋白质鉴定以及与人类整体健康的关联,还决定了分类学和功能注释的质量。构建蛋白质数据库最有效的方法之一是利用宏基因组测序创建匹配的数据库。然而,源自分离基因组的数据库在蛋白质鉴定以及分类学和功能注释方面的有效性和准确性尚未经过严格测试。在此,我们构建了一个源自培养基因组参考2(CGR2)的蛋白质数据库DBCGR2以及一个完整的数据分析工作流程。我们比较了DBCGR2和宏基因组衍生数据库的性能。我们的结果表明,DBCGR2可被视为宏基因组衍生数据库在宏蛋白质组学数据分析方面的替代方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/844f22375300/spectrum.01755-24.f007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/720a1eff72ac/spectrum.01755-24.f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/27f687830ff5/spectrum.01755-24.f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/5bdfd6cb7ccc/spectrum.01755-24.f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/28e6ce5f413b/spectrum.01755-24.f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/dc6a9b523280/spectrum.01755-24.f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/c59b2b7c2529/spectrum.01755-24.f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/844f22375300/spectrum.01755-24.f007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/720a1eff72ac/spectrum.01755-24.f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/27f687830ff5/spectrum.01755-24.f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/5bdfd6cb7ccc/spectrum.01755-24.f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/28e6ce5f413b/spectrum.01755-24.f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/dc6a9b523280/spectrum.01755-24.f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/c59b2b7c2529/spectrum.01755-24.f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdb7/11792528/844f22375300/spectrum.01755-24.f007.jpg

相似文献

1
Cultivated genome references for protein database construction and high-resolution taxonomic annotation in metaproteomics.用于宏蛋白质组学中蛋白质数据库构建和高分辨率分类注释的人工培育基因组参考。
Microbiol Spectr. 2025 Feb 4;13(2):e0175524. doi: 10.1128/spectrum.01755-24. Epub 2024 Dec 12.
2
Contigs directed gene annotation (ConDiGA) for accurate protein sequence database construction in metaproteomics.宏基因组学中用于准确蛋白质序列数据库构建的 Contigs 定向基因注释(ConDiGA)。
Microbiome. 2024 Mar 19;12(1):58. doi: 10.1186/s40168-024-01775-3.
3
Using high-abundance proteins as guides for fast and effective peptide/protein identification from human gut metaproteomic data.利用高丰度蛋白质作为人类肠道宏蛋白质组学数据中快速有效肽/蛋白质鉴定的向导。
Microbiome. 2021 Apr 1;9(1):80. doi: 10.1186/s40168-021-01035-8.
4
Optimizing metaproteomics database construction: lessons from a study of the vaginal microbiome.优化宏蛋白质组学数据库构建:来自阴道微生物组研究的经验教训。
mSystems. 2023 Aug 31;8(4):e0067822. doi: 10.1128/msystems.00678-22. Epub 2023 Jun 23.
5
Database selection for shotgun metaproteomic of low-diversity dairy microbiomes.用于低多样性乳微生物组鸟枪法宏蛋白质组学的数据库选择。
Int J Food Microbiol. 2024 Jun 16;418:110706. doi: 10.1016/j.ijfoodmicro.2024.110706. Epub 2024 Apr 15.
6
MetaPro-IQ: a universal metaproteomic approach to studying human and mouse gut microbiota.MetaPro-IQ:一种用于研究人类和小鼠肠道微生物组的通用宏蛋白质组学方法。
Microbiome. 2016 Jun 24;4(1):31. doi: 10.1186/s40168-016-0176-z.
7
MetaLab: an automated pipeline for metaproteomic data analysis.MetaLab:一个用于代谢组学数据分析的自动化流程。
Microbiome. 2017 Dec 2;5(1):157. doi: 10.1186/s40168-017-0375-2.
8
Metagenomic Taxonomy-Guided Database-Searching Strategy for Improving Metaproteomic Analysis.基于宏基因组分类学的数据库检索策略可提高宏蛋白质组分析的效果。
J Proteome Res. 2018 Apr 6;17(4):1596-1605. doi: 10.1021/acs.jproteome.7b00894. Epub 2018 Feb 26.
9
[Microbial metaproteomics--From sample processing to data acquisition and analysis].[微生物元蛋白质组学——从样品处理到数据采集与分析]
Se Pu. 2024 Jul;42(7):658-668. doi: 10.3724/SP.J.1123.2024.02009.
10
A novel clinical metaproteomics workflow enables bioinformatic analysis of host-microbe dynamics in disease.一种新颖的临床代谢组学工作流程能够实现疾病中宿主-微生物动态的生物信息学分析。
mSphere. 2024 Jun 25;9(6):e0079323. doi: 10.1128/msphere.00793-23. Epub 2024 May 23.

本文引用的文献

1
Database selection for shotgun metaproteomic of low-diversity dairy microbiomes.用于低多样性乳微生物组鸟枪法宏蛋白质组学的数据库选择。
Int J Food Microbiol. 2024 Jun 16;418:110706. doi: 10.1016/j.ijfoodmicro.2024.110706. Epub 2024 Apr 15.
2
Decoupling between the genetic potential and the metabolic regulation and expression in microbial organic matter cleavage across microbiomes.微生物组间微生物有机质分解的遗传潜力与代谢调控和表达的解耦。
Microbiol Spectr. 2024 May 2;12(5):e0303623. doi: 10.1128/spectrum.03036-23. Epub 2024 Mar 21.
3
Unipept Desktop 2.0: Construction of Targeted Reference Protein Databases for Metaproteogenomics Analyses.
Unipept Desktop 2.0:用于代谢蛋白质组学分析的靶向参考蛋白质数据库的构建。
J Proteome Res. 2023 Aug 4;22(8):2620-2628. doi: 10.1021/acs.jproteome.3c00091. Epub 2023 Jul 17.
4
Advances in the clinical use of metaproteomics.代谢蛋白质组学临床应用的进展。
Expert Rev Proteomics. 2023 Apr-Jun;20(4-6):71-86. doi: 10.1080/14789450.2023.2215440. Epub 2023 May 30.
5
Positive mood-related gut microbiota in a long-term closed environment: a multiomics study based on the "Lunar Palace 365" experiment.长期密闭环境下积极情绪相关的肠道微生物组:基于“月宫 365”实验的多组学研究。
Microbiome. 2023 Apr 24;11(1):88. doi: 10.1186/s40168-023-01506-0.
6
The genomic landscape of reference genomes of cultivated human gut bacteria.人类肠道共生菌培养参考基因组的基因组景观。
Nat Commun. 2023 Mar 25;14(1):1663. doi: 10.1038/s41467-023-37396-x.
7
Data-independent acquisition boosts quantitative metaproteomics for deep characterization of gut microbiota.数据非依赖采集提高定量宏蛋白质组学深度分析肠道微生物组的能力。
NPJ Biofilms Microbiomes. 2023 Jan 24;9(1):4. doi: 10.1038/s41522-023-00373-9.
8
MetaProD: A Highly-Configurable Mass Spectrometry Analyzer for Multiplexed Proteomic and Metaproteomic Data.MetaProD:用于多重蛋白质组学和代谢蛋白质组学数据的高度可配置质谱分析仪。
J Proteome Res. 2023 Feb 3;22(2):442-453. doi: 10.1021/acs.jproteome.2c00614. Epub 2023 Jan 23.
9
MetaLab-MAG: A Metaproteomic Data Analysis Platform for Genome-Level Characterization of Microbiomes from the Metagenome-Assembled Genomes Database.MetaLab-MAG:一个元蛋白质组数据分析平台,用于从宏基因组组装基因组数据库中对微生物组进行基因组水平的特征描述。
J Proteome Res. 2023 Feb 3;22(2):387-398. doi: 10.1021/acs.jproteome.2c00554. Epub 2022 Dec 12.
10
The ProteomeXchange consortium at 10 years: 2023 update.蛋白质组交换联盟成立十周年:2023 年更新。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1539-D1548. doi: 10.1093/nar/gkac1040.