Uncovering Hidden Members and Functions of the Soil Microbiome Using Metaproteomics.

Affiliations

Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States.

Signature Sciences and Technology Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States.

Publication information

J Proteome Res. 2022 Aug 5;21(8):2023-2035. doi: 10.1021/acs.jproteome.2c00334. Epub 2022 Jul 6.

DOI:10.1021/acs.jproteome.2c00334

PMID:35793793

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9361346/

Abstract

Metaproteomics has been increasingly utilized for high-throughput characterization of proteins in complex environments and has been demonstrated to provide insights into microbial composition and functional roles. However, significant challenges remain in metaproteomic data analysis, including creation of a sample-specific protein sequence database. A well-matched database is a requirement for successful metaproteomics analysis, and the accuracy and sensitivity of PSM identification algorithms suffer when the database is incomplete or contains extraneous sequences. When matched DNA sequencing data of the sample is unavailable or incomplete, creating the proteome database that accurately represents the organisms in the sample is a challenge. Here, we leverage a peptide sequencing approach to identify the sample composition directly from metaproteomic data. First, we created a deep learning model, Kaiko, to predict the peptide sequences from mass spectrometry data and trained it on 5 million peptide-spectrum matches from 55 phylogenetically diverse bacteria. After training, Kaiko successfully identified organisms from soil isolates and synthetic communities directly from proteomics data. Finally, we created a pipeline for metaproteome database generation using Kaiko. We tested the pipeline on native soils collected in Kansas, showing that the sequencing model can be employed as an alternative and complementary method to construct the sample-specific protein database instead of relying on (un)matched metagenomes. Our pipeline identified all highly abundant taxa from 16S rRNA sequencing of the soil samples and uncovered several additional species which were strongly represented only in proteomic data.
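The database-generation idea described in the abstract (predict peptides from spectra, infer which organisms they point to, then assemble a sample-specific database from those organisms) can be illustrated with a minimal sketch. This is not Kaiko or the authors' pipeline: the function names `rank_taxa` and `build_database`, the substring-matching rule, the `top_n` cutoff, and all sequences below are hypothetical and chosen only for illustration. The real pipeline's scoring and taxon selection are not specified in the abstract.

```python
from collections import Counter


def rank_taxa(predicted_peptides, reference_proteomes):
    """Count, per taxon, how many predicted peptides occur in any of its proteins.

    Exact substring matching is an assumption made for this toy example.
    """
    hits = Counter()
    for taxon, proteins in reference_proteomes.items():
        for peptide in predicted_peptides:
            if any(peptide in protein for protein in proteins):
                hits[taxon] += 1
    return hits


def build_database(hits, reference_proteomes, top_n=2):
    """Concatenate the proteomes of the top_n taxa ranked by peptide hit count."""
    selected = [taxon for taxon, _ in hits.most_common(top_n)]
    database = [p for taxon in selected for p in reference_proteomes[taxon]]
    return selected, database


# Made-up sequences standing in for reference proteomes (not real proteins).
reference_proteomes = {
    "taxon_A": ["MKTAYIAKQRLLDE", "GSHMLEVLFQGP"],
    "taxon_B": ["MSDNELKRAVWQ", "PLTTYGHKCE"],
    "taxon_C": ["MAAAQWERTY"],
}

# Made-up peptides standing in for de novo predictions from MS/MS spectra.
predicted_peptides = ["TAYIAK", "LEVLFQ", "NELKR", "XXXXXX"]

hits = rank_taxa(predicted_peptides, reference_proteomes)
selected, database = build_database(hits, reference_proteomes, top_n=2)
print(selected, len(database))
```

In this toy setup, the taxa with the most peptide hits are kept and their proteins become the search database, which mirrors the abstract's point that a database can be assembled from the proteomic data itself when matched metagenomes are unavailable.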

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0e7/9361346/4691afa926db/pr2c00334_0001.jpg

Similar articles

MetaNovo: An open-source pipeline for probabilistic peptide discovery in complex metaproteomic datasets.

PLoS Comput Biol. 2023 Jun 16;19(6):e1011163. doi: 10.1371/journal.pcbi.1011163. eCollection 2023 Jun.

Optimizing metaproteomics database construction: lessons from a study of the vaginal microbiome.

mSystems. 2023 Aug 31;8(4):e0067822. doi: 10.1128/msystems.00678-22. Epub 2023 Jun 23.

Database-independent de novo metaproteomics of complex microbial communities.

Cell Syst. 2021 May 19;12(5):375-383.e5. doi: 10.1016/j.cels.2021.04.003. Epub 2021 Apr 30.

Navigating through metaproteomics data: a logbook of database searching.

Proteomics. 2015 Oct;15(20):3439-53. doi: 10.1002/pmic.201400560. Epub 2015 Apr 27.

[Microbial metaproteomics--From sample processing to data acquisition and analysis].

Se Pu. 2024 Jul;42(7):658-668. doi: 10.3724/SP.J.1123.2024.02009.

A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics.

PLoS Comput Biol. 2016 Dec 5;12(12):e1005224. doi: 10.1371/journal.pcbi.1005224. eCollection 2016 Dec.

A community-supported metaproteomic pipeline for improving peptide identifications in hydrothermal vent microbiota.

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab052.

Increasing the power of interpretation for soil metaproteomics data.

Microbiome. 2021 Sep 29;9(1):195. doi: 10.1186/s40168-021-01139-1.

Using high-abundance proteins as guides for fast and effective peptide/protein identification from human gut metaproteomic data.

Microbiome. 2021 Apr 1;9(1):80. doi: 10.1186/s40168-021-01035-8.

Cited by

De novo peptide databases enable protein-based stable isotope probing of microbial communities with up to species-level resolution.

Environ Microbiome. 2025 Aug 26;20(1):111. doi: 10.1186/s40793-025-00767-6.

Pairwise Attention: Leveraging Mass Differences to Enhance De Novo Sequencing of Mass Spectra.

J Proteome Res. 2025 Jul 4;24(7):3722-3730. doi: 10.1021/acs.jproteome.5c00063. Epub 2025 Jun 2.

NovoLign: metaproteomics by sequence alignment.

ISME Commun. 2024 Oct 12;4(1):ycae121. doi: 10.1093/ismeco/ycae121. eCollection 2024 Jan.

Soil microbiome characterization and its future directions with biosensing.

J Biol Eng. 2024 Sep 10;18(1):50. doi: 10.1186/s13036-024-00444-1.

[Microbial metaproteomics--From sample processing to data acquisition and analysis].

Se Pu. 2024 Jul;42(7):658-668. doi: 10.3724/SP.J.1123.2024.02009.

Mapping microhabitats of lignocellulose decomposition by a microbial consortium.

Nat Chem Biol. 2024 Aug;20(8):1033-1043. doi: 10.1038/s41589-023-01536-7. Epub 2024 Feb 1.

Mix24X, a Lab-Assembled Reference to Evaluate Interpretation Procedures for Tandem Mass Spectrometry Proteotyping of Complex Samples.

Int J Mol Sci. 2023 May 11;24(10):8634. doi: 10.3390/ijms24108634.

Interrogating the role of the milk microbiome in mastitis in the multi-omics era.

Front Microbiol. 2023 Feb 2;14:1105675. doi: 10.3389/fmicb.2023.1105675. eCollection 2023.

Current progress and critical challenges to overcome in the bioinformatics of mass spectrometry-based metaproteomics.

Comput Struct Biotechnol J. 2023 Jan 16;21:1140-1150. doi: 10.1016/j.csbj.2023.01.015. eCollection 2023.
