• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Sipros Ensemble 可改善复杂宏蛋白质组学的数据库搜索和筛选。

Sipros Ensemble improves database searching and filtering for complex metaproteomics.

机构信息

Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37996, USA.

Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA.

出版信息

Bioinformatics. 2018 Mar 1;34(5):795-802. doi: 10.1093/bioinformatics/btx601.

DOI:10.1093/bioinformatics/btx601
PMID:29028897
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6192206/
Abstract

MOTIVATION

Complex microbial communities can be characterized by metagenomics and metaproteomics. However, metagenome assemblies often generate enormous, and yet incomplete, protein databases, which undermines the identification of peptides and proteins in metaproteomics. This challenge calls for increased discrimination of true identifications from false identifications by database searching and filtering algorithms in metaproteomics.

RESULTS

Sipros Ensemble was developed here for metaproteomics using an ensemble approach. Three diverse scoring functions from MyriMatch, Comet and the original Sipros were incorporated within a single database searching engine. Supervised classification with logistic regression was used to filter database searching results. Benchmarking with soil and marine microbial communities demonstrated a higher number of peptide and protein identifications by Sipros Ensemble than MyriMatch/Percolator, Comet/Percolator, MS-GF+/Percolator, Comet & MyriMatch/iProphet and Comet & MyriMatch & MS-GF+/iProphet. Sipros Ensemble was computationally efficient and scalable on supercomputers.

AVAILABILITY AND IMPLEMENTATION

Freely available under the GNU GPL license at http://sipros.omicsbio.org.

CONTACT

cpan@utk.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

复杂的微生物群落可以通过宏基因组学和宏蛋白质组学来描述。然而,宏基因组组装通常会生成庞大但又不完整的蛋白质数据库,这会影响到宏蛋白质组学中肽和蛋白质的鉴定。这一挑战需要通过数据库搜索和过滤算法来提高宏蛋白质组学中真实鉴定与假鉴定的区分度。

结果

本文开发了 Sipros Ensemble,用于宏蛋白质组学研究,采用集成方法。三个不同的评分函数分别来自于 MyriMatch、Comet 和原始 Sipros,整合到单个数据库搜索引擎中。使用逻辑回归进行监督分类来过滤数据库搜索结果。在土壤和海洋微生物群落中的基准测试表明,Sipros Ensemble 比 MyriMatch/Percolator、Comet/Percolator、MS-GF+/Percolator、Comet & MyriMatch/iProphet 和 Comet & MyriMatch & MS-GF+/iProphet 鉴定出更多的肽和蛋白质。Sipros Ensemble 在超级计算机上具有高效的计算能力和可扩展性。

可用性和实现

可在 http://sipros.omicsbio.org 上根据 GNU GPL 许可证免费获取。

联系人

cpan@utk.edu。

补充信息

补充数据可在 Bioinformatics 在线获取。

相似文献

1
Sipros Ensemble improves database searching and filtering for complex metaproteomics.Sipros Ensemble 可改善复杂宏蛋白质组学的数据库搜索和筛选。
Bioinformatics. 2018 Mar 1;34(5):795-802. doi: 10.1093/bioinformatics/btx601.
2
Sipros/ProRata: a versatile informatics system for quantitative community proteomics.Sipros/ProRata:一个用于定量群落蛋白质组学的多功能信息学系统。
Bioinformatics. 2013 Aug 15;29(16):2064-5. doi: 10.1093/bioinformatics/btt329. Epub 2013 Jun 21.
3
MetaLP: An integrative linear programming method for protein inference in metaproteomics.MetaLP:一种整合线性规划方法,用于宏蛋白质组学中的蛋白质推断。
PLoS Comput Biol. 2022 Oct 21;18(10):e1010603. doi: 10.1371/journal.pcbi.1010603. eCollection 2022 Oct.
4
Deep learning for peptide identification from metaproteomics datasets.基于深度学习的宏蛋白质组学数据肽段鉴定。
J Proteomics. 2021 Sep 15;247:104316. doi: 10.1016/j.jprot.2021.104316. Epub 2021 Jul 8.
5
Optimization of Search Engines and Postprocessing Approaches to Maximize Peptide and Protein Identification for High-Resolution Mass Data.优化搜索引擎和后处理方法以最大化高分辨率质谱数据的肽段和蛋白质鉴定
J Proteome Res. 2015 Nov 6;14(11):4662-73. doi: 10.1021/acs.jproteome.5b00536. Epub 2015 Sep 30.
6
Exhaustive database searching for amino acid mutations in proteomes.对蛋白质组中的氨基酸突变进行全面的数据库搜索。
Bioinformatics. 2012 Jul 15;28(14):1895-901. doi: 10.1093/bioinformatics/bts274. Epub 2012 May 10.
7
Sensitive and Specific Spectral Library Searching with CompOmics Spectral Library Searching Tool and Percolator.使用 CompOmics 光谱库检索工具和 percolator 进行敏感和特异的光谱库检索。
J Proteome Res. 2022 May 6;21(5):1365-1370. doi: 10.1021/acs.jproteome.2c00075. Epub 2022 Apr 21.
8
FineFDR: Fine-grained Taxonomy-specific False Discovery Rates Control in Metaproteomics.FineFDR:宏蛋白质组学中细粒度分类学特异性错误发现率控制
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2022 Dec;2022:287-292. doi: 10.1109/bibm55620.2022.9995401. Epub 2023 Jan 2.
9
MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis.MyriMatch:通过多变量超几何分析实现高精度串联质谱肽段鉴定
J Proteome Res. 2007 Feb;6(2):654-61. doi: 10.1021/pr0604054.
10
Proteomic stable isotope probing with an upgraded Sipros algorithm for improved identification and quantification of isotopically labeled proteins.采用升级后的 Sipros 算法进行蛋白质组学稳定同位素探测,以提高同位素标记蛋白的鉴定和定量能力。
Microbiome. 2024 Aug 8;12(1):148. doi: 10.1186/s40168-024-01866-1.

引用本文的文献

1
The microbiologist's guide to metaproteomics.微生物学家的宏蛋白质组学指南。
Imeta. 2025 May 6;4(3):e70031. doi: 10.1002/imt2.70031. eCollection 2025 Jun.
2
Absence of biofilm adhesin proteins changes surface attachment and cell strategy for Hildenborough.生物膜粘附蛋白的缺失改变了希登伯勒菌的表面附着和细胞策略。
J Bacteriol. 2025 Jan 31;207(1):e0037924. doi: 10.1128/jb.00379-24. Epub 2024 Dec 31.
3
SEMQuant: Extending Sipros-Ensemble with Match-Between-Runs for Comprehensive Quantitative Metaproteomics.SEMQuant:通过运行间匹配扩展Sipros集成方法用于全面定量宏蛋白质组学

本文引用的文献

1
Integrated proteomics and metabolomics suggests symbiotic metabolism and multimodal regulation in a fungal-endobacterial system.整合蛋白质组学和代谢组学揭示了真菌-内共生细菌系统中的共生代谢和多模式调控。
Environ Microbiol. 2017 Mar;19(3):1041-1053. doi: 10.1111/1462-2920.13605. Epub 2017 Jan 30.
2
Proteogenomic analyses indicate bacterial methylotrophy and archaeal heterotrophy are prevalent below the grass root zone.蛋白质基因组分析表明,细菌甲基营养和古菌异养在草根区以下普遍存在。
PeerJ. 2016 Nov 8;4:e2687. doi: 10.7717/peerj.2687. eCollection 2016.
3
Proteomic Stable Isotope Probing Reveals Taxonomically Distinct Patterns in Amino Acid Assimilation by Coastal Marine Bacterioplankton.
Bioinform Res Appl. 2024 Jul;14956:102-115. doi: 10.1007/978-981-97-5087-0_9. Epub 2024 Jul 12.
4
Proteomic stable isotope probing with an upgraded Sipros algorithm for improved identification and quantification of isotopically labeled proteins.采用升级后的 Sipros 算法进行蛋白质组学稳定同位素探测,以提高同位素标记蛋白的鉴定和定量能力。
Microbiome. 2024 Aug 8;12(1):148. doi: 10.1186/s40168-024-01866-1.
5
CloudProteoAnalyzer: scalable processing of big data from proteomics using cloud computing.云蛋白质组分析器:利用云计算对蛋白质组学大数据进行可扩展处理。
Bioinform Adv. 2024 Feb 23;4(1):vbae024. doi: 10.1093/bioadv/vbae024. eCollection 2024.
6
A bibliometric analysis of the global impact of metaproteomics research.宏蛋白质组学研究全球影响力的文献计量分析
Front Microbiol. 2023 Jul 5;14:1217727. doi: 10.3389/fmicb.2023.1217727. eCollection 2023.
7
FineFDR: Fine-grained Taxonomy-specific False Discovery Rates Control in Metaproteomics.FineFDR:宏蛋白质组学中细粒度分类学特异性错误发现率控制
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2022 Dec;2022:287-292. doi: 10.1109/bibm55620.2022.9995401. Epub 2023 Jan 2.
8
Functional and structural diversification of incomplete phosphotransferase system in cellulose-degrading clostridia.纤维素降解梭菌中不完全磷酸转移酶系统的功能和结构多样化。
ISME J. 2023 Jun;17(6):823-835. doi: 10.1038/s41396-023-01392-2. Epub 2023 Mar 10.
9
Cross-Feedings, Competition, and Positive and Negative Synergies in a Four-Species Synthetic Community for Anaerobic Degradation of Cellulose to Methane.四物种合成纤维素厌氧甲烷化群落中的交叉喂养、竞争以及正协同和负协同作用。
mBio. 2023 Apr 25;14(2):e0318922. doi: 10.1128/mbio.03189-22. Epub 2023 Feb 27.
10
Alterations of oral microbiota and impact on the gut microbiome in type 1 diabetes mellitus revealed by integrated multi-omic analyses.通过整合多组学分析揭示 1 型糖尿病中口腔微生物组的改变及其对肠道微生物组的影响。
Microbiome. 2022 Dec 28;10(1):243. doi: 10.1186/s40168-022-01435-4.
蛋白质组学稳定同位素示踪揭示了沿海海洋浮游细菌在氨基酸同化方面的分类学独特模式。
mSystems. 2016 Apr 26;1(2). doi: 10.1128/mSystems.00027-15. eCollection 2016 Mar-Apr.
4
Integrated Proteomic Pipeline Using Multiple Search Engines for a Proteogenomic Study with a Controlled Protein False Discovery Rate.使用多种搜索引擎的集成蛋白质组学流程用于蛋白质基因组学研究并控制蛋白质错误发现率
J Proteome Res. 2016 Nov 4;15(11):4082-4090. doi: 10.1021/acs.jproteome.6b00376. Epub 2016 Aug 30.
5
A comprehensive and scalable database search system for metaproteomics.一种用于宏蛋白质组学的全面且可扩展的数据库搜索系统。
BMC Genomics. 2016 Aug 16;17(1):642. doi: 10.1186/s12864-016-2855-3.
6
Proteomic Stable Isotope Probing Reveals Biosynthesis Dynamics of Slow Growing Methane Based Microbial Communities.蛋白质组学稳定同位素示踪揭示基于甲烷的缓慢生长微生物群落的生物合成动态
Front Microbiol. 2016 Apr 29;7:563. doi: 10.3389/fmicb.2016.00563. eCollection 2016.
7
Microbial metaproteomics for characterizing the range of metabolic functions and activities of human gut microbiota.用于表征人类肠道微生物群代谢功能和活性范围的微生物元蛋白质组学。
Proteomics. 2015 Oct;15(20):3424-38. doi: 10.1002/pmic.201400571. Epub 2015 May 28.
8
PepArML: A Meta-Search Peptide Identification Platform for Tandem Mass Spectra.PepArML:一种用于串联质谱的元搜索肽段鉴定平台。
Curr Protoc Bioinformatics. 2013 Dec;44(1323):13.23.1-23. doi: 10.1002/0471250953.bi1323s44.
9
Sigma: strain-level inference of genomes from metagenomic analysis for biosurveillance.西格玛:用于生物监测的宏基因组分析中基因组的菌株水平推断。
Bioinformatics. 2015 Jan 15;31(2):170-7. doi: 10.1093/bioinformatics/btu641. Epub 2014 Sep 29.
10
Diverse and divergent protein post-translational modifications in two growth stages of a natural microbial community.自然微生物群落两个生长阶段中多样且不同的蛋白质翻译后修饰
Nat Commun. 2014 Jul 25;5:4405. doi: 10.1038/ncomms5405.