Fijalkowski Igor, Willems Patrick, Jonckheere Veronique, Simoens Laure, Van Damme Petra
iRIP Unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, 9000 Ghent, Belgium.
Microlife. 2022 May 14;3:uqac005. doi: 10.1093/femsml/uqac005. eCollection 2022.
Genomic studies of bacteria have long pointed toward widespread prevalence of small open reading frames (sORFs) encoding for short proteins, <100 amino acids in length. Despite the mounting genomic evidence of their robust expression, relatively little progress has been made in their mass spectrometry-based detection and various blanket statements have been used to explain this observed discrepancy. In this study, we provide a large-scale riboproteogenomics investigation of the challenging nature of proteomic detection of such small proteins as informed by conditional translation data. A panel of physiochemical properties alongside recently developed mass spectrometry detectability metrics was interrogated to provide a comprehensive evidence-based assessment of sORF-encoded polypeptide (SEP) detectability. Moreover, a large-scale proteomics and translatomics compendium of proteins produced by Typhimurium (. Typhimurium), a model human pathogen, across a panel of growth conditions is presented and used in support of our SEP detectability analysis. This integrative approach is used to provide a data-driven census of small proteins expressed by . Typhimurium across growth phases and infection-relevant conditions. Taken together, our study pinpoints current limitations in proteomics-based detection of novel small proteins currently missing from bacterial genome annotations.
长期以来,对细菌的基因组研究表明,编码短蛋白(长度小于100个氨基酸)的小开放阅读框(sORF)广泛存在。尽管有越来越多的基因组证据表明它们能强劲表达,但基于质谱的检测进展相对较小,人们用各种笼统的说法来解释这种观察到的差异。在本研究中,我们根据条件翻译数据,对这类小蛋白的蛋白质组学检测的挑战性进行了大规模的核糖蛋白质组学研究。我们研究了一系列理化性质以及最近开发的质谱可检测性指标,以便对sORF编码多肽(SEP)的可检测性进行全面的、基于证据的评估。此外,还展示了由模式人类病原体鼠伤寒沙门氏菌在一系列生长条件下产生的蛋白质的大规模蛋白质组学和翻译组学汇编,并用于支持我们的SEP可检测性分析。这种综合方法用于对鼠伤寒沙门氏菌在不同生长阶段和感染相关条件下表达的小蛋白进行数据驱动的普查。总之,我们的研究指出了目前基于蛋白质组学检测细菌基因组注释中缺失的新型小蛋白的局限性。