Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to Be University), Mangalore, India.
Institute of Bioinformatics, International Technology Park, Bangalore, India.
Mol Cell Proteomics. 2023 May;22(5):100533. doi: 10.1016/j.mcpro.2023.100533. Epub 2023 Mar 21.
Mycobacterium avium is one of the prominent disease-causing bacteria in humans. It causes lymphadenitis as well as chronic, extrapulmonary, and disseminated infections in adults, children, and immunocompromised patients. M. avium has ∼4500 predicted protein-coding regions on average, which can help discover several variants at the proteome level. Many of these proteins are potentially associated with virulence; identifying them can therefore help in developing panel-based theranostics. In line with this long-term goal, we carried out an in-depth proteomic analysis of M. avium using both data-dependent and data-independent acquisition methods. Further, a set of proteogenomic investigations was carried out using (i) a protein database for Mycobacterium tuberculosis, (ii) an M. avium genome six-frame-translated database, and (iii) a variant protein database of M. avium. A search of the mass spectrometry data against the M. avium protein database identified 2954 proteins. The proteogenomic analyses additionally identified 1301 novel peptide sequences and corrected the translation start sites of 15 proteins. Finally, we created a spectral library of M. avium proteins, including the novel genome search-specific peptides and variant peptides detected in this study, and validated it by data-independent acquisition of the M. avium proteome. We thus present an M. avium spectral library of 29,033 peptide precursors supported by 0.4 million fragment ions for further use by the biomedical community.
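The abstract mentions searching spectra against a six-frame-translated genome database. As a rough illustration of what six-frame translation means, the Python sketch below translates a DNA string in all three forward and three reverse-complement reading frames. It is not the authors' pipeline: the function names (`reverse_complement`, `translate`, `six_frame_translate`) are hypothetical, the codon table is the standard amino-acid mapping, and a real proteogenomic workflow would additionally split the frames at stop codons and digest them in silico before the database search.

```python
# Illustrative six-frame translation; names here are hypothetical, not from the paper.
BASES = "TCAG"
# Standard codon-to-amino-acid mapping in NCBI order (stop codons shown as "*").
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return dna.translate(_COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons from position 0; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translate(dna: str) -> dict[str, str]:
    """Translate a sequence in frames +1, +2, +3 and -1, -2, -3."""
    dna = dna.upper()
    rev = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(dna[offset:])
        frames[f"-{offset + 1}"] = translate(rev[offset:])
    return frames


if __name__ == "__main__":
    for frame, protein in six_frame_translate("ATGGCCTAA").items():
        print(frame, protein)
```

Peptides that match only such translated frames, and not the annotated protein database, are the kind of evidence that can point to unannotated coding regions or revised translation start sites.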