Department of Biology , Dalhousie University , Halifax , Nova Scotia B3H 4R2 , Canada.
J Proteome Res. 2019 Oct 4;18(10):3555-3566. doi: 10.1021/acs.jproteome.9b00144. Epub 2019 Sep 19.
Metaproteomics can provide critical information about biological systems, but peptides are found within a complex background of other peptides. This complex background can change across samples, in some cases drastically. Cofragmentation, the coelution of peptides with similar mass to charge ratios, is one factor that influences which peptides are identified in an LC-MS/MS experiment: it is dependent on the nature and complexity of this dynamic background. Metaproteomics applications are particularly susceptible to cofragmentation-induced bias; they have vast protein sequence diversity and the abundance of those proteins can span many orders of magnitude. We have developed a mechanistic model that determines the number of potentially cofragmenting peptides in a given sample (called , https://github.com/bertrand-lab/cobia ). We then used previously published data sets to validate our model, showing that the resulting peptide-specific score reflects the cofragmentation "risk" of peptides. Using an Antarctic sea ice edge metatranscriptome case study, we found that more rare taxonomic and functional groups are associated with higher cofragmentation bias. We also demonstrate how cofragmentation scores can be used to guide the selection of protein- or peptide-based biomarkers. We illustrate potential consequences of cofragmentation for multiple metaproteomic approaches, and suggest practical paths forward to cope with cofragmentation-induced bias.
代谢组学可以提供关于生物系统的关键信息,但肽在其他肽的复杂背景中被发现。这种复杂的背景可以在不同的样本中发生变化,在某些情况下变化很大。共碎裂,即具有相似质荷比的肽共同洗脱,是影响 LC-MS/MS 实验中鉴定哪些肽的一个因素:它取决于这种动态背景的性质和复杂性。代谢组学应用特别容易受到共碎裂诱导的偏差影响;它们具有广泛的蛋白质序列多样性,并且这些蛋白质的丰度可以跨越多个数量级。我们开发了一种机制模型,可以确定给定样本中潜在共碎裂肽的数量(称为 ,https://github.com/bertrand-lab/cobia)。然后,我们使用先前发表的数据集来验证我们的模型,表明得到的肽特异性得分反映了肽的共碎裂“风险”。使用南极海冰边缘宏转录组学案例研究,我们发现更多罕见的分类和功能群与更高的共碎裂偏差相关。我们还展示了如何使用共碎裂得分来指导基于蛋白质或肽的生物标志物的选择。我们说明了共碎裂对多种代谢组学方法的潜在影响,并提出了应对共碎裂诱导偏差的实用方法。