Department of Chemistry , University of Wisconsin , 1101 University Avenue , Madison , Wisconsin 53706 , United States.
J Proteome Res. 2019 Sep 6;18(9):3429-3438. doi: 10.1021/acs.jproteome.9b00330. Epub 2019 Aug 23.
Peptides detected by tandem mass spectrometry (MS/MS) in bottom-up proteomics serve as proxies for the proteins expressed in the sample. Protein inference is a process routinely applied to these peptides to generate a plausible list of candidate protein identifications. The use of multiple proteases for parallel protein digestions expands sequence coverage, provides additional peptide identifications, and increases the probability of identifying peptides that are unique to a single protein, which are all valuable for protein inference. We have developed and implemented a multi-protease protein inference algorithm in MetaMorpheus, a bottom-up search software program, which incorporates the calculation of protease-specific -values and preserves the association of peptide sequences and their protease of origin. This integrated multi-protease protein inference algorithm provides more accurate results than either the aggregation of results from the separate analysis of the peptide identifications produced by each protease (separate approach) in MetaMorpheus, or results that are obtained using Fido, ProteinProphet, or DTASelect2. MetaMorpheus' integrated multi-protease data analysis decreases the ambiguity of the protein group list, reduces the frequency of erroneous identifications, and increases the number of post-translational modifications identified, while combining multi-protease search and protein inference into a single software program.
通过串联质谱(MS/MS)检测到的肽作为样本中表达的蛋白质的代表。蛋白质推断是一种常规应用于这些肽的过程,以生成合理的候选蛋白质鉴定列表。使用多种蛋白酶进行平行蛋白质消化可扩展序列覆盖范围,提供更多的肽鉴定,并增加鉴定仅特定于单个蛋白质的肽的概率,所有这些都对蛋白质推断有价值。我们已经在 MetaMorpheus 中开发并实现了一种多蛋白酶蛋白质推断算法,这是一种从头搜索软件程序,其中包括蛋白酶特异性 - 值的计算,并保留肽序列及其原始蛋白酶的关联。与 MetaMorpheus 中使用单独的蛋白酶分别分析每个蛋白酶产生的肽鉴定的结果(单独方法)的聚合结果,或者使用 Fido、ProteinProphet 或 DTASelect2 获得的结果相比,这种集成的多蛋白酶蛋白质推断算法提供了更准确的结果。MetaMorpheus 的集成多蛋白酶数据分析减少了蛋白质组列表的歧义,减少了错误鉴定的频率,并增加了鉴定的翻译后修饰数量,同时将多蛋白酶搜索和蛋白质推断结合到单个软件程序中。