Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States.
J Proteome Res. 2021 Apr 2;20(4):1936-1942. doi: 10.1021/acs.jproteome.0c00954. Epub 2021 Mar 4.
Bottom-up proteomics is currently the dominant strategy for proteome analysis. It relies critically on a protease to digest proteins into peptides, which are then identified by liquid chromatography-mass spectrometry (LC-MS). The choice of protease(s) has a substantial impact on the utility of the bottom-up results obtained: protease selection determines the nature of the peptides produced, which in turn affects the ability to infer the presence and quantities of the parent proteins and post-translational modifications in the sample. We present here the software tool ProteaseGuru, which provides in silico digestions by candidate proteases, allowing evaluation of their utility for bottom-up proteomic experiments. This information is useful both for studies focused on a single protein or a small number of proteins and for analysis of entire complex proteomes. ProteaseGuru provides a convenient user interface, valuable peptide information, and data visualizations that enable comparison of digestion results across proteases. The information provided includes data tables of theoretical peptide sequences and their biophysical properties, results summaries outlining the numbers of shared and unique peptides per protease, histograms facilitating the comparison of proteome-wide proteolytic data, protein-specific summaries, and sequence coverage maps. Examples are provided of its use to inform analysis of variant-containing proteins in the human proteome, as well as for studies requiring multiple proteomic databases, such as a human:mouse xenograft model and microbiome metaproteomics.
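To make the idea of an in silico digestion concrete, the following is a minimal Python sketch, not ProteaseGuru's implementation. The cleavage rules are deliberately simplified assumptions (trypsin cuts after K or R unless followed by P, Lys-C after K, Glu-C after E), and the function names `digest` and `unique_peptides` and the example sequence are hypothetical, chosen only for illustration.

```python
import re

# Simplified cleavage rules (assumptions for illustration only; real
# protease specificities are more nuanced). Each pattern matches the
# zero-width position at which the protein backbone is cut.
CLEAVAGE_RULES = {
    "trypsin": re.compile(r"(?<=[KR])(?!P)"),  # after K or R, not before P
    "Lys-C": re.compile(r"(?<=K)"),            # after K
    "Glu-C": re.compile(r"(?<=E)"),            # after E
}


def digest(sequence, protease, missed_cleavages=0, min_length=1):
    """Return the theoretical peptides from digesting `sequence`.

    Peptides spanning up to `missed_cleavages` uncut sites are included;
    peptides shorter than `min_length` residues are dropped.
    """
    pattern = CLEAVAGE_RULES[protease]
    sites = [m.start() for m in pattern.finditer(sequence)]
    # Peptide boundaries: protein start, internal cut sites, protein end.
    bounds = [0] + [s for s in sites if 0 < s < len(sequence)] + [len(sequence)]
    peptides = []
    for i in range(len(bounds) - 1):
        last = min(i + 2 + missed_cleavages, len(bounds))
        for j in range(i + 1, last):
            peptide = sequence[bounds[i]:bounds[j]]
            if len(peptide) >= min_length:
                peptides.append(peptide)
    return peptides


def unique_peptides(sequence, proteases, **kwargs):
    """Map each protease to the peptides no other listed protease produces."""
    produced = {p: set(digest(sequence, p, **kwargs)) for p in proteases}
    return {
        p: peps - set().union(*(v for q, v in produced.items() if q != p))
        for p, peps in produced.items()
    }


if __name__ == "__main__":
    protein = "AAKPEERGGKTT"  # made-up sequence, not a real protein
    for name in CLEAVAGE_RULES:
        print(name, digest(protein, name))
    print(unique_peptides(protein, ["trypsin", "Lys-C"]))
```

Comparing the peptide sets per protease in this way mirrors, in a very reduced form, the shared-versus-unique peptide summaries described above; a real analysis would also need to consider peptide length and mass limits, modifications, and proteome-wide uniqueness.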