Hinkle Trent B, Bakalarski Corey E
Department of Proteomic & Genomic Technologies, Genentech Inc., 1 DNA Way, South San Francisco, California 94080, United States.
Department of Computational Catalysts, Genentech Inc., 1 DNA Way, South San Francisco, California 94080, United States.
J Proteome Res. 2025 Apr 4;24(4):2135-2140. doi: 10.1021/acs.jproteome.4c00734. Epub 2025 Feb 28.
Selection and application of protein inference algorithms can have a significant impact on the data output from tandem mass spectrometry (MS/MS) experiments. However, this critical step is often taken for granted, with many studies simply utilizing the inference method embedded within the end-to-end software pipeline employed for analysis without consideration of the particular algorithm's suitability for the experiment at hand or its effects on the resulting data. Although many individual inference algorithms have been demonstrated, few unified tools are available that allow the researcher to quickly apply a variety of different inference algorithms to meet the needs of their analysis, are agnostic of other tools in the analysis pipeline, and are easy to use for the bench biologist. PyProteinInference provides a comprehensive suite of tools that enable researchers to apply different inference algorithms and compute protein-level set-based false discovery rates (FDR) from MS/MS data through a unified interface. Here, we describe the software and its application to a traditional protein inference benchmarking data set and to a K562 whole-cell lysate to demonstrate its utility in facilitating conclusions about underlying biological mechanisms in proteomic data.
蛋白质推断算法的选择和应用会对串联质谱(MS/MS)实验的数据输出产生重大影响。然而,这一关键步骤常常被视为理所当然,许多研究只是简单地使用用于分析的端到端软件流程中嵌入的推断方法,而没有考虑特定算法对手头实验的适用性或其对所得数据的影响。尽管已经展示了许多单独的推断算法,但很少有统一的工具可供研究人员快速应用各种不同的推断算法以满足其分析需求,这些工具与分析流程中的其他工具无关,并且便于实验生物学家使用。PyProteinInference提供了一套全面的工具,使研究人员能够通过统一界面应用不同的推断算法,并从MS/MS数据计算基于蛋白质水平集的错误发现率(FDR)。在此,我们描述该软件及其在传统蛋白质推断基准数据集和K562全细胞裂解物中的应用,以证明其在促进蛋白质组学数据中潜在生物学机制结论方面的效用。