Department of Molecular Medicine and Biopharmaceutical Sciences, Graduate School of Convergence Science and Technology and College of Medicine or College of Pharmacy, Seoul National University, Seoul, 03080, South Korea.
Department of Systems Immunology, Division of Biomedical Convergence, College of Biomedical Science, Kangwon National University, Gangwon, 24341, South Korea.
Sci Rep. 2019 Sep 20;9(1):13653. doi: 10.1038/s41598-019-49665-1.
Mass spectrometry-based spectral count has been a common choice of label-free proteome quantification due to the simplicity for the sample preparation and data generation. The discriminatory nature of spectral count in the MS data-dependent acquisition, however, inherently introduces the spectral count variation for low-abundance proteins in multiplicative LC-MS/MS analysis, which hampers sensitive proteome quantification. As many low-abundance proteins play important roles in cellular processes, deducing low-abundance proteins in a quantitatively reliable manner greatly expands the depth of biological insights. Here, we implemented the Moment Adjusted Imputation error model in the spectral count refinement as a post PLGEM-STN for improving sensitivity for quantitation of low-abundance proteins by reducing spectral count variability. The statistical framework, automated spectral count refinement by integrating the two statistical tools, was tested with LC-MS/MS datasets of MDA-MB468 breast cancer cells grown under normal and glucose deprivation conditions. We identified about 30% more quantifiable proteins that were found to be low-abundance proteins, which were initially filtered out by the PLGEM-STN analysis. This newly developed statistical framework provides a reliable abundance measurement of low-abundance proteins in the spectral count-based label-free proteome quantification and enabled us to detect low-abundance proteins that could be functionally important in cellular processes.
基于质谱的谱计数一直是一种常用的无标记蛋白质组定量选择,因为其样品制备和数据生成简单。然而,在 MS 数据依赖采集中,谱计数的区分性质固有地引入了 LC-MS/MS 分析中低丰度蛋白质的谱计数变化,这阻碍了敏感的蛋白质组定量。由于许多低丰度蛋白质在细胞过程中起着重要作用,以定量可靠的方式推断低丰度蛋白质极大地扩展了生物学见解的深度。在这里,我们在谱计数精修中实现了矩调整插补误差模型作为 PLGEM-STN 的后处理,通过减少谱计数变异性来提高低丰度蛋白质定量的灵敏度。该统计框架通过整合两种统计工具来自动进行谱计数精修,在正常和葡萄糖剥夺条件下生长的 MDA-MB468 乳腺癌细胞的 LC-MS/MS 数据集上进行了测试。我们鉴定出了约 30%的可定量蛋白质,这些蛋白质最初被 PLGEM-STN 分析过滤掉了。这个新开发的统计框架为基于谱计数的无标记蛋白质组定量中的低丰度蛋白质提供了可靠的丰度测量,并使我们能够检测到在细胞过程中可能具有功能重要性的低丰度蛋白质。