Department of Statistics, Purdue University, West Lafayette, Indiana, USA.
Mol Cell Proteomics. 2012 Apr;11(4):M111.014662. doi: 10.1074/mcp.M111.014662. Epub 2011 Dec 21.
Selected reaction monitoring (SRM) is a targeted mass spectrometry technique that provides sensitive and accurate protein detection and quantification in complex biological mixtures. Statistical and computational tools are essential for the design and analysis of SRM experiments, particularly in studies with large sample throughput. Currently, most such tools focus on the selection of optimized transitions and on processing signals from SRM assays. Little attention is devoted to protein significance analysis, which combines the quantitative measurements for a protein across isotopic labels, peptides, charge states, transitions, samples, and conditions, and detects proteins that change in abundance between conditions while controlling the false discovery rate. We propose a statistical modeling framework for protein significance analysis. It is based on linear mixed-effects models and is applicable to most experimental designs for both isotope label-based and label-free SRM workflows. We illustrate the utility of the framework in two studies: one with a group comparison experimental design and the other with a time course experimental design. We further verify the accuracy of the framework in two controlled data sets, one from the NCI-CPTAC reproducibility investigation and the other from an in-house spike-in study. The proposed framework is sensitive and specific, produces accurate results in broad experimental circumstances, and helps to optimally design future SRM experiments. The statistical framework is implemented in an open-source R-based software package SRMstats, and can be used by researchers with a limited statistics background as a stand-alone tool or in integration with the existing computational pipelines.
选择反应监测 (SRM) 是一种靶向质谱技术,可在复杂的生物混合物中提供敏感和准确的蛋白质检测和定量。统计和计算工具对于 SRM 实验的设计和分析至关重要,特别是在具有大量样本通量的研究中。目前,大多数此类工具都集中在优化过渡的选择和处理 SRM 测定的信号上。很少关注蛋白质显著性分析,它结合了跨同位素标签、肽、电荷状态、过渡、样品和条件的蛋白质定量测量,并在控制假发现率的同时检测丰度在条件之间变化的蛋白质。我们提出了一种用于蛋白质显著性分析的统计建模框架。它基于线性混合效应模型,适用于基于同位素标记和无标记 SRM 工作流程的大多数实验设计。我们在两个研究中说明了该框架的实用性:一个是组比较实验设计,另一个是时间过程实验设计。我们进一步在两个对照数据集上验证了该框架的准确性,一个来自 NCI-CPTAC 重现性研究,另一个来自内部掺入研究。所提出的框架灵敏且特异性高,在广泛的实验环境下产生准确的结果,并有助于优化未来的 SRM 实验设计。该统计框架在基于开源 R 的软件包 SRMstats 中实现,具有有限统计学背景的研究人员可以将其用作独立工具,也可以与现有的计算管道集成使用。