Department of Chemistry and Biology, Ryerson University, 350 Victoria Street, Toronto, Canada.
J Proteome Res. 2012 Apr 6;11(4):2032-47. doi: 10.1021/pr2000013. Epub 2012 Mar 15.
It will be important to determine if the parent and fragment ion intensity results of liquid chromatography, electrospray ionization and tandem mass spectrometry (LC-ESI-MS/MS) experiments have been randomly and independently sampled from a normal population for the purpose of statistical analysis by general linear models and ANOVA. The tryptic parent peptide and fragment ion m/z and intensity data in the mascot generic files from LC-ESI-MS/MS of purified standard proteins, and human blood protein fractionated by partition chromatography, were parsed into a Structured Query Language (SQL) database and were matched with protein and peptide sequences provided by the X!TANDEM algorithm. The many parent and/or fragment ion intensity values were log transformed, tested for normality, and analyzed using the generic Statistical Analysis System (SAS). Transformation of both parent and fragment intensity values by logarithmic functions yielded intensity distributions that closely approximate the log-normal distribution. ANOVA models of the transformed parent and fragment intensity values showed significant effects of treatments, proteins, and peptides, as well as parent versus fragment ion types, with a low probability of false positive results. Transformed parent and fragment intensity values were compared over all sample treatments, proteins or peptides by the Tukey-Kramer Honestly Significant Difference (HSD) test. The approach provided a complete and quantitative statistical analysis of LC-ESI-MS/MS data from human blood.
对于目的为通过一般线性模型和 ANOVA 进行统计分析的来自正常人群的随机和独立采样的液相色谱、电喷雾电离和串联质谱 (LC-ESI-MS/MS) 实验的亲本离子和碎片离子强度结果,确定其是否重要。从通过分配色谱分离的纯化标准蛋白质和人血蛋白质的 LC-ESI-MS/MS 的 Mascot 通用文件中的胰蛋白酶亲本肽和碎片离子 m/z 和强度数据被解析到结构化查询语言 (SQL) 数据库中,并与由 X!TANDEM 算法提供的蛋白质和肽序列相匹配。许多亲本和/或碎片离子强度值被对数转换,检验正态性,并使用通用统计分析系统 (SAS) 进行分析。亲本和片段强度值的对数转换产生的强度分布非常接近对数正态分布。转化的亲本和片段强度值的 ANOVA 模型显示处理、蛋白质和肽以及亲本与片段离子类型的显著影响,并且假阳性结果的可能性很低。通过 Tukey-Kramer Honestly Significant Difference (HSD) 检验比较了所有样品处理、蛋白质或肽的转化亲本和片段强度值。该方法提供了来自人血的 LC-ESI-MS/MS 数据的完整和定量统计分析。