Searle Brian C, Egertson Jarrett D, Bollinger James G, Stergachis Andrew B, MacCoss Michael J
From the ‡Department of Genome Sciences, University of Washington, Seattle, Washington 98195; §Proteome Software Inc., Portland, OR 97219.
From the ‡Department of Genome Sciences, University of Washington, Seattle, Washington 98195;
Mol Cell Proteomics. 2015 Sep;14(9):2331-40. doi: 10.1074/mcp.M115.051300. Epub 2015 Jun 22.
Targeted mass spectrometry is an essential tool for detecting quantitative changes in low abundant proteins throughout the proteome. Although selected reaction monitoring (SRM) is the preferred method for quantifying peptides in complex samples, the process of designing SRM assays is laborious. Peptides have widely varying signal responses dictated by sequence-specific physiochemical properties; one major challenge is in selecting representative peptides to target as a proxy for protein abundance. Here we present PREGO, a software tool that predicts high-responding peptides for SRM experiments. PREGO predicts peptide responses with an artificial neural network trained using 11 minimally redundant, maximally relevant properties. Crucial to its success, PREGO is trained using fragment ion intensities of equimolar synthetic peptides extracted from data independent acquisition experiments. Because of similarities in instrumentation and the nature of data collection, relative peptide responses from data independent acquisition experiments are a suitable substitute for SRM experiments because they both make quantitative measurements from integrated fragment ion chromatograms. Using an SRM experiment containing 12,973 peptides from 724 synthetic proteins, PREGO exhibits a 40-85% improvement over previously published approaches at selecting high-responding peptides. These results also represent a dramatic improvement over the rules-based peptide selection approaches commonly used in the literature.
靶向质谱分析是检测蛋白质组中低丰度蛋白质定量变化的重要工具。尽管选择反应监测(SRM)是复杂样品中肽段定量的首选方法,但设计SRM分析方法的过程却很繁琐。肽段的信号响应因序列特异性理化性质而有很大差异;一个主要挑战在于选择具有代表性的肽段作为蛋白质丰度的替代指标。在此,我们展示了PREGO,一种为SRM实验预测高响应肽段的软件工具。PREGO使用一个人工神经网络来预测肽段响应,该网络是用11种最小冗余、最大相关的性质进行训练的。PREGO成功的关键在于,它使用从数据非依赖采集实验中提取的等摩尔合成肽段的碎片离子强度进行训练。由于仪器设备和数据收集性质的相似性,数据非依赖采集实验中的相对肽段响应是SRM实验的合适替代,因为它们都是从积分碎片离子色谱图进行定量测量的。在一个包含来自724种合成蛋白质的12,973个肽段的SRM实验中,PREGO在选择高响应肽段方面比先前发表的方法有40 - 85%的改进。这些结果也比文献中常用的基于规则的肽段选择方法有显著改进。