Hart-Smith Gene, Reis Rodrigo S, Waterhouse Peter M, Wilkins Marc R
NSW Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia.
School of Biological Sciences, University of Sydney, Sydney, NSW, Australia.
Front Plant Sci. 2017 Sep 27;8:1669. doi: 10.3389/fpls.2017.01669. eCollection 2017.
Quantitative proteomics strategies - which are playing important roles in the expanding field of plant molecular systems biology - are traditionally designated as either hypothesis driven or non-hypothesis driven. Many of these strategies aim to select individual peptide ions for tandem mass spectrometry (MS/MS), and to do this mixed hypothesis driven and non-hypothesis driven approaches are theoretically simple to implement. In-depth investigations into the efficacies of such approaches have, however, yet to be described. In this study, using combined samples of unlabeled and metabolically N-labeled proteins, we investigate the mixed use of targeted data acquisition (TDA) and data dependent acquisition (DDA) - referred to as TDA/DDA - to facilitate both hypothesis driven and non-hypothesis driven quantitative data collection in individual LC-MS/MS experiments. To investigate TDA/DDA for hypothesis driven data collection, 7 miRNA target proteins of differing size and abundance were targeted using inclusion lists comprised of 1558 values, using 3 different TDA/DDA experimental designs. In samples in which targeted peptide ions were of particularly low abundance (i.e., predominantly only marginally above mass analyser detection limits), TDA/DDA produced statistically significant increases in the number of targeted peptides identified (230 ± 8 versus 80 ± 3 for DDA; = 1.1 × 10) and quantified (35 ± 3 versus 21 ± 2 for DDA; = 0.038) per experiment relative to the use of DDA only. These expected improvements in hypothesis driven data collection were observed alongside unexpected improvements in non-hypothesis driven data collection. Untargeted peptide ions with values matching those in inclusion lists were repeatedly identified and quantified across technical replicate TDA/DDA experiments, resulting in significant increases in the percentages of proteins repeatedly quantified in TDA/DDA experiments only relative to DDA experiments only (33.0 ± 2.6% versus 8.0 ± 2.7%, respectively; = 0.011). These results were observed together with uncompromised broad-scale MS/MS data collection in TDA/DDA experiments relative to DDA experiments. Using our observations we provide guidelines for TDA/DDA method design for quantitative plant proteomics studies, and suggest that TDA/DDA is a broadly underutilized proteomics data acquisition strategy.
定量蛋白质组学策略——在不断扩展的植物分子系统生物学领域发挥着重要作用——传统上被划分为假设驱动型或非假设驱动型。这些策略中的许多旨在选择单个肽离子进行串联质谱分析(MS/MS),从理论上讲,将假设驱动和非假设驱动方法结合起来实施这一操作很简单。然而,尚未有对这些方法有效性的深入研究报道。在本研究中,我们使用未标记和代谢性N标记蛋白质的混合样本,研究了靶向数据采集(TDA)和数据依赖采集(DDA)的混合使用——称为TDA/DDA——以便在单个液相色谱-质谱/质谱(LC-MS/MS)实验中促进假设驱动和非假设驱动的定量数据收集。为了研究用于假设驱动数据收集的TDA/DDA,我们使用由1558个值组成的包含列表,采用3种不同的TDA/DDA实验设计,对7种大小和丰度不同的miRNA靶蛋白进行靶向分析。在靶向肽离子丰度特别低的样本中(即主要仅略高于质量分析器检测限),相对于仅使用DDA,TDA/DDA在每次实验中鉴定出的靶向肽数量(DDA为80±3,TDA/DDA为230±8;P = 1.1×10)和定量的靶向肽数量(DDA为21±2,TDA/DDA为35±3;P = 0.038)在统计学上有显著增加。在假设驱动数据收集方面出现这些预期改进的同时,非假设驱动数据收集也有意外的改进。在TDA/DDA技术重复实验中,m/z值与包含列表中的值匹配的非靶向肽离子被反复鉴定和定量,导致仅在TDA/DDA实验中反复定量的蛋白质百分比相对于仅DDA实验有显著增加(分别为33.0±2.6%和8.0±2.7%;P = 0.011)。这些结果与相对于DDA实验,TDA/DDA实验中未受影响的大规模MS/MS数据收集一同被观察到。基于我们的观察结果,我们为植物定量蛋白质组学研究的TDA/DDA方法设计提供了指导方针,并表明TDA/DDA是一种广泛未被充分利用的蛋白质组学数据采集策略。