Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States.
Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia 22903, United States.
J Am Soc Mass Spectrom. 2024 Nov 6;35(11):2614-2630. doi: 10.1021/jasms.4c00119. Epub 2024 Jul 16.
Alternative splicing is a major contributor of transcriptomic complexity, but the extent to which transcript isoforms are translated into stable, functional protein isoforms is unclear. Furthermore, detection of relatively scarce isoform-specific peptides is challenging, with many protein isoforms remaining uncharted due to technical limitations. Recently, a family of advanced targeted MS strategies, termed internal standard parallel reaction monitoring (IS-PRM), have demonstrated multiplexed, sensitive detection of predefined peptides of interest. Such approaches have not yet been used to confirm existence of novel peptides. Here, we present a targeted proteogenomic approach that leverages sample-matched long-read RNA sequencing (lrRNA-seq) data to predict potential protein isoforms with prior transcript evidence. Predicted tryptic isoform-specific peptides, which are specific to individual gene product isoforms, serve as "triggers" and "targets" in the IS-PRM method, Tomahto. Using the model human stem cell line WTC11, LR RNaseq data were generated and used to inform the generation of synthetic standards for 192 isoform-specific peptides (114 isoforms from 55 genes). These synthetic "trigger" peptides were labeled with super heavy tandem mass tags (TMT) and spiked into TMT-labeled WTC11 tryptic digest, predicted to contain corresponding endogenous "target" peptides. Compared to DDA mode, Tomahto increased detectability of isoforms by 3.6-fold, resulting in the identification of five previously unannotated isoforms. Our method detected protein isoform expression for 43 out of 55 genes corresponding to 54 resolved isoforms. This lrRNA-seq-informed Tomahto targeted approach is a new modality for generating protein-level evidence of alternative isoforms─a critical first step in designing functional studies and eventually clinical assays.
可变剪接是转录组复杂性的主要贡献者,但转录本异构体转化为稳定、有功能的蛋白质异构体的程度尚不清楚。此外,由于技术限制,相对稀缺的同工型特异性肽的检测具有挑战性,许多蛋白质同工型仍然未被发现。最近,一类先进的靶向 MS 策略,称为内标平行反应监测(IS-PRM),已经证明了对感兴趣的预定义肽的多重、敏感检测。这些方法尚未用于确认新肽的存在。在这里,我们提出了一种靶向蛋白质基因组学方法,该方法利用样本匹配的长读 RNA 测序(lrRNA-seq)数据来预测具有先前转录证据的潜在蛋白质同工型。预测的胰蛋白酶同工型特异性肽,这些肽特异性地针对单个基因产物同工型,作为 IS-PRM 方法 Tomahto 的“触发器”和“靶标”。使用模型人类干细胞系 WTC11,生成了 LR RNaseq 数据,并用于生成 192 个同工型特异性肽(来自 55 个基因的 114 个同工型)的合成标准品。这些合成的“触发”肽用超重串联质量标签(TMT)标记,并掺入到 TMT 标记的 WTC11 胰蛋白酶消化物中,预测其中含有相应的内源性“靶”肽。与 DDA 模式相比,Tomahto 将同工型的可检测性提高了 3.6 倍,从而鉴定了五个以前未注释的同工型。我们的方法检测到 55 个基因中的 43 个基因对应 54 个解析同工型的蛋白质同工型表达。这种基于 lrRNA-seq 的 Tomahto 靶向方法是生成替代同工型蛋白质水平证据的新方法——这是设计功能研究并最终临床检测的关键第一步。