From the ‡T. J. Watson IBM Research Center, 1101 Kitchawan Road, Yorktown Heights, NY 10598.
§Department of Bioengineering, Northeastern University, Boston, MA 02115.
Mol Cell Proteomics. 2019 Jan;18(1):162-168. doi: 10.1074/mcp.TIR118.000947. Epub 2018 Oct 3.
Many proteoforms-arising from alternative splicing, post-translational modifications (PTM), or paralogous genes-have distinct biological functions, such as histone PTM proteoforms. However, their quantification by existing bottom-up mass-spectrometry (MS) methods is undermined by peptide-specific biases. To avoid these biases, we developed and implemented a first-principles model (HI) for quantifying proteoform stoichiometries. We characterized when MS data allow inferring proteoform stoichiometries by HI and derived an algorithm for optimal inference. We applied this algorithm to infer proteoform stoichiometries in two experimental systems that supported rigorous bench-marking: alkylated proteoforms spiked-in at known ratios and endogenous histone 3 PTM proteoforms quantified relative to internal heavy standards. When compared with the benchmarks, the proteoform stoichiometries interfered by HI without using external standards had relative error of 5-15% for simple proteoforms and 20-30% for complex proteoforms. A HI server is implemented at: https://web.northeastern.edu/slavov/2014HIquant/.
许多蛋白异构体是由选择性剪接、翻译后修饰 (PTM) 或基因的旁系同源产生的,具有不同的生物学功能,如组蛋白 PTM 蛋白异构体。然而,现有的基于质谱 (MS) 的方法对其进行定量受到肽特异性偏倚的影响。为了避免这些偏差,我们开发并实现了一种用于定量蛋白异构体丰度的第一性原理模型 (HI)。我们描述了 MS 数据何时可以通过 HI 推断蛋白异构体丰度,并得出了最佳推断的算法。我们将该算法应用于两个支持严格基准测试的实验系统中,以推断内源性组蛋白 3 PTM 蛋白异构体的丰度:以已知比例掺入的烷基化蛋白异构体和相对于内部重标准定量的蛋白异构体。与基准相比,未使用外部标准的 HI 干扰的蛋白异构体丰度的相对误差对于简单的蛋白异构体为 5-15%,对于复杂的蛋白异构体为 20-30%。HI 服务器在以下网址实现:https://web.northeastern.edu/slavov/2014HIquant/。