Department of Chemical Engineering, Princeton University, Princeton, New Jersey 08544, USA.
Mol Cell Proteomics. 2010 May;9(5):764-79. doi: 10.1074/mcp.M900487-MCP200. Epub 2010 Jan 26.
A novel algorithm, PILOT_PTM, has been developed for the untargeted identification of post-translational modifications (PTMs) on a template sequence. The algorithm consists of an analysis of an MS/MS spectrum via an integer linear optimization model to output a rank-ordered list of PTMs that best match the experimental data. Each MS/MS spectrum is analyzed by a preprocessing algorithm to reduce spectral noise and label potential complimentary, offset, isotope, and multiply charged peaks. Postprocessing of the rank-ordered list from the integer linear optimization model will resolve fragment mass errors and will reorder the list of PTMs based on the cross-correlation between the experimental and theoretical MS/MS spectrum. PILOT_PTM is instrument-independent, capable of handling multiple fragmentation technologies, and can address the universe of PTMs for every amino acid on the template sequence. The various features of PILOT_PTM are presented, and it is tested on several modified and unmodified data sets including chemically synthesized phosphopeptides, histone H3-(1-50) polypeptides, histone H3-(1-50) tryptic fragments, and peptides generated from proteins extracted from chromatin-enriched fractions. The data sets consist of spectra derived from fragmentation via collision-induced dissociation, electron transfer dissociation, and electron capture dissociation. The capability of PILOT_PTM is then benchmarked using five state-of-the-art methods, InsPecT, Virtual Expert Mass Spectrometrist (VEMS), Mod(i), Mascot, and X!Tandem. PILOT_PTM demonstrates superior accuracy on both the small and large scale proteome experiments. A protocol is finally developed for the analysis of a complete LC-MS/MS scan using template sequences generated from SEQUEST and is demonstrated on over 270,000 MS/MS spectra collected from a total chromatin digest.
一种新算法 PILOT_PTM 已经被开发出来,用于在模板序列上对翻译后修饰(PTM)进行非靶向鉴定。该算法包括通过整数线性优化模型对 MS/MS 谱进行分析,以输出与实验数据最佳匹配的 PTM 排序列表。每个 MS/MS 谱都通过预处理算法进行分析,以减少光谱噪声并标记潜在的互补、偏移、同位素和多重电荷峰。整数线性优化模型的排序列表的后处理将解决碎片质量误差,并根据实验和理论 MS/MS 谱之间的互相关重新排列 PTM 列表。PILOT_PTM 是独立于仪器的,能够处理多种碎片化技术,并可以解决模板序列上每个氨基酸的所有 PTM 问题。介绍了 PILOT_PTM 的各种特征,并在包括化学合成的磷酸肽、组蛋白 H3-(1-50)多肽、组蛋白 H3-(1-50)胰蛋白酶片段以及从富含染色质的蛋白质提取物中产生的肽的几个修饰和未修饰数据集上进行了测试。数据集由通过碰撞诱导解离、电子转移解离和电子俘获解离碎片化衍生的光谱组成。然后使用 InsPecT、Virtual Expert Mass Spectrometrist (VEMS)、Mod(i)、Mascot 和 X!Tandem 等五种最先进的方法对 PILOT_PTM 的能力进行基准测试。PILOT_PTM 在小规模和大规模蛋白质组实验中都表现出了卓越的准确性。最后还制定了使用 SEQUEST 生成的模板序列分析完整 LC-MS/MS 扫描的协议,并在从总染色质消化物中收集的超过 270,000 个 MS/MS 谱上进行了演示。