iHuman Institute, ShanghaiTech University, Shanghai, 201210, China.
School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China.
Nat Commun. 2021 Nov 18;12(1):6685. doi: 10.1038/s41467-021-26979-1.
Phosphoproteomics integrating data-independent acquisition (DIA) enables deep phosphoproteome profiling with improved quantification reproducibility and accuracy compared to data-dependent acquisition (DDA)-based phosphoproteomics. DIA data mining heavily relies on a spectral library that in most cases is built on DDA analysis of the same sample. Construction of this project-specific DDA library impairs the analytical throughput, limits the proteome coverage, and increases the sample size for DIA phosphoproteomics. Herein we introduce a deep neural network, DeepPhospho, which conceptually differs from previous deep learning models to achieve accurate predictions of LC-MS/MS data for phosphopeptides. By leveraging in silico libraries generated by DeepPhospho, we establish a DIA workflow for phosphoproteome profiling which involves DIA data acquisition and data mining with DeepPhospho predicted libraries, thus circumventing the need of DDA library construction. Our DeepPhospho-empowered workflow substantially expands the phosphoproteome coverage while maintaining high quantification performance, which leads to the discovery of more signaling pathways and regulated kinases in an EGF signaling study than the DDA library-based approach. DeepPhospho is provided as a web server as well as an offline app to facilitate user access to model training, predictions and library generation.
磷酸化蛋白质组学整合了数据非依赖性采集(DIA),与基于数据依赖性采集(DDA)的磷酸化蛋白质组学相比,能够更深入地进行磷酸化蛋白质组分析,并提高定量重现性和准确性。DIA 数据挖掘严重依赖于光谱库,而在大多数情况下,该光谱库是基于对相同样本的 DDA 分析构建的。构建这个特定于项目的 DDA 库会降低分析通量,限制蛋白质组覆盖率,并增加 DIA 磷酸化蛋白质组学的样本量。在此,我们介绍了一个深度神经网络 DeepPhospho,它与以前的深度学习模型在概念上有所不同,能够实现对 LC-MS/MS 数据中磷酸肽的准确预测。通过利用 DeepPhospho 生成的计算机库,我们建立了一种用于磷酸化蛋白质组分析的 DIA 工作流程,该流程涉及 DIA 数据采集和使用 DeepPhospho 预测库进行数据挖掘,从而避免了 DDA 库构建的需要。我们的 DeepPhospho 赋能工作流程在保持高定量性能的同时,大大扩展了磷酸化蛋白质组的覆盖范围,这使得在 EGF 信号研究中发现了更多的信号通路和调控激酶,比基于 DDA 库的方法更多。DeepPhospho 提供了一个网络服务器和一个离线应用程序,方便用户访问模型训练、预测和库生成。