School of Medicine, Southeast University, Nanjing 210009, China.
Department of Histology and Embryology, State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China.
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac214.
Multiplexed single-cell proteomes (SCPs) quantification by mass spectrometry greatly improves the SCP coverage. However, it still suffers from a low number of protein identifications and there is much room to boost proteins identification by computational methods. In this study, we present a novel framework DeepSCP, utilizing deep learning to boost SCP coverage. DeepSCP constructs a series of features of peptide-spectrum matches (PSMs) by predicting the retention time based on the multiple SCP sample sets and fragment ion intensities based on deep learning, and predicts PSM labels with an optimized-ensemble learning model. Evaluation of DeepSCP on public and in-house SCP datasets showed superior performances compared with other state-of-the-art methods. DeepSCP identified more confident peptides and proteins by controlling q-value at 0.01 using target-decoy competition method. As a convenient and low-cost computing framework, DeepSCP will help boost single-cell proteome identification and facilitate the future development and application of single-cell proteomics.
通过质谱法进行的多重单细胞蛋白质组学(SCP)定量极大地提高了 SCP 的覆盖范围。然而,它仍然存在蛋白质鉴定数量低的问题,并且通过计算方法提高蛋白质鉴定的空间还很大。在这项研究中,我们提出了一种新的框架 DeepSCP,利用深度学习来提高 SCP 的覆盖范围。DeepSCP 通过基于多个 SCP 样本集预测保留时间和基于深度学习预测碎片离子强度来构建肽谱匹配(PSM)的一系列特征,并使用优化集成学习模型预测 PSM 标签。在公共和内部 SCP 数据集上的 DeepSCP 评估结果表明,与其他最先进的方法相比,DeepSCP 具有更好的性能。DeepSCP 通过使用目标-诱饵竞争方法控制 q 值在 0.01 来识别更有信心的肽和蛋白质。作为一种方便且低成本的计算框架,DeepSCP 将有助于提高单细胞蛋白质组学的鉴定水平,并促进单细胞蛋白质组学的未来发展和应用。