• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

π-PrimeNovo:一种用于从头肽测序的准确高效的非自回归深度学习模型。

π-PrimeNovo: an accurate and efficient non-autoregressive deep learning model for de novo peptide sequencing.

作者信息

Zhang Xiang, Ling Tianze, Jin Zhi, Xu Sheng, Gao Zhiqiang, Sun Boyan, Qiu Zijie, Wei Jiaqi, Dong Nanqing, Wang Guangshuai, Wang Guibin, Li Leyuan, Abdul-Mageed Muhammad, Lakshmanan Laks V S, He Fuchu, Ouyang Wanli, Chang Cheng, Sun Siqi

机构信息

Shanghai Artificial Intelligence Laboratory, Shanghai, China.

University of British Columbia, Vancouver, BC, Canada.

出版信息

Nat Commun. 2025 Jan 2;16(1):267. doi: 10.1038/s41467-024-55021-3.

DOI:10.1038/s41467-024-55021-3
PMID:39747823
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11695716/
Abstract

Peptide sequencing via tandem mass spectrometry (MS/MS) is essential in proteomics. Unlike traditional database searches, deep learning excels at de novo peptide sequencing, even for peptides missing from existing databases. Current deep learning models often rely on autoregressive generation, which suffers from error accumulation and slow inference speeds. In this work, we introduce π-PrimeNovo, a non-autoregressive Transformer-based model for peptide sequencing. With our architecture design and a CUDA-enhanced decoding module for precise mass control, π-PrimeNovo achieves significantly higher accuracy and up to 89x faster inference than state-of-the-art methods, making it ideal for large-scale applications like metaproteomics. Additionally, it excels in phosphopeptide mining and detecting low-abundance post-translational modifications (PTMs), marking a substantial advance in peptide sequencing with broad potential in biological research.

摘要

通过串联质谱(MS/MS)进行肽段测序在蛋白质组学中至关重要。与传统的数据库搜索不同,深度学习在从头肽段测序方面表现出色,即使对于现有数据库中缺失的肽段也是如此。当前的深度学习模型通常依赖自回归生成,这存在误差累积和推理速度慢的问题。在这项工作中,我们引入了π-PrimeNovo,这是一种基于非自回归Transformer的肽段测序模型。通过我们的架构设计和用于精确质量控制的CUDA增强解码模块,π-PrimeNovo实现了比现有方法显著更高的准确性,推理速度快达89倍,使其成为宏蛋白质组学等大规模应用的理想选择。此外,它在磷酸肽挖掘和检测低丰度翻译后修饰(PTM)方面表现出色,标志着肽段测序取得了重大进展,在生物学研究中具有广泛的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/7d74c2ac3c61/41467_2024_55021_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/1fee24bc37d9/41467_2024_55021_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/3c92eceb0f59/41467_2024_55021_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/915a65f00b92/41467_2024_55021_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/b36416a3b94c/41467_2024_55021_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/8ccfb66c8226/41467_2024_55021_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/7d74c2ac3c61/41467_2024_55021_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/1fee24bc37d9/41467_2024_55021_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/3c92eceb0f59/41467_2024_55021_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/915a65f00b92/41467_2024_55021_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/b36416a3b94c/41467_2024_55021_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/8ccfb66c8226/41467_2024_55021_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c91b/11695716/7d74c2ac3c61/41467_2024_55021_Fig6_HTML.jpg

相似文献

1
π-PrimeNovo: an accurate and efficient non-autoregressive deep learning model for de novo peptide sequencing.π-PrimeNovo:一种用于从头肽测序的准确高效的非自回归深度学习模型。
Nat Commun. 2025 Jan 2;16(1):267. doi: 10.1038/s41467-024-55021-3.
2
Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?评估蛋白质组学中的从头测序:是否已经成为数据库驱动肽鉴定的准确替代方法?
Brief Bioinform. 2018 Sep 28;19(5):954-970. doi: 10.1093/bib/bbx033.
3
Computational Methods in Mass Spectrometry-Based Proteomics.基于质谱的蛋白质组学中的计算方法
Adv Exp Med Biol. 2016;939:63-89. doi: 10.1007/978-981-10-1503-8_4.
4
NovoRank: Refinement for Peptide Sequencing Based on Spectral Clustering and Deep Learning.NovoRank:基于谱聚类和深度学习的肽段测序优化方法
J Proteome Res. 2025 Feb 7;24(2):903-910. doi: 10.1021/acs.jproteome.4c00300. Epub 2024 Dec 31.
5
De novo sequencing with limited number of post-translational modifications per peptide.对每个肽段具有有限数量翻译后修饰的从头测序。
J Bioinform Comput Biol. 2013 Aug;11(4):1350007. doi: 10.1142/S0219720013500078. Epub 2013 Apr 11.
6
A transformer-based semi-autoregressive framework for high-speed and accurate de novo peptide sequencing.一种基于变压器的半自回归框架,用于高速且准确的从头肽测序。
Commun Biol. 2025 Feb 14;8(1):234. doi: 10.1038/s42003-025-07584-0.
7
Application of de Novo Sequencing to Large-Scale Complex Proteomics Data Sets.从头测序在大规模复杂蛋白质组学数据集上的应用。
J Proteome Res. 2016 Mar 4;15(3):732-42. doi: 10.1021/acs.jproteome.5b00861. Epub 2016 Jan 25.
8
De novo sequencing methods in proteomics.蛋白质组学中的从头测序方法。
Methods Mol Biol. 2010;604:105-21. doi: 10.1007/978-1-60761-444-9_8.
9
Algorithms for the de novo sequencing of peptides from tandem mass spectra.串联质谱肽从头测序的算法。
Expert Rev Proteomics. 2011 Oct;8(5):645-57. doi: 10.1586/epr.11.54.
10
Recent Developments in Computational Methods for De Novo Peptide Sequencing from Tandem Mass Spectrometry (MS/MS).串联质谱(MS/MS)从头肽测序计算方法的最新进展
Protein Pept Lett. 2015;22(11):983-91. doi: 10.2174/0929866522666150821113127.

本文引用的文献

1
Sequence-to-sequence translation from mass spectra to peptides with a transformer model.基于 Transformer 模型的从质谱到肽的序列到序列翻译。
Nat Commun. 2024 Jul 30;15(1):6427. doi: 10.1038/s41467-024-49731-x.
2
Introducing π-HelixNovo for practical large-scale de novo peptide sequencing.介绍用于实际大规模从头多肽测序的 π-螺旋 Novo。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae021.
3
Deep learning-driven fragment ion series classification enables highly precise and sensitive de novo peptide sequencing.
深度学习驱动的碎片离子序列分类可实现高度精确和敏感的从头肽测序。
Nat Commun. 2024 Jan 2;15(1):151. doi: 10.1038/s41467-023-44323-7.
4
Accurate de novo peptide sequencing using fully convolutional neural networks.利用全卷积神经网络进行精确从头肽测序。
Nat Commun. 2023 Dec 2;14(1):7974. doi: 10.1038/s41467-023-43010-x.
5
Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly.从头测序工具对单克隆抗体组装的综合评价。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac542.
6
DeepPhospho accelerates DIA phosphoproteome profiling through in silico library generation.DeepPhospho 通过计算机文库生成加速 DIA 磷酸化蛋白质组学分析。
Nat Commun. 2021 Nov 18;12(1):6685. doi: 10.1038/s41467-021-26979-1.
7
Integrative Proteomic Characterization of Human Lung Adenocarcinoma.人类肺腺癌的综合蛋白质组学特征分析。
Cell. 2020 Jul 9;182(1):245-261.e17. doi: 10.1016/j.cell.2020.05.043.
8
The proteome landscape of the kingdoms of life.生命王国的蛋白质组全景
Nature. 2020 Jun;582(7813):592-596. doi: 10.1038/s41586-020-2402-x. Epub 2020 Jun 17.
9
Uncovering Thousands of New Peptides with Sequence-Mask-Search Hybrid Peptide Sequencing Framework.利用序列掩膜搜索混合肽测序框架揭示数千种新肽。
Mol Cell Proteomics. 2019 Dec;18(12):2478-2491. doi: 10.1074/mcp.TIR119.001656. Epub 2019 Oct 7.
10
Interspecies Competition Impacts Targeted Manipulation of Human Gut Bacteria by Fiber-Derived Glycans.种间竞争影响膳食纤维衍生聚糖靶向人体肠道细菌的操纵。
Cell. 2019 Sep 19;179(1):59-73.e13. doi: 10.1016/j.cell.2019.08.011.