Bidirectional de novo peptide sequencing using a transformer model.

Affiliation

Center for Biomedical Computing, Korea Institute of Science and Technology Information, Daejeon, Republic of Korea.

Publication information

PLoS Comput Biol. 2024 Feb 28;20(2):e1011892. doi: 10.1371/journal.pcbi.1011892. eCollection 2024 Feb.

DOI: 10.1371/journal.pcbi.1011892
PMID: 38416757
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10901305/
Abstract

In proteomics, a crucial aspect is to identify peptide sequences. De novo sequencing methods have been widely employed to identify peptide sequences, and numerous tools have been proposed over the past two decades. Recently, deep learning approaches have been introduced for de novo sequencing. Previous methods focused on encoding tandem mass spectra and predicting peptide sequences from the first amino acid onwards. However, when predicting peptides using tandem mass spectra, the peptide sequence can be predicted not only from the first amino acid but also from the last amino acid due to the coexistence of b-ion (or a- or c-ion) and y-ion (or x- or z-ion) fragments in the tandem mass spectra. Therefore, it is essential to predict peptide sequences bidirectionally. Our approach, called NovoB, utilizes a Transformer model to predict peptide sequences bidirectionally, starting with both the first and last amino acids. In comparison to Casanovo, our method achieved an improvement of the average peptide-level accuracy rate of approximately 9.8% across all species.
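The motivation for bidirectional prediction can be illustrated with a small, self-contained sketch. This is not NovoB or any part of its Transformer model; it only shows why a tandem mass spectrum can in principle be read from either end. The b-ion ladder grows from the N-terminus and the y-ion ladder grows from the C-terminus, so the mass gaps in each ladder spell out the same residues in opposite directions. The example peptide, the function names, and the 0.01 Da tolerance are assumptions made for illustration; the residue masses are standard monoisotopic values, and only singly charged, unmodified ions are considered.

```python
# Monoisotopic residue masses in daltons (standard values, rounded to 5 decimals).
# L and I are isobaric, so mass gaps alone cannot tell them apart.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056   # mass of H2O
PROTON = 1.00728   # mass of the charge-carrying proton


def b_ions(peptide):
    """Singly charged b-ion m/z values b1..b(n-1), built from the N-terminus."""
    mzs, total = [], PROTON
    for aa in peptide[:-1]:
        total += RESIDUE_MASS[aa]
        mzs.append(total)
    return mzs


def y_ions(peptide):
    """Singly charged y-ion m/z values y1..y(n-1), built from the C-terminus."""
    mzs, total = [], WATER + PROTON
    for aa in reversed(peptide[1:]):
        total += RESIDUE_MASS[aa]
        mzs.append(total)
    return mzs


def read_ladder(mzs, tol=0.01):
    """Turn gaps between consecutive ladder peaks back into residue letters."""
    letters = []
    for low, high in zip(mzs, mzs[1:]):
        gap = high - low
        # Pick the residue whose mass is closest to the observed gap.
        aa = min(RESIDUE_MASS, key=lambda r: abs(RESIDUE_MASS[r] - gap))
        letters.append(aa if abs(RESIDUE_MASS[aa] - gap) <= tol else "?")
    return "".join(letters)


peptide = "PEPTADK"  # hypothetical example peptide
print(read_ladder(b_ions(peptide)))  # inner residues, read from the N-terminal side
print(read_ladder(y_ions(peptide)))  # the same residues, read from the C-terminal side
```

Real spectra contain noise, missing peaks, and multiple charge states, which is why learned models are used rather than a direct ladder readout like this one.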
