• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从头测序工具对单克隆抗体组装的综合评价。

Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly.

机构信息

Robert Koch Institute, MF1, Nordufer 20, 13353 Berlin.

Federal Institute for Materials Research and Testing (BAM), Richard-Willstätter-Straße 11, 12489 Berlin.

出版信息

Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac542.

DOI:10.1093/bib/bbac542
PMID:36545804
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9851299/
Abstract

Monoclonal antibodies are biotechnologically produced proteins with various applications in research, therapeutics and diagnostics. Their ability to recognize and bind to specific molecule structures makes them essential research tools and therapeutic agents. Sequence information of antibodies is helpful for understanding antibody-antigen interactions and ensuring their affinity and specificity. De novo protein sequencing based on mass spectrometry is a valuable method to obtain the amino acid sequence of peptides and proteins without a priori knowledge. In this study, we evaluated six recently developed de novo peptide sequencing algorithms (Novor, pNovo 3, DeepNovo, SMSNet, PointNovo and Casanovo), which were not specifically designed for antibody data. We validated their ability to identify and assemble antibody sequences on three multi-enzymatic data sets. The deep learning-based tools Casanovo and PointNovo showed an increased peptide recall across different enzymes and data sets compared with spectrum-graph-based approaches. We evaluated different error types of de novo peptide sequencing tools and their performance for different numbers of missing cleavage sites, noisy spectra and peptides of various lengths. We achieved a sequence coverage of 97.69-99.53% on the light chains of three different antibody data sets using the de Bruijn assembler ALPS and the predictions from Casanovo. However, low sequence coverage and accuracy on the heavy chains demonstrate that complete de novo protein sequencing remains a challenging issue in proteomics that requires improved de novo error correction, alternative digestion strategies and hybrid approaches such as homology search to achieve high accuracy on long protein sequences.

摘要

单克隆抗体是生物技术生产的蛋白质,具有广泛的应用于研究、治疗和诊断。它们识别和结合特定分子结构的能力使它们成为重要的研究工具和治疗剂。抗体的序列信息有助于了解抗体-抗原相互作用,并确保其亲和力和特异性。基于质谱的从头蛋白质测序是一种获得肽和蛋白质氨基酸序列的有价值的方法,无需先验知识。在这项研究中,我们评估了六种最近开发的从头多肽测序算法(Novor、pNovo 3、DeepNovo、SMSNet、PointNovo 和 Casanovo),这些算法并非专门为抗体数据设计。我们验证了它们在三个多酶数据集上识别和组装抗体序列的能力。基于深度学习的工具 Casanovo 和 PointNovo 与基于谱图的方法相比,在不同的酶和数据集上提高了肽的召回率。我们评估了从头多肽测序工具的不同错误类型及其在不同数量的缺失切割位点、嘈杂光谱和不同长度肽的性能。我们使用 de Bruijn 组装器 ALPS 和 Casanovo 的预测,在三个不同的抗体数据集的轻链上实现了 97.69-99.53%的序列覆盖率。然而,重链上的低序列覆盖率和准确性表明,完整的从头蛋白质测序仍然是蛋白质组学中的一个具有挑战性的问题,需要改进从头错误纠正、替代消化策略和混合方法,如同源搜索,以实现长蛋白质序列的高精度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5283/9851299/465b87483dbf/bbac542f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5283/9851299/bfa5b87a13b0/bbac542f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5283/9851299/a283a5e8be28/bbac542f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5283/9851299/f29b5775804a/bbac542f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5283/9851299/465b87483dbf/bbac542f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5283/9851299/bfa5b87a13b0/bbac542f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5283/9851299/a283a5e8be28/bbac542f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5283/9851299/f29b5775804a/bbac542f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5283/9851299/465b87483dbf/bbac542f4.jpg

相似文献

1
Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly.从头测序工具对单克隆抗体组装的综合评价。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac542.
2
Complete De Novo Assembly of Monoclonal Antibody Sequences.完成单克隆抗体序列的从头组装。
Sci Rep. 2016 Aug 26;6:31730. doi: 10.1038/srep31730.
3
Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?评估蛋白质组学中的从头测序:是否已经成为数据库驱动肽鉴定的准确替代方法?
Brief Bioinform. 2018 Sep 28;19(5):954-970. doi: 10.1093/bib/bbx033.
4
PowerNovo: de novo peptide sequencing via tandem mass spectrometry using an ensemble of transformer and BERT models.PowerNovo:基于Transformer 和 BERT 模型集的串联质谱新肽测序。
Sci Rep. 2024 Jul 1;14(1):15000. doi: 10.1038/s41598-024-65861-0.
5
A Handle on Mass Coincidence Errors in Sequencing of Antibodies by Bottom-up Proteomics.通过自下而上的蛋白质组学对抗体测序中的大量巧合误差进行处理。
J Proteome Res. 2024 Aug 2;23(8):3552-3559. doi: 10.1021/acs.jproteome.4c00188. Epub 2024 Jun 27.
6
pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework.pNovo 3:基于排序学习的从头肽序列精确分析。
Bioinformatics. 2019 Jul 15;35(14):i183-i190. doi: 10.1093/bioinformatics/btz366.
7
Accurate de novo peptide sequencing using fully convolutional neural networks.利用全卷积神经网络进行精确从头肽测序。
Nat Commun. 2023 Dec 2;14(1):7974. doi: 10.1038/s41467-023-43010-x.
8
pNovo+: de novo peptide sequencing using complementary HCD and ETD tandem mass spectra.pNovo+:使用互补的 HCD 和 ETD 串联质谱进行从头多肽测序。
J Proteome Res. 2013 Feb 1;12(2):615-25. doi: 10.1021/pr3006843. Epub 2012 Dec 28.
9
Antibody sequences assembly method based on weighted de Bruijn graph.基于加权 de Bruijn 图的抗体序列组装方法。
Math Biosci Eng. 2023 Jan 31;20(4):6174-6190. doi: 10.3934/mbe.2023266.
10
Bidirectional de novo peptide sequencing using a transformer model.基于 Transformer 模型的双向从头肽测序。
PLoS Comput Biol. 2024 Feb 28;20(2):e1011892. doi: 10.1371/journal.pcbi.1011892. eCollection 2024 Feb.

引用本文的文献

1
De novo peptide databases enable protein-based stable isotope probing of microbial communities with up to species-level resolution.从头合成肽数据库能够对微生物群落进行基于蛋白质的稳定同位素探测,分辨率可达物种水平。
Environ Microbiome. 2025 Aug 26;20(1):111. doi: 10.1186/s40793-025-00767-6.
2
π-PrimeNovo: an accurate and efficient non-autoregressive deep learning model for de novo peptide sequencing.π-PrimeNovo:一种用于从头肽测序的准确高效的非自回归深度学习模型。
Nat Commun. 2025 Jan 2;16(1):267. doi: 10.1038/s41467-024-55021-3.
3
NovoLign: metaproteomics by sequence alignment.

本文引用的文献

1
Accurate de novo peptide sequencing using fully convolutional neural networks.利用全卷积神经网络进行精确从头肽测序。
Nat Commun. 2023 Dec 2;14(1):7974. doi: 10.1038/s41467-023-43010-x.
2
AbLang: an antibody language model for completing antibody sequences.AbLang:一种用于完成抗体序列的抗体语言模型。
Bioinform Adv. 2022 Jun 17;2(1):vbac046. doi: 10.1093/bioadv/vbac046. eCollection 2022.
3
Multienzyme deep learning models improve peptide de novo sequencing by mass spectrometry proteomics.多酶深度学习模型通过质谱蛋白质组学提高肽从头测序。
NovoLign:通过序列比对进行宏蛋白质组学分析。
ISME Commun. 2024 Oct 12;4(1):ycae121. doi: 10.1093/ismeco/ycae121. eCollection 2024 Jan.
4
Revamping Hepatocellular Carcinoma Immunotherapy: The Advent of Microbial Neoantigen Vaccines.重塑肝细胞癌免疫疗法:微生物新抗原疫苗的问世。
Vaccines (Basel). 2024 Aug 21;12(8):930. doi: 10.3390/vaccines12080930.
5
Exploring the dynamic landscape of immunopeptidomics: Unravelling posttranslational modifications and navigating bioinformatics terrain.探索免疫肽组学的动态格局:揭示翻译后修饰并跨越生物信息学领域。
Mass Spectrom Rev. 2025 Jul-Aug;44(4):599-629. doi: 10.1002/mas.21905. Epub 2024 Aug 16.
6
Sequence-to-sequence translation from mass spectra to peptides with a transformer model.基于 Transformer 模型的从质谱到肽的序列到序列翻译。
Nat Commun. 2024 Jul 30;15(1):6427. doi: 10.1038/s41467-024-49731-x.
7
PowerNovo: de novo peptide sequencing via tandem mass spectrometry using an ensemble of transformer and BERT models.PowerNovo:基于Transformer 和 BERT 模型集的串联质谱新肽测序。
Sci Rep. 2024 Jul 1;14(1):15000. doi: 10.1038/s41598-024-65861-0.
8
Transformer-based de novo peptide sequencing for data-independent acquisition mass spectrometry.基于Transformer的从头肽段测序用于非数据依赖采集质谱分析
Proc IEEE Int Symp Bioinformatics Bioeng. 2023 Dec;2023:28-35. doi: 10.1109/bibe60311.2023.00013. Epub 2024 Feb 19.
9
Transformer-based de novo peptide sequencing for data-independent acquisition mass spectrometry.基于Transformer的从头肽段测序用于数据非依赖采集质谱分析
ArXiv. 2024 Jun 26:arXiv:2402.11363v3.
10
Introducing π-HelixNovo for practical large-scale de novo peptide sequencing.介绍用于实际大规模从头多肽测序的 π-螺旋 Novo。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae021.
PLoS Comput Biol. 2023 Jan 20;19(1):e1010457. doi: 10.1371/journal.pcbi.1010457. eCollection 2023 Jan.
4
Template-Based Assembly of Proteomic Short Reads For Antibody Sequencing and Repertoire Profiling.基于模板的蛋白质组学短读段组装,用于抗体测序和库特征分析。
Anal Chem. 2022 Jul 26;94(29):10391-10399. doi: 10.1021/acs.analchem.2c01300. Epub 2022 Jul 14.
5
Accurate discrimination of leucine and isoleucine residues by combining continuous digestion with multiple MS spectra integration in protein sequence.通过结合连续消化和多个 MS 谱图积分,在蛋白质序列中准确区分亮氨酸和异亮氨酸残基。
Talanta. 2022 Nov 1;249:123666. doi: 10.1016/j.talanta.2022.123666. Epub 2022 Jun 14.
6
A perspective toward mass spectrometry-based de novo sequencing of endogenous antibodies.基于质谱的内源性抗体从头测序的研究进展。
MAbs. 2022 Jan-Dec;14(1):2079449. doi: 10.1080/19420862.2022.2079449.
7
The impact of noise and missing fragmentation cleavages on peptide identification algorithms.噪声和缺失片段裂解对肽段鉴定算法的影响。
Comput Struct Biotechnol J. 2022 Mar 19;20:1402-1412. doi: 10.1016/j.csbj.2022.03.008. eCollection 2022.
8
Highly Robust Full-Length Protein Sequencing.高度稳健的全长蛋白质测序。
Anal Chem. 2022 Mar 1;94(8):3467-3475. doi: 10.1021/acs.analchem.1c03718. Epub 2022 Feb 16.
9
Progress and challenges in mass spectrometry-based analysis of antibody repertoires.基于质谱的抗体库分析的进展与挑战
Trends Biotechnol. 2022 Apr;40(4):463-481. doi: 10.1016/j.tibtech.2021.08.006. Epub 2021 Sep 14.
10
Mass Spectrometry-Based Sequencing of Monoclonal Antibodies Using Multiple Proteases and a Dual Fragmentation Scheme.基于质谱的单克隆抗体的测序使用多种蛋白酶和双重断裂方案。
J Proteome Res. 2021 Jul 2;20(7):3559-3566. doi: 10.1021/acs.jproteome.1c00169. Epub 2021 Jun 14.