独立于数据库的蛋白质测序（DiPS）可实现全长蛋白质和抗体序列的测定。

Database-independent Protein Sequencing (DiPS) Enables Full-length Protein and Antibody Sequence Determination.

作者信息

Savidor Alon, Barzilay Rotem, Elinger Dalia, Yarden Yosef, Lindzen Moshit, Gabashvili Alexandra, Adiv Tal Ophir, Levin Yishai

机构信息

From ‡The Nancy and Stephen Grand Israel National Center for Personalized Medicine, Weizmann Institute of Science, Rehovot.

the §Department of Biological Regulation, Weizmann Institute of Science, Rehovot, Israel 76100.

出版信息

Mol Cell Proteomics. 2017 Jun;16(6):1151-1161. doi: 10.1074/mcp.O116.065417. Epub 2017 Mar 27.

DOI:10.1074/mcp.O116.065417

PMID:28348172

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5461544/

Abstract

Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence.

摘要

传统的“自下而上”蛋白质组学方法利用蛋白酶解、液相色谱-串联质谱（LC-MS/MS）和数据库搜索来阐明肽段的身份及其母蛋白。数据库中不存在的蛋白质序列无法被识别，而且即使存在于数据库中，对于样品中最丰富的蛋白质，也很少能实现完整的序列覆盖。因此，对未知蛋白质（如抗体或宏蛋白质组的成分）进行测序仍然是一个具有挑战性的问题。迄今为止，还没有一种高通量的、独立于参考数据库的全长蛋白质测序方法。在这里，我们提出了独立于数据库的蛋白质测序方法，这是一种用于明确、快速、独立于数据库的全长蛋白质测序的方法。该方法是蛋白质的非酶促、半随机切割、LC-MS/MS分析、肽段测序、肽段标签提取以及使用名为“肽段标签组装器”的算法将它们组装成一致序列的新颖组合。作为概念验证，该方法应用于代表三种大小类别的三种已知蛋白质的样品以及一种先前未测序的、临床相关的单克隆抗体。排除亮氨酸/异亮氨酸和谷氨酸/脱酰胺谷氨酰胺的模糊性后，所有基准蛋白质和抗体轻链均以99 - 100%的准确率实现了端到端的全长测序。测序的抗体重链（包括整个可变区）的准确率也为100%，但恒定区序列中有一个23个残基的缺口。

相似文献

Database-independent Protein Sequencing (DiPS) Enables Full-length Protein and Antibody Sequence Determination.独立于数据库的蛋白质测序（DiPS）可实现全长蛋白质和抗体序列的测定。

Mol Cell Proteomics. 2017 Jun;16(6):1151-1161. doi: 10.1074/mcp.O116.065417. Epub 2017 Mar 27.

Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?评估蛋白质组学中的从头测序：是否已经成为数据库驱动肽鉴定的准确替代方法？

Brief Bioinform. 2018 Sep 28;19(5):954-970. doi: 10.1093/bib/bbx033.

Automated protein (re)sequencing with MS/MS and a homologous database yields almost full coverage and accuracy.使用串联质谱（MS/MS）和同源数据库进行自动蛋白质（重）测序可实现几乎完全的覆盖范围和准确性。

Bioinformatics. 2009 Sep 1;25(17):2174-80. doi: 10.1093/bioinformatics/btp366. Epub 2009 Jun 17.

Ultrahigh-resolution Fourier transform ion cyclotron resonance mass spectrometry and tandem mass spectrometry for peptide de novo amino acid sequencing for a seven-protein mixture by paired single-residue transposed Lys-N and Lys-C digestion.通过配对单残基转位Lys-N和Lys-C消化，利用超高分辨率傅里叶变换离子回旋共振质谱和串联质谱对七种蛋白质混合物进行肽段从头氨基酸测序。

Rapid Commun Mass Spectrom. 2017 Jan 30;31(2):207-217. doi: 10.1002/rcm.7783.

Combining De Novo Peptide Sequencing Algorithms, A Synergistic Approach to Boost Both Identifications and Confidence in Bottom-up Proteomics.结合从头肽序列算法，协同方法可提高下向蛋白质组学的鉴定数量和置信度。

J Proteome Res. 2017 Sep 1;16(9):3209-3218. doi: 10.1021/acs.jproteome.7b00198. Epub 2017 Aug 22.

Complementary Methods for de Novo Monoclonal Antibody Sequencing to Achieve Complete Sequence Coverage.从头开始的单克隆抗体测序的补充方法，以实现完整的序列覆盖。

J Proteome Res. 2020 Jul 2;19(7):2700-2707. doi: 10.1021/acs.jproteome.0c00223. Epub 2020 May 7.

Application of de Novo Sequencing to Large-Scale Complex Proteomics Data Sets.从头测序在大规模复杂蛋白质组学数据集上的应用。

J Proteome Res. 2016 Mar 4;15(3):732-42. doi: 10.1021/acs.jproteome.5b00861. Epub 2016 Jan 25.

Complete De Novo Assembly of Monoclonal Antibody Sequences.完成单克隆抗体序列的从头组装。

Sci Rep. 2016 Aug 26;6:31730. doi: 10.1038/srep31730.

Proteomics-grade de novo sequencing approach.蛋白质组学级别的从头测序方法。

J Proteome Res. 2005 Nov-Dec;4(6):2348-54. doi: 10.1021/pr050288x.

Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly.从头测序工具对单克隆抗体组装的综合评价。

Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac542.

引用本文的文献

Simultaneous polyclonal antibody sequencing and epitope mapping by cryo electron microscopy and mass spectrometry.通过冷冻电子显微镜和质谱法同时进行多克隆抗体测序和表位定位

Elife. 2025 Apr 23;14:RP101322. doi: 10.7554/eLife.101322.

A Handle on Mass Coincidence Errors in Sequencing of Antibodies by Bottom-up Proteomics.通过自下而上的蛋白质组学对抗体测序中的大量巧合误差进行处理。

J Proteome Res. 2024 Aug 2;23(8):3552-3559. doi: 10.1021/acs.jproteome.4c00188. Epub 2024 Jun 27.

Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly.从头测序工具对单克隆抗体组装的综合评价。

Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac542.

Template-Based Assembly of Proteomic Short Reads For Antibody Sequencing and Repertoire Profiling.基于模板的蛋白质组学短读段组装，用于抗体测序和库特征分析。

Anal Chem. 2022 Jul 26;94(29):10391-10399. doi: 10.1021/acs.analchem.2c01300. Epub 2022 Jul 14.

A perspective toward mass spectrometry-based de novo sequencing of endogenous antibodies.基于质谱的内源性抗体从头测序的研究进展。

MAbs. 2022 Jan-Dec;14(1):2079449. doi: 10.1080/19420862.2022.2079449.

Mass Spectrometry-Based Sequencing of Monoclonal Antibodies Using Multiple Proteases and a Dual Fragmentation Scheme.基于质谱的单克隆抗体的测序使用多种蛋白酶和双重断裂方案。

J Proteome Res. 2021 Jul 2;20(7):3559-3566. doi: 10.1021/acs.jproteome.1c00169. Epub 2021 Jun 14.

ProAlanase is an Effective Alternative to Trypsin for Proteomics Applications and Disulfide Bond Mapping.ProAlanase 是一种用于蛋白质组学应用和二硫键 mapping 的有效胰蛋白酶替代物。

Mol Cell Proteomics. 2020 Dec;19(12):2139-2157. doi: 10.1074/mcp.TIR120.002129. Epub 2020 Oct 5.

Flying blind, or just flying under the radar? The underappreciated power of de novo methods of mass spectrometric peptide identification.盲目飞行，还是只是在雷达下飞行？从头开始的质谱肽鉴定方法的未被充分认识的威力。

Protein Sci. 2020 Sep;29(9):1864-1878. doi: 10.1002/pro.3919. Epub 2020 Aug 17.

2018 YPIC Challenge: A Case Study in Characterizing an Unknown Protein Sample.2018 YPIC 挑战赛：一种未知蛋白质样本特征分析的案例研究。

J Proteome Res. 2019 Nov 1;18(11):3936-3943. doi: 10.1021/acs.jproteome.9b00384. Epub 2019 Oct 7.

Proteomics Pipeline for Identifying Variant Proteins in Parasites Isolated from Children Presenting with Malaria.寄生虫蛋白质组学分析管道鉴定疟疾患儿样本中的变异蛋白

J Proteome Res. 2019 Nov 1;18(11):3831-3839. doi: 10.1021/acs.jproteome.9b00169. Epub 2019 Oct 8.

本文引用的文献

Complete De Novo Assembly of Monoclonal Antibody Sequences.完成单克隆抗体序列的从头组装。

Sci Rep. 2016 Aug 26;6:31730. doi: 10.1038/srep31730.

2016 update of the PRIDE database and its related tools.PRIDE数据库及其相关工具的2016年更新。

Nucleic Acids Res. 2016 Jan 4;44(D1):D447-56. doi: 10.1093/nar/gkv1145. Epub 2015 Nov 2.

De Novo Sequencing of Peptides from Top-Down Tandem Mass Spectra.自上而下串联质谱法对肽段的从头测序

J Proteome Res. 2015 Nov 6;14(11):4450-62. doi: 10.1021/pr501244v. Epub 2015 Oct 13.

Convenient and Precise Strategy for Mapping N-Glycosylation Sites Using Microwave-Assisted Acid Hydrolysis and Characteristic Ions Recognition.一种利用微波辅助酸水解和特征离子识别来绘制N-糖基化位点的便捷精确策略。

Anal Chem. 2015 Aug 4;87(15):7833-9. doi: 10.1021/acs.analchem.5b02177. Epub 2015 Jul 23.

An antibody to amphiregulin, an abundant growth factor in patients' fluids, inhibits ovarian tumors.一种针对双调蛋白（患者体液中一种丰富的生长因子）的抗体可抑制卵巢肿瘤。

Oncogene. 2016 Jan 28;35(4):438-47. doi: 10.1038/onc.2015.93. Epub 2015 Apr 27.

In vitro and in vivo modifications of recombinant and human IgG antibodies.重组和人IgG抗体的体外和体内修饰

MAbs. 2014;6(5):1145-54. doi: 10.4161/mabs.29883. Epub 2014 Oct 30.

Microwave-assisted acid hydrolysis of proteins combined with peptide fractionation and mass spectrometry analysis for characterizing protein terminal sequences.微波辅助蛋白质酸水解结合肽段分级分离和质谱分析用于表征蛋白质末端序列

J Proteomics. 2014 Apr 4;100:68-78. doi: 10.1016/j.jprot.2013.10.014. Epub 2013 Oct 18.

Sequencing-grade de novo analysis of MS/MS triplets (CID/HCD/ETD) from overlapping peptides.从头分析（CID/HCD/ETD）重叠肽的 MS/MS 三重体的测序级分析。

J Proteome Res. 2013 Jun 7;12(6):2846-57. doi: 10.1021/pr400173d. Epub 2013 May 30.

Protein analysis by shotgun/bottom-up proteomics.通过鸟枪法/自下而上蛋白质组学进行蛋白质分析。

Chem Rev. 2013 Apr 10;113(4):2343-94. doi: 10.1021/cr3003533. Epub 2013 Feb 26.

Inhibition of triple-negative breast cancer models by combinations of antibodies to EGFR.三阴性乳腺癌模型中 EGFR 抗体联合抑制作用

Proc Natl Acad Sci U S A. 2013 Jan 29;110(5):1815-20. doi: 10.1073/pnas.1220763110. Epub 2013 Jan 14.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验