Suppr超能文献

自上而下串联质谱法对肽段的从头测序

De Novo Sequencing of Peptides from Top-Down Tandem Mass Spectra.

作者信息

Vyatkina Kira, Wu Si, Dekker Lennard J M, VanDuijn Martijn M, Liu Xiaowen, Tolić Nikola, Dvorkin Mikhail, Alexandrova Sonya, Luider Theo M, Paša-Tolić Ljiljana, Pevzner Pavel A

机构信息

Algorithmic Biology Laboratory, Saint Petersburg Academic University , 8/3 Khlopina Str, Saint Petersburg 194021, Russia.

Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University , 7-9 Universitetskaya nab., Saint Petersburg 199034, Russia.

出版信息

J Proteome Res. 2015 Nov 6;14(11):4450-62. doi: 10.1021/pr501244v. Epub 2015 Oct 13.

Abstract

De novo sequencing of proteins and peptides is one of the most important problems in mass spectrometry-driven proteomics. A variety of methods have been developed to accomplish this task from a set of bottom-up tandem (MS/MS) mass spectra. However, a more recently emerged top-down technology, now gaining more and more popularity, opens new perspectives for protein analysis and characterization, implying a need for efficient algorithms to process this kind of MS/MS data. Here, we describe a method that allows for the retrieval, from a set of top-down MS/MS spectra, of long and accurate sequence fragments of the proteins contained in the sample. To this end, we outline a strategy for generating high-quality sequence tags from top-down spectra, and introduce the concept of a T-Bruijn graph by adapting to the case of tags the notion of an A-Bruijn graph widely used in genomics. The output of the proposed approach represents the set of amino acid strings spelled out by optimal paths in the connected components of a T-Bruijn graph. We illustrate its performance on top-down data sets acquired from carbonic anhydrase 2 (CAH2) and the Fab region of alemtuzumab.

摘要

蛋白质和肽段的从头测序是质谱驱动的蛋白质组学中最重要的问题之一。已经开发了多种方法来从一组自下而上的串联(MS/MS)质谱完成这项任务。然而,一种最近出现且越来越受欢迎的自上而下技术为蛋白质分析和表征开辟了新的视角,这意味着需要高效算法来处理这类MS/MS数据。在此,我们描述了一种方法,该方法能够从一组自上而下的MS/MS谱图中检索出样品中所含蛋白质的长且准确的序列片段。为此,我们概述了一种从自上而下的谱图生成高质量序列标签的策略,并通过将基因组学中广泛使用的A - Bruijn图的概念应用于标签的情况,引入了T - Bruijn图的概念。所提出方法的输出表示由T - Bruijn图的连通分量中的最优路径所拼出的氨基酸串集合。我们展示了其在从碳酸酐酶2(CAH2)和阿仑单抗的Fab区域获取的自上而下数据集上的性能。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验