Deconvolution and database search of complex tandem mass spectra of intact proteins: a combinatorial approach.

Affiliation

Department of Computer Science and Engineering, University of California, San Diego, California 92093, USA.

Publication information

Mol Cell Proteomics. 2010 Dec;9(12):2772-82. doi: 10.1074/mcp.M110.002766. Epub 2010 Sep 20.

Abstract

Top-down proteomics studies intact proteins, enabling new opportunities for analyzing post-translational modifications. Because tandem mass spectra of intact proteins are very complex, spectral deconvolution (grouping peaks into isotopomer envelopes) is a key initial stage for their interpretation. In such spectra, isotopomer envelopes of different protein fragments span overlapping regions on the m/z axis and even share spectral peaks. This raises both pattern recognition and combinatorial challenges for spectral deconvolution. We present MS-Deconv, a combinatorial algorithm for spectral deconvolution. The algorithm first generates a large set of candidate isotopomer envelopes for a spectrum, then represents the spectrum as a graph, and finally selects its highest scoring subset of envelopes as a heaviest path in the graph. In contrast with other approaches, the algorithm scores sets of envelopes rather than individual envelopes. We demonstrate that MS-Deconv improves on Thrash and Xtract in the number of correctly recovered monoisotopic masses and speed. We applied MS-Deconv to a large set of top-down spectra from Yersinia rohdei (with a still unsequenced genome) and further matched them against the protein database of related and sequenced bacterium Yersinia enterocolitica. MS-Deconv is available at http://proteomics.ucsd.edu/Software.html.
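The final step described in the abstract, selecting the highest scoring set of envelopes as a heaviest path in a graph, can be illustrated with a deliberately simplified sketch. It is not the MS-Deconv graph construction, which the abstract does not specify. As an assumption made only for illustration, each candidate envelope is reduced to an m/z interval with a score, and two envelopes are joined by an edge when the first ends before the second begins. The abstract notes that real envelopes may overlap and share peaks, which this toy model ignores. The names `Envelope` and `heaviest_path` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Envelope:
    """Hypothetical candidate envelope: an m/z interval plus a score."""
    start_mz: float
    end_mz: float
    score: float


def heaviest_path(envelopes):
    """Return (total_score, chain) for the best chain of compatible envelopes.

    Toy compatibility rule (an assumption): envelope a may precede
    envelope b when a.end_mz <= b.start_mz. Sorting by (start, end)
    gives a topological order of the resulting directed acyclic graph,
    so a single dynamic-programming pass finds the heaviest path.
    """
    order = sorted(envelopes, key=lambda e: (e.start_mz, e.end_mz))
    n = len(order)
    if n == 0:
        return 0.0, []

    best = [e.score for e in order]  # best[j]: heaviest path ending at j
    prev = [-1] * n                  # predecessor of j on that path

    for j in range(n):
        for i in range(j):
            compatible = order[i].end_mz <= order[j].start_mz
            if compatible and best[i] + order[j].score > best[j]:
                best[j] = best[i] + order[j].score
                prev[j] = i

    # Walk back from the node where the heaviest path ends.
    j = max(range(n), key=best.__getitem__)
    total = best[j]
    chain = []
    while j != -1:
        chain.append(order[j])
        j = prev[j]
    chain.reverse()
    return total, chain
```

Scoring a whole chain rather than each envelope in isolation mirrors the abstract's point that the algorithm scores sets of envelopes: a high-scoring envelope can be dropped if it blocks a better combination.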

Similar articles

2
A new scoring function for top-down spectral deconvolution.
BMC Genomics. 2014 Dec 18;15(1):1140. doi: 10.1186/1471-2164-15-1140.
3
Enhancing Top-Down Proteomics Data Analysis by Combining Deconvolution Results through a Machine Learning Strategy.
J Am Soc Mass Spectrom. 2020 May 6;31(5):1104-1113. doi: 10.1021/jasms.0c00035. Epub 2020 Apr 8.
4
The spectral networks paradigm in high throughput mass spectrometry.
Mol Biosyst. 2012 Oct;8(10):2535-44. doi: 10.1039/c2mb25085c.
5
EnvCNN: A Convolutional Neural Network Model for Evaluating Isotopic Envelopes in Top-Down Mass-Spectral Deconvolution.
Anal Chem. 2020 Jun 2;92(11):7778-7785. doi: 10.1021/acs.analchem.0c00903. Epub 2020 May 13.
7
Protein identification by spectral networks analysis.
Proc Natl Acad Sci U S A. 2007 Apr 10;104(15):6140-5. doi: 10.1073/pnas.0701130104. Epub 2007 Apr 2.
9
pClean: An Algorithm To Preprocess High-Resolution Tandem Mass Spectra for Database Searching.
J Proteome Res. 2019 Sep 6;18(9):3235-3244. doi: 10.1021/acs.jproteome.9b00141. Epub 2019 Aug 14.
10
Interpreting raw biological mass spectra using isotopic mass-to-charge ratio and envelope fingerprinting.
Rapid Commun Mass Spectrom. 2013 Jun 15;27(11):1267-77. doi: 10.1002/rcm.6565.

Cited by

1
A Fast Neural Network for Isotopic Charge State Assignment.
J Am Chem Soc. 2025 Jun 25;147(25):21610-21620. doi: 10.1021/jacs.5c03162. Epub 2025 Jun 10.
2
FAST MS: Software for the Automated Analysis of Top-Down Mass Spectra of Polymeric Molecules Including RNA, DNA, and Proteins.
J Am Soc Mass Spectrom. 2025 Feb 5;36(2):247-257. doi: 10.1021/jasms.4c00236. Epub 2024 Dec 23.
3
TopDIA: A Software Tool for Top-Down Data-Independent Acquisition Proteomics.
J Proteome Res. 2025 Jan 3;24(1):55-64. doi: 10.1021/acs.jproteome.4c00293. Epub 2024 Dec 6.
4
Improving Top-Down Sequence Coverage with Targeted Fragment Matching.
J Am Soc Mass Spectrom. 2024 Dec 4;35(12):3296-3300. doi: 10.1021/jasms.4c00161. Epub 2024 Oct 22.
5
Web of venom: exploration of big data resources in animal toxin research.
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae054.
6
Top-down proteomics.
Nat Rev Methods Primers. 2024;4(1). doi: 10.1038/s43586-024-00318-2. Epub 2024 Jun 13.
7
Top-down mass spectrometry of native proteoforms and their complexes: a community study.
Nat Methods. 2024 Dec;21(12):2388-2396. doi: 10.1038/s41592-024-02279-6. Epub 2024 May 14.
8
Mass Spectrometric De Novo Sequencing of Natural Peptides.
Methods Mol Biol. 2024;2758:61-75. doi: 10.1007/978-1-0716-3646-6_3.
10
MASH Native: a unified solution for native top-down proteomics data processing.
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad359.

References

1
Determination of monoisotopic masses and ion populations for large biomolecules from resolved isotopic distributions.
J Am Soc Mass Spectrom. 1995 Apr;6(4):229-33. doi: 10.1016/1044-0305(95)00017-8.
3
Top-down identification of protein biomarkers in bacteria with unsequenced genomes.
Anal Chem. 2009 Dec 1;81(23):9633-42. doi: 10.1021/ac9016677.
4
Precursor ion independent algorithm for top-down shotgun proteomics.
J Am Soc Mass Spectrom. 2009 Nov;20(11):2154-66. doi: 10.1016/j.jasms.2009.07.024. Epub 2009 Aug 13.
5
Integrated workflow for characterizing intact phosphoproteins from complex mixtures.
Anal Chem. 2009 Jun 1;81(11):4210-9. doi: 10.1021/ac802487q.
7
NITPICK: peak identification for mass spectrometry data.
BMC Bioinformatics. 2008 Aug 28;9:355. doi: 10.1186/1471-2105-9-355.
9
Interpreting top-down mass spectra using spectral alignment.
Anal Chem. 2008 Apr 1;80(7):2499-505. doi: 10.1021/ac702324u. Epub 2008 Feb 27.
10
Top-down MS, a powerful complement to the high capabilities of proteolysis proteomics.
FEBS J. 2007 Dec;274(24):6256-68. doi: 10.1111/j.1742-4658.2007.06147.x. Epub 2007 Nov 16.
