Characterization of Proteoforms with Unknown Post-translational Modifications Using the MIScore.

Author information

Kou Qiang, Zhu Binhai, Wu Si, Ansong Charles, Tolić Nikola, Paša-Tolić Ljiljana, Liu Xiaowen

Affiliations

Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, Indiana 46202, United States.

Department of Computer Science, Montana State University, Bozeman, Montana 59717, United States.

Publication information

J Proteome Res. 2016 Aug 5;15(8):2422-32. doi: 10.1021/acs.jproteome.5b01098. Epub 2016 Jul 1.

Abstract

Various proteoforms may be generated from a single gene due to primary structure alterations (PSAs) such as genetic variations, alternative splicing, and post-translational modifications (PTMs). Top-down mass spectrometry is capable of analyzing intact proteins and identifying patterns of multiple PSAs, making it the method of choice for studying complex proteoforms. In top-down proteomics, proteoform identification is often performed by searching tandem mass spectra against a protein sequence database that contains only one reference protein sequence for each gene or transcript variant in a proteome. Because of the incompleteness of the protein database, an identified proteoform may contain unknown PSAs compared with the reference sequence. Proteoform characterization is to identify and localize PSAs in a proteoform. Although many software tools have been proposed for proteoform identification by top-down mass spectrometry, the characterization of proteoforms in identified proteoform-spectrum matches still relies mainly on manual annotation. We propose to use the Modification Identification Score (MIScore), which is based on Bayesian models, to automatically identify and localize PTMs in proteoforms. Experiments showed that the MIScore is accurate in identifying and localizing one or two modifications.
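The abstract states only that the MIScore is based on Bayesian models, so the following is a toy illustration of the general idea of Bayesian modification localization, not the published scoring function. It assumes a uniform prior over candidate sites and a simple independent-match likelihood. The function names, the `p_hit` parameter, the peptide `GASTK`, and the use of bare prefix residue masses (ignoring ion-type offsets and charge) are all hypothetical choices made for illustration.

```python
# Toy sketch only: NOT the published MIScore model.
# Monoisotopic residue masses (Da) for the amino acids in the toy peptide.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203,
    "T": 101.04768, "K": 128.09496,
}


def prefix_masses(sequence, site=None, shift=0.0):
    """Cumulative residue mass of every proper prefix of `sequence`.

    The mass shift is added at the 0-based `site`, so every prefix that
    contains the modified residue carries the shift.
    """
    masses, total = [], 0.0
    for i, aa in enumerate(sequence[:-1]):
        total += RESIDUE_MASS[aa]
        if i == site:
            total += shift
        masses.append(total)
    return masses


def count_matches(theoretical, observed, tol=0.01):
    """Number of theoretical masses with an observed mass within `tol` Da."""
    return sum(any(abs(t - o) <= tol for o in observed) for t in theoretical)


def site_posteriors(sequence, shift, observed, candidates, p_hit=0.8, tol=0.01):
    """Posterior probability of each candidate site under a uniform prior.

    Assumed likelihood: each theoretical prefix mass is matched independently
    with probability `p_hit` when the site hypothesis is correct.
    """
    n = len(sequence) - 1  # number of prefix fragments
    weights = {}
    for site in candidates:
        k = count_matches(prefix_masses(sequence, site, shift), observed, tol)
        weights[site] = p_hit ** k * (1.0 - p_hit) ** (n - k)
    total = sum(weights.values())
    return {site: w / total for site, w in weights.items()}


PHOSPHO = 79.96633  # monoisotopic mass shift of phosphorylation (Da)

# Simulate a noise-free spectrum in which the threonine (index 3) is modified.
observed = prefix_masses("GASTK", site=3, shift=PHOSPHO)
posteriors = site_posteriors("GASTK", PHOSPHO, observed, candidates=[2, 3])
print(posteriors)
```

In this toy case the serine hypothesis (index 2) explains three of the four simulated fragments and the threonine hypothesis (index 3) explains all four, so the posterior favors threonine at about 0.8 versus 0.2. A real scoring model would also need to handle noise peaks, multiple ion types, and several simultaneous modifications.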

Similar articles

3
A mass graph-based approach for the identification of modified proteoforms using top-down tandem mass spectra.
Bioinformatics. 2017 May 1;33(9):1309-1316. doi: 10.1093/bioinformatics/btw806.
4
Improving Proteoform Identifications in Complex Systems Through Integration of Bottom-Up and Top-Down Data.
J Proteome Res. 2020 Aug 7;19(8):3510-3517. doi: 10.1021/acs.jproteome.0c00332. Epub 2020 Jul 10.
5
Expanding Proteoform Identifications in Top-Down Proteomic Analyses by Constructing Proteoform Families.
Anal Chem. 2018 Jan 16;90(2):1325-1333. doi: 10.1021/acs.analchem.7b04221. Epub 2017 Dec 22.
6
TopPIC: a software tool for top-down mass spectrometry-based proteoform identification and characterization.
Bioinformatics. 2016 Nov 15;32(22):3495-3497. doi: 10.1093/bioinformatics/btw398. Epub 2016 Jul 16.
8
Top-Down Proteomics and the Challenges of True Proteoform Characterization.
J Proteome Res. 2023 Dec 1;22(12):3663-3675. doi: 10.1021/acs.jproteome.3c00416. Epub 2023 Nov 8.
9
Intact-Mass Analysis Facilitating the Identification of Large Human Heart Proteoforms.
Anal Chem. 2019 Sep 3;91(17):10937-10942. doi: 10.1021/acs.analchem.9b02343. Epub 2019 Aug 14.
10
Proteoform Analysis and Construction of Proteoform Families in Proteoform Suite.
Methods Mol Biol. 2022;2500:67-81. doi: 10.1007/978-1-0716-2325-1_7.

Cited by

1
Deep Plasma Proteomics with Data-Independent Acquisition: Clinical Study Protocol Optimization with a COVID-19 Cohort.
J Proteome Res. 2024 Sep 6;23(9):3806-3822. doi: 10.1021/acs.jproteome.4c00104. Epub 2024 Aug 19.
2
Top-down proteomics.
Nat Rev Methods Primers. 2024;4(1). doi: 10.1038/s43586-024-00318-2. Epub 2024 Jun 13.
3
Characterization of Proteoform Post-Translational Modifications by Top-Down and Bottom-Up Mass Spectrometry in Conjunction with Annotations.
J Proteome Res. 2023 Oct 6;22(10):3178-3189. doi: 10.1021/acs.jproteome.3c00207. Epub 2023 Sep 20.
4
Large-scale top-down proteomics of the Arabidopsis thaliana leaf and chloroplast proteomes.
Proteomics. 2023 Feb;23(3-4):e2100377. doi: 10.1002/pmic.202100377. Epub 2022 Oct 1.
5
TopPIC Gateway: A Web Gateway for Top-Down Mass Spectrometry Data Interpretation.
PEARC20 (2020). 2020 Jul;2020:461-464. doi: 10.1145/3311790.3400853.
6
TopMSV: A Web-Based Tool for Top-Down Mass Spectrometry Data Visualization.
J Am Soc Mass Spectrom. 2021 Jun 2;32(6):1312-1318. doi: 10.1021/jasms.0c00460. Epub 2021 Mar 29.
7
Integrating Top-Down and Bottom-Up Mass Spectrometric Strategies for Proteomic Profiling of Iranian Saw-Scaled Viper, , Venom.
J Proteome Res. 2021 Jan 1;20(1):895-908. doi: 10.1021/acs.jproteome.0c00687. Epub 2020 Nov 22.
8
Proteoform Identification by Combining RNA-Seq and Top-Down Mass Spectrometry.
J Proteome Res. 2021 Jan 1;20(1):261-269. doi: 10.1021/acs.jproteome.0c00369. Epub 2020 Nov 12.
9
Identification and Quantification of Proteoforms by Mass Spectrometry.
Proteomics. 2019 May;19(10):e1800361. doi: 10.1002/pmic.201800361.
10
Top-down Mass Spectrometry Analysis of Human Serum Autoantibody Antigen-Binding Fragments.
Sci Rep. 2019 Feb 20;9(1):2345. doi: 10.1038/s41598-018-38380-y.
