通过考虑mRNA和肽段的丰度来改进肽段鉴定

Improvement of peptide identification with considering the abundance of mRNA and peptide.

作者信息

Ma Chunwei, Xu Shaohang, Liu Geng, Liu Xin, Xu Xun, Wen Bo, Liu Siqi

机构信息

BGI-Shenzhen, Shenzhen, 518083, China.

出版信息

BMC Bioinformatics. 2017 Feb 16;18(1):109. doi: 10.1186/s12859-017-1491-5.

DOI:10.1186/s12859-017-1491-5

PMID:28201984

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5311845/

Abstract

BACKGROUND

Tandem mass spectrometry (MS/MS) followed by database search is a main approach to identify peptides/proteins in proteomic studies. A lot of effort has been devoted to improve the identification accuracy and sensitivity for peptides/proteins, such as developing advanced algorithms and expanding protein databases.

RESULTS

Herein, we described a new strategy for enhancing the sensitivity of protein/peptide identification through combination of mRNA and peptide abundance in Percolator. In our strategy, a new workflow for peptide identification is established on the basis of the abundance of transcripts and potential novel transcripts derived from RNA-Seq and abundance of peptides towards the same life species. We demonstrate the utility of this strategy by two MS/MS datasets and the results indicate that about 5% ~ 8% improvement of peptide identification can be achieved with 1% FDR in peptide level by integrating the peptide abundance, the transcript abundance and potential novel transcripts from RNA-Seq data. Meanwhile, 181 and 154 novel peptides were identified in the two datasets, respectively.

CONCLUSIONS

We have demonstrated that this strategy could enable improvement of peptide/protein identification and discovery of novel peptides, as compared with the traditional search methods.

摘要

背景

串联质谱（MS/MS）结合数据库搜索是蛋白质组学研究中鉴定肽段/蛋白质的主要方法。人们已经付出了很多努力来提高肽段/蛋白质的鉴定准确性和灵敏度，例如开发先进的算法和扩展蛋白质数据库。

结果

在此，我们描述了一种通过结合Percolator中mRNA和肽段丰度来提高蛋白质/肽段鉴定灵敏度的新策略。在我们的策略中，基于来自RNA测序的转录本丰度和潜在新转录本以及同一生物物种的肽段丰度，建立了一种新的肽段鉴定工作流程。我们通过两个MS/MS数据集证明了该策略的实用性，结果表明，通过整合肽段丰度、转录本丰度和来自RNA测序数据的潜在新转录本，在肽段水平上以1%的错误发现率（FDR）可实现约5%至8%的肽段鉴定改进。同时，在这两个数据集中分别鉴定出181个和154个新肽段。

结论

我们已经证明，与传统搜索方法相比，该策略能够改进肽段/蛋白质鉴定并发现新肽段。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/2ba57f09d907/12859_2017_1491_Fig1_HTML.jpg

相似文献

Improvement of peptide identification with considering the abundance of mRNA and peptide.通过考虑mRNA和肽段的丰度来改进肽段鉴定

BMC Bioinformatics. 2017 Feb 16;18(1):109. doi: 10.1186/s12859-017-1491-5.

PGA: an R/Bioconductor package for identification of novel peptides using a customized database derived from RNA-Seq.PGA：一个用于使用源自RNA测序的定制数据库鉴定新型肽段的R/Bioconductor软件包。

BMC Bioinformatics. 2016 Jun 17;17(1):244. doi: 10.1186/s12859-016-1133-3.

Effective Leveraging of Targeted Search Spaces for Improving Peptide Identification in Tandem Mass Spectrometry Based Proteomics.有效利用靶向搜索空间以改善基于串联质谱的蛋白质组学中的肽段鉴定

J Proteome Res. 2015 Dec 4;14(12):5169-78. doi: 10.1021/acs.jproteome.5b00504. Epub 2015 Nov 24.

A protein identification algorithm for tandem mass spectrometry by incorporating the abundance of mRNA into a binomial probability scoring model.将 mRNA 的丰度纳入二项式概率评分模型的串联质谱蛋白质鉴定算法。

J Proteomics. 2019 Apr 15;197:53-59. doi: 10.1016/j.jprot.2019.02.010. Epub 2019 Feb 18.

A peptide-retrieval strategy enables significant improvement of quantitative performance without compromising confidence of identification.肽段检索策略可在不影响鉴定置信度的情况下显著提高定量性能。

J Proteomics. 2017 Jan 30;152:276-282. doi: 10.1016/j.jprot.2016.11.020. Epub 2016 Nov 27.

Improving X!Tandem on peptide identification from mass spectrometry by self-boosted Percolator.通过自增强 percolator 提高 X！串联在质谱肽鉴定中的性能。

IEEE/ACM Trans Comput Biol Bioinform. 2012 Sep-Oct;9(5):1273-80. doi: 10.1109/TCBB.2012.86.

Optimization of Search Engines and Postprocessing Approaches to Maximize Peptide and Protein Identification for High-Resolution Mass Data.优化搜索引擎和后处理方法以最大化高分辨率质谱数据的肽段和蛋白质鉴定

J Proteome Res. 2015 Nov 6;14(11):4662-73. doi: 10.1021/acs.jproteome.5b00536. Epub 2015 Sep 30.

The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment.基于质谱的蛋白质组学数据在验证从 RNA-Seq 数据重建的新型替代剪接形式方面的效用：初步评估。

BMC Bioinformatics. 2010 Dec 14;11 Suppl 11(Suppl 11):S14. doi: 10.1186/1471-2105-11-S11-S14.

In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.使用多个搜索引擎和明确的指标对蛋白质推断算法进行深入分析。

J Proteomics. 2017 Jan 6;150:170-182. doi: 10.1016/j.jprot.2016.08.002. Epub 2016 Aug 4.

MUMAL2: Improving sensitivity in shotgun proteomics using cost sensitive artificial neural networks and a threshold selector algorithm.MUMAL2：使用成本敏感型人工神经网络和阈值选择算法提高鸟枪法蛋白质组学的灵敏度

BMC Bioinformatics. 2016 Dec 15;17(Suppl 18):472. doi: 10.1186/s12859-016-1341-x.

引用本文的文献

IsoBayes: a Bayesian approach for single-isoform proteomics inference.IsoBayes：一种用于单异构体蛋白质组学推断的贝叶斯方法。

Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf450.

: a Bayesian approach for single-isoform proteomics inference.一种用于单异构体蛋白质组学推断的贝叶斯方法。

bioRxiv. 2024 Jun 11:2024.06.10.598223. doi: 10.1101/2024.06.10.598223.

Peptide identifications and false discovery rates using different mass spectrometry platforms.不同质谱平台的肽鉴定和假发现率。

Talanta. 2018 May 15;182:456-463. doi: 10.1016/j.talanta.2018.01.062. Epub 2018 Jan 31.

Proteomic Contributions to Medicinal Plant Research: From Plant Metabolism to Pharmacological Action.蛋白质组学对药用植物研究的贡献：从植物代谢到药理作用

Proteomes. 2017 Dec 7;5(4):35. doi: 10.3390/proteomes5040035.

Proteomics in non-human primates: utilizing RNA-Seq data to improve protein identification by mass spectrometry in vervet monkeys.非人类灵长类动物的蛋白质组学：利用 RNA-Seq 数据提高食蟹猴中质谱法的蛋白质鉴定水平。

BMC Genomics. 2017 Nov 13;18(1):877. doi: 10.1186/s12864-017-4279-0.

本文引用的文献

Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0.使用 percolator 3.0 对大规模蛋白质组学数据集进行快速准确的蛋白质假发现率估计。

J Am Soc Mass Spectrom. 2016 Nov;27(11):1719-1727. doi: 10.1007/s13361-016-1460-7. Epub 2016 Aug 29.

BMC Bioinformatics. 2016 Jun 17;17(1):244. doi: 10.1186/s12859-016-1133-3.

JUMPg: An Integrative Proteogenomics Pipeline Identifying Unannotated Proteins in Human Brain and Cancer Cells.JUMPg：一种整合蛋白质基因组学流程，用于鉴定人脑中未注释的蛋白质以及癌细胞中的未注释蛋白质。

J Proteome Res. 2016 Jul 1;15(7):2309-20. doi: 10.1021/acs.jproteome.6b00344. Epub 2016 Jun 13.

IPeak: An open source tool to combine results from multiple MS/MS search engines.IPeak：一个用于整合多个串联质谱搜索引擎结果的开源工具。

Proteomics. 2015 Sep;15(17):2916-20. doi: 10.1002/pmic.201400208. Epub 2015 Aug 6.

MS-GF+ makes progress towards a universal database search tool for proteomics.MS-GF+朝着蛋白质组学通用数据库搜索工具的方向取得了进展。

Nat Commun. 2014 Oct 31;5:5277. doi: 10.1038/ncomms6277.

sapFinder: an R/Bioconductor package for detection of variant peptides in shotgun proteomics experiments.sapFinder：一种用于在鸟枪法蛋白质组学实验中检测变异肽的 R/Bioconductor 包。

Bioinformatics. 2014 Nov 1;30(21):3136-8. doi: 10.1093/bioinformatics/btu397. Epub 2014 Jul 22.

Utility of RNA-seq and GPMDB protein observation frequency for improving the sensitivity of protein identification by tandem MS.RNA测序和GPMDB蛋白质观察频率在提高串联质谱法蛋白质鉴定灵敏度方面的效用。

J Proteome Res. 2014 Sep 5;13(9):4113-9. doi: 10.1021/pr500496p. Epub 2014 Jul 28.

Discovery of novel genes and gene isoforms by integrating transcriptomic and proteomic profiling from mouse liver.通过整合小鼠肝脏的转录组和蛋白质组分析发现新基因和基因异构体。

J Proteome Res. 2014 May 2;13(5):2409-19. doi: 10.1021/pr4012206. Epub 2014 Apr 18.

An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.一种将肽的串联质谱数据与蛋白质数据库中氨基酸序列相关联的方法。

J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.

Discovery and mass spectrometric analysis of novel splice-junction peptides using RNA-Seq.利用 RNA-Seq 发现和质谱分析新型剪接连接肽。

Mol Cell Proteomics. 2013 Aug;12(8):2341-53. doi: 10.1074/mcp.O113.028142. Epub 2013 Apr 29.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过考虑mRNA和肽段的丰度来改进肽段鉴定

Improvement of peptide identification with considering the abundance of mRNA and peptide.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献