• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过考虑mRNA和肽段的丰度来改进肽段鉴定

Improvement of peptide identification with considering the abundance of mRNA and peptide.

作者信息

Ma Chunwei, Xu Shaohang, Liu Geng, Liu Xin, Xu Xun, Wen Bo, Liu Siqi

机构信息

BGI-Shenzhen, Shenzhen, 518083, China.

出版信息

BMC Bioinformatics. 2017 Feb 16;18(1):109. doi: 10.1186/s12859-017-1491-5.

DOI:10.1186/s12859-017-1491-5
PMID:28201984
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5311845/
Abstract

BACKGROUND

Tandem mass spectrometry (MS/MS) followed by database search is a main approach to identify peptides/proteins in proteomic studies. A lot of effort has been devoted to improve the identification accuracy and sensitivity for peptides/proteins, such as developing advanced algorithms and expanding protein databases.

RESULTS

Herein, we described a new strategy for enhancing the sensitivity of protein/peptide identification through combination of mRNA and peptide abundance in Percolator. In our strategy, a new workflow for peptide identification is established on the basis of the abundance of transcripts and potential novel transcripts derived from RNA-Seq and abundance of peptides towards the same life species. We demonstrate the utility of this strategy by two MS/MS datasets and the results indicate that about 5% ~ 8% improvement of peptide identification can be achieved with 1% FDR in peptide level by integrating the peptide abundance, the transcript abundance and potential novel transcripts from RNA-Seq data. Meanwhile, 181 and 154 novel peptides were identified in the two datasets, respectively.

CONCLUSIONS

We have demonstrated that this strategy could enable improvement of peptide/protein identification and discovery of novel peptides, as compared with the traditional search methods.

摘要

背景

串联质谱(MS/MS)结合数据库搜索是蛋白质组学研究中鉴定肽段/蛋白质的主要方法。人们已经付出了很多努力来提高肽段/蛋白质的鉴定准确性和灵敏度,例如开发先进的算法和扩展蛋白质数据库。

结果

在此,我们描述了一种通过结合Percolator中mRNA和肽段丰度来提高蛋白质/肽段鉴定灵敏度的新策略。在我们的策略中,基于来自RNA测序的转录本丰度和潜在新转录本以及同一生物物种的肽段丰度,建立了一种新的肽段鉴定工作流程。我们通过两个MS/MS数据集证明了该策略的实用性,结果表明,通过整合肽段丰度、转录本丰度和来自RNA测序数据的潜在新转录本,在肽段水平上以1%的错误发现率(FDR)可实现约5%至8%的肽段鉴定改进。同时,在这两个数据集中分别鉴定出181个和154个新肽段。

结论

我们已经证明,与传统搜索方法相比,该策略能够改进肽段/蛋白质鉴定并发现新肽段。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/7b412920141b/12859_2017_1491_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/2ba57f09d907/12859_2017_1491_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/66a05a030feb/12859_2017_1491_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/6c0956b7d86d/12859_2017_1491_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/cccf0ef3f81a/12859_2017_1491_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/7b412920141b/12859_2017_1491_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/2ba57f09d907/12859_2017_1491_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/66a05a030feb/12859_2017_1491_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/6c0956b7d86d/12859_2017_1491_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/cccf0ef3f81a/12859_2017_1491_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b1/5311845/7b412920141b/12859_2017_1491_Fig5_HTML.jpg

相似文献

1
Improvement of peptide identification with considering the abundance of mRNA and peptide.通过考虑mRNA和肽段的丰度来改进肽段鉴定
BMC Bioinformatics. 2017 Feb 16;18(1):109. doi: 10.1186/s12859-017-1491-5.
2
PGA: an R/Bioconductor package for identification of novel peptides using a customized database derived from RNA-Seq.PGA:一个用于使用源自RNA测序的定制数据库鉴定新型肽段的R/Bioconductor软件包。
BMC Bioinformatics. 2016 Jun 17;17(1):244. doi: 10.1186/s12859-016-1133-3.
3
Effective Leveraging of Targeted Search Spaces for Improving Peptide Identification in Tandem Mass Spectrometry Based Proteomics.有效利用靶向搜索空间以改善基于串联质谱的蛋白质组学中的肽段鉴定
J Proteome Res. 2015 Dec 4;14(12):5169-78. doi: 10.1021/acs.jproteome.5b00504. Epub 2015 Nov 24.
4
A protein identification algorithm for tandem mass spectrometry by incorporating the abundance of mRNA into a binomial probability scoring model.将 mRNA 的丰度纳入二项式概率评分模型的串联质谱蛋白质鉴定算法。
J Proteomics. 2019 Apr 15;197:53-59. doi: 10.1016/j.jprot.2019.02.010. Epub 2019 Feb 18.
5
A peptide-retrieval strategy enables significant improvement of quantitative performance without compromising confidence of identification.肽段检索策略可在不影响鉴定置信度的情况下显著提高定量性能。
J Proteomics. 2017 Jan 30;152:276-282. doi: 10.1016/j.jprot.2016.11.020. Epub 2016 Nov 27.
6
Improving X!Tandem on peptide identification from mass spectrometry by self-boosted Percolator.通过自增强 percolator 提高 X!串联在质谱肽鉴定中的性能。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Sep-Oct;9(5):1273-80. doi: 10.1109/TCBB.2012.86.
7
Optimization of Search Engines and Postprocessing Approaches to Maximize Peptide and Protein Identification for High-Resolution Mass Data.优化搜索引擎和后处理方法以最大化高分辨率质谱数据的肽段和蛋白质鉴定
J Proteome Res. 2015 Nov 6;14(11):4662-73. doi: 10.1021/acs.jproteome.5b00536. Epub 2015 Sep 30.
8
The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment.基于质谱的蛋白质组学数据在验证从 RNA-Seq 数据重建的新型替代剪接形式方面的效用:初步评估。
BMC Bioinformatics. 2010 Dec 14;11 Suppl 11(Suppl 11):S14. doi: 10.1186/1471-2105-11-S11-S14.
9
In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.使用多个搜索引擎和明确的指标对蛋白质推断算法进行深入分析。
J Proteomics. 2017 Jan 6;150:170-182. doi: 10.1016/j.jprot.2016.08.002. Epub 2016 Aug 4.
10
MUMAL2: Improving sensitivity in shotgun proteomics using cost sensitive artificial neural networks and a threshold selector algorithm.MUMAL2:使用成本敏感型人工神经网络和阈值选择算法提高鸟枪法蛋白质组学的灵敏度
BMC Bioinformatics. 2016 Dec 15;17(Suppl 18):472. doi: 10.1186/s12859-016-1341-x.

引用本文的文献

1
IsoBayes: a Bayesian approach for single-isoform proteomics inference.IsoBayes:一种用于单异构体蛋白质组学推断的贝叶斯方法。
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf450.
2
: a Bayesian approach for single-isoform proteomics inference.一种用于单异构体蛋白质组学推断的贝叶斯方法。
bioRxiv. 2024 Jun 11:2024.06.10.598223. doi: 10.1101/2024.06.10.598223.
3
Peptide identifications and false discovery rates using different mass spectrometry platforms.不同质谱平台的肽鉴定和假发现率。

本文引用的文献

1
Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0.使用 percolator 3.0 对大规模蛋白质组学数据集进行快速准确的蛋白质假发现率估计。
J Am Soc Mass Spectrom. 2016 Nov;27(11):1719-1727. doi: 10.1007/s13361-016-1460-7. Epub 2016 Aug 29.
2
PGA: an R/Bioconductor package for identification of novel peptides using a customized database derived from RNA-Seq.PGA:一个用于使用源自RNA测序的定制数据库鉴定新型肽段的R/Bioconductor软件包。
BMC Bioinformatics. 2016 Jun 17;17(1):244. doi: 10.1186/s12859-016-1133-3.
3
JUMPg: An Integrative Proteogenomics Pipeline Identifying Unannotated Proteins in Human Brain and Cancer Cells.
Talanta. 2018 May 15;182:456-463. doi: 10.1016/j.talanta.2018.01.062. Epub 2018 Jan 31.
4
Proteomic Contributions to Medicinal Plant Research: From Plant Metabolism to Pharmacological Action.蛋白质组学对药用植物研究的贡献:从植物代谢到药理作用
Proteomes. 2017 Dec 7;5(4):35. doi: 10.3390/proteomes5040035.
5
Proteomics in non-human primates: utilizing RNA-Seq data to improve protein identification by mass spectrometry in vervet monkeys.非人类灵长类动物的蛋白质组学:利用 RNA-Seq 数据提高食蟹猴中质谱法的蛋白质鉴定水平。
BMC Genomics. 2017 Nov 13;18(1):877. doi: 10.1186/s12864-017-4279-0.
JUMPg:一种整合蛋白质基因组学流程,用于鉴定人脑中未注释的蛋白质以及癌细胞中的未注释蛋白质。
J Proteome Res. 2016 Jul 1;15(7):2309-20. doi: 10.1021/acs.jproteome.6b00344. Epub 2016 Jun 13.
4
IPeak: An open source tool to combine results from multiple MS/MS search engines.IPeak:一个用于整合多个串联质谱搜索引擎结果的开源工具。
Proteomics. 2015 Sep;15(17):2916-20. doi: 10.1002/pmic.201400208. Epub 2015 Aug 6.
5
MS-GF+ makes progress towards a universal database search tool for proteomics.MS-GF+朝着蛋白质组学通用数据库搜索工具的方向取得了进展。
Nat Commun. 2014 Oct 31;5:5277. doi: 10.1038/ncomms6277.
6
sapFinder: an R/Bioconductor package for detection of variant peptides in shotgun proteomics experiments.sapFinder:一种用于在鸟枪法蛋白质组学实验中检测变异肽的 R/Bioconductor 包。
Bioinformatics. 2014 Nov 1;30(21):3136-8. doi: 10.1093/bioinformatics/btu397. Epub 2014 Jul 22.
7
Utility of RNA-seq and GPMDB protein observation frequency for improving the sensitivity of protein identification by tandem MS.RNA测序和GPMDB蛋白质观察频率在提高串联质谱法蛋白质鉴定灵敏度方面的效用。
J Proteome Res. 2014 Sep 5;13(9):4113-9. doi: 10.1021/pr500496p. Epub 2014 Jul 28.
8
Discovery of novel genes and gene isoforms by integrating transcriptomic and proteomic profiling from mouse liver.通过整合小鼠肝脏的转录组和蛋白质组分析发现新基因和基因异构体。
J Proteome Res. 2014 May 2;13(5):2409-19. doi: 10.1021/pr4012206. Epub 2014 Apr 18.
9
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.一种将肽的串联质谱数据与蛋白质数据库中氨基酸序列相关联的方法。
J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.
10
Discovery and mass spectrometric analysis of novel splice-junction peptides using RNA-Seq.利用 RNA-Seq 发现和质谱分析新型剪接连接肽。
Mol Cell Proteomics. 2013 Aug;12(8):2341-53. doi: 10.1074/mcp.O113.028142. Epub 2013 Apr 29.