Mass spectrum sequential subtraction speeds up searching large peptide MS/MS spectra datasets against large nucleotide databases for proteogenomics.

Affiliation

Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan.

Publication information

Genes Cells. 2012 Aug;17(8):633-44. doi: 10.1111/j.1365-2443.2012.01615.x. Epub 2012 Jun 12.

DOI: 10.1111/j.1365-2443.2012.01615.x
PMID: 22686349
Abstract

We have developed a novel bioinformatics method called mass spectrum sequential subtraction (MSSS) to search large peptide spectra datasets produced by liquid chromatography/mass spectrometry (LC-MS/MS) against protein databases and large nucleotide sequence databases. The main principle of MSSS is to search the peptide spectra set against the protein database first, then remove the spectra corresponding to the identified peptides, leaving a smaller set of remaining spectra to search against the nucleotide sequence database. This reduces the number of spectra to be searched and so limits the peptide search space. A comparison of MSSS with the conventional search approach, using a dataset of 27 LC-MS/MS runs of rice cultured cells, indicated that MSSS reduced the search queries to 50% and the search time to 75% on average. In addition, MSSS had no effect on the identification false-positive rate (FPR) or on the ability to identify novel peptide sequences. We used MSSS to analyze another dataset of 34 LC-MS/MS runs, which identified an additional 74 novel peptides. Proteogenomic analysis with these additional peptides yielded 47 new genomic features in 24 rice genes plus 24 intergenic peptides. These results show the utility of MSSS in searching large databases with large MS/MS datasets for proteogenomics.
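The subtraction step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function `sequential_subtraction`, the `SearchFn` callback type, and the toy `lookup_search` helper are hypothetical names introduced here, and spectra are reduced to bare identifiers. A real pipeline would call an MS/MS search engine and apply a score or FPR threshold to decide which spectra count as identified.

```python
from typing import Callable, Dict, List, Tuple

SpectrumId = str
# Hypothetical search callback: given spectrum ids, return {spectrum_id: peptide}
# for the spectra that were confidently identified.
SearchFn = Callable[[List[SpectrumId]], Dict[SpectrumId, str]]


def sequential_subtraction(
    spectra: List[SpectrumId],
    search_protein_db: SearchFn,
    search_nucleotide_db: SearchFn,
) -> Tuple[Dict[SpectrumId, str], Dict[SpectrumId, str], List[SpectrumId]]:
    # Step 1: search the full spectra set against the (smaller) protein database.
    protein_hits = search_protein_db(spectra)
    # Step 2: subtract the spectra that were already identified.
    remaining = [s for s in spectra if s not in protein_hits]
    # Step 3: search only the remaining spectra against the nucleotide database.
    nucleotide_hits = search_nucleotide_db(remaining)
    return protein_hits, nucleotide_hits, remaining


def lookup_search(known: Dict[SpectrumId, str]) -> SearchFn:
    """Toy stand-in for a search engine: 'identifies' spectra listed in `known`."""
    def search(queries: List[SpectrumId]) -> Dict[SpectrumId, str]:
        return {s: known[s] for s in queries if s in known}
    return search


# Toy data (made up for illustration only).
spectra = ["s1", "s2", "s3", "s4"]
protein_search = lookup_search({"s1": "PEPTIDEA", "s3": "PEPTIDEB"})
nucleotide_search = lookup_search({"s1": "PEPTIDEA", "s2": "NOVELPEP"})

protein_hits, nucleotide_hits, remaining = sequential_subtraction(
    spectra, protein_search, nucleotide_search
)
print(remaining)        # ['s2', 's4']
print(nucleotide_hits)  # {'s2': 'NOVELPEP'}
```

In this toy run only two of the four spectra reach the nucleotide search, which is the mechanism by which the method reduces the number of search queries.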


Similar articles

1
Mass spectrum sequential subtraction speeds up searching large peptide MS/MS spectra datasets against large nucleotide databases for proteogenomics.
Genes Cells. 2012 Aug;17(8):633-44. doi: 10.1111/j.1365-2443.2012.01615.x. Epub 2012 Jun 12.
2
VEMS 3.0: algorithms and computational tools for tandem mass spectrometry based identification of post-translational modifications in proteins.
J Proteome Res. 2005 Nov-Dec;4(6):2338-47. doi: 10.1021/pr050264q.
3
Improving sensitivity in shotgun proteomics using a peptide-centric database with reduced complexity: protease cleavage and SCX elution rules from data mining of MS/MS spectra.
Anal Chem. 2006 Feb 15;78(4):1071-84. doi: 10.1021/ac051127f.
4
Improving peptide identification with single-stage mass spectrum peaks.
Bioinformatics. 2009 Nov 15;25(22):2969-74. doi: 10.1093/bioinformatics/btp501. Epub 2009 Aug 18.
5
Integrated approach for manual evaluation of peptides identified by searching protein sequence databases with tandem mass spectra.
J Proteome Res. 2005 May-Jun;4(3):998-1005. doi: 10.1021/pr049754t.
6
Improving mass and liquid chromatography based identification of proteins using bayesian scoring.
J Proteome Res. 2005 Nov-Dec;4(6):2174-84. doi: 10.1021/pr050251c.
7
Genome annotation of Anopheles gambiae using mass spectrometry-derived data.
BMC Genomics. 2005 Sep 19;6:128. doi: 10.1186/1471-2164-6-128.
8
Overview of the HUPO Plasma Proteome Project: results from the pilot phase with 35 collaborating laboratories and multiple analytical groups, generating a core dataset of 3020 proteins and a publicly-available database.
Proteomics. 2005 Aug;5(13):3226-45. doi: 10.1002/pmic.200500358.
9
Sequence similarity-driven proteomics in organisms with unknown genomes by LC-MS/MS and automated de novo sequencing.
Proteomics. 2007 Jul;7(14):2318-29. doi: 10.1002/pmic.200700003.
10
Analysis of the resolution limitations of peptide identification algorithms.
J Proteome Res. 2011 Dec 2;10(12):5555-61. doi: 10.1021/pr200913a. Epub 2011 Oct 26.

Cited by

1
Improving the Genome Annotation of Using Proteogenomics.
Curr Genomics. 2021 Dec 30;22(5):373-383. doi: 10.2174/1389202922666211011143957.
2
Advances in Multi-Omics Approaches for Molecular Breeding of Black Rot Resistance in L.
Front Plant Sci. 2021 Dec 6;12:742553. doi: 10.3389/fpls.2021.742553. eCollection 2021.
3
Proteogenomics: Integrating Next-Generation Sequencing and Mass Spectrometry to Characterize Human Proteomic Variation.
Annu Rev Anal Chem (Palo Alto Calif). 2016 Jun 12;9(1):521-45. doi: 10.1146/annurev-anchem-071015-041722. Epub 2016 Mar 30.
4
Proteogenomics: concepts, applications and computational strategies.
Nat Methods. 2014 Nov;11(11):1114-25. doi: 10.1038/nmeth.3144.
5
Next-generation sequence assembly: four stages of data processing and computational challenges.
PLoS Comput Biol. 2013;9(12):e1003345. doi: 10.1371/journal.pcbi.1003345. Epub 2013 Dec 12.
6
Peppy: proteogenomic search software.
J Proteome Res. 2013 Jun 7;12(6):3019-25. doi: 10.1021/pr400208w. Epub 2013 May 6.