• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从大量信息中找到关键:快速消除不可解释的肽 MS/MS 谱和降低噪声,提高蛋白质鉴定的准确性。

Reducing the haystack to find the needle: improved protein identification after fast elimination of non-interpretable peptide MS/MS spectra and noise reduction.

机构信息

Sarajevo School of Science and Technology, Sarajevo, Bosnia-Herzegovina.

出版信息

BMC Genomics. 2010 Feb 10;11 Suppl 1(Suppl 1):S13. doi: 10.1186/1471-2164-11-S1-S13.

DOI:10.1186/1471-2164-11-S1-S13
PMID:20158870
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2822527/
Abstract

BACKGROUND

Tandem mass spectrometry (MS/MS) has become a standard method for identification of proteins extracted from biological samples but the huge number and the noise contamination of MS/MS spectra obstruct swift and reliable computer-aided interpretation. Typically, a minor fraction of the spectra per sample (most often, only a few %) and about 10% of the peaks per spectrum contribute to the final result if protein identification is not prevented by the noise at all.

RESULTS

Two fast preprocessing screens can substantially reduce the haystack of MS/MS data. (1) Simple sequence ladder rules remove spectra non-interpretable in peptide sequences. (2) Modified Fourier-transform-based criteria clear background in the remaining data. In average, only a remainder of 35% of the MS/MS spectra (each reduced in size by about one quarter) has to be handed over to the interpretation software for reliable protein identification essentially without loss of information, with a trend to improved sequence coverage and with proportional decrease of computer resource consumption.

CONCLUSIONS

The search for sequence ladders in tandem MS/MS spectra with subsequent noise suppression is a promising strategy to reduce the number of MS/MS spectra from electro-spray instruments and to enhance the reliability of protein matches. Supplementary material and the software are available from an accompanying WWW-site with the URL http://mendel.bii.a-star.edu.sg/mass-spectrometry/MSCleaner-2.0/.

摘要

背景

串联质谱(MS/MS)已成为从生物样本中提取蛋白质鉴定的标准方法,但 MS/MS 谱的数量庞大且存在噪声污染,这阻碍了快速可靠的计算机辅助解释。通常情况下,每个样本的谱图只有一小部分(通常只有几个百分点),每个谱图的峰只有大约 10%有助于最终结果,如果噪声完全没有阻止蛋白质鉴定的话。

结果

两个快速预处理筛选可以大大减少 MS/MS 数据的复杂性。(1)简单的序列梯规则去除肽序列不可解释的谱图。(2)基于修正傅里叶变换的标准清除剩余数据中的背景。平均而言,只有剩余的 35%的 MS/MS 谱图(每个谱图的大小减少约四分之一)必须交给解释软件进行可靠的蛋白质鉴定,基本上不会丢失信息,具有提高序列覆盖率的趋势,并相应减少计算机资源消耗。

结论

在串联 MS/MS 谱图中搜索序列梯,然后进行噪声抑制,这是一种很有前途的策略,可以减少电喷雾仪器产生的 MS/MS 谱图数量,并提高蛋白质匹配的可靠性。补充材料和软件可从附带的 WWW 站点获取,网址为 http://mendel.bii.a-star.edu.sg/mass-spectrometry/MSCleaner-2.0/。

相似文献

1
Reducing the haystack to find the needle: improved protein identification after fast elimination of non-interpretable peptide MS/MS spectra and noise reduction.从大量信息中找到关键:快速消除不可解释的肽 MS/MS 谱和降低噪声,提高蛋白质鉴定的准确性。
BMC Genomics. 2010 Feb 10;11 Suppl 1(Suppl 1):S13. doi: 10.1186/1471-2164-11-S1-S13.
2
Preprocessing Tandem Mass Spectra Using Genetic Programming for Peptide Identification.基于遗传编程的串联质谱预处理在肽段鉴定中的应用。
J Am Soc Mass Spectrom. 2019 Jul;30(7):1294-1307. doi: 10.1007/s13361-019-02196-5. Epub 2019 Apr 25.
3
When less can yield more - Computational preprocessing of MS/MS spectra for peptide identification.何时少能胜多——用于肽段鉴定的串联质谱(MS/MS)谱图的计算预处理
Proteomics. 2009 Nov;9(21):4978-84. doi: 10.1002/pmic.200900326.
4
Filtering of MS/MS data for peptide identification.用于肽段鉴定的MS/MS数据过滤
BMC Genomics. 2013;14 Suppl 7(Suppl 7):S2. doi: 10.1186/1471-2164-14-S7-S2. Epub 2013 Nov 5.
5
Chemical rule-based filtering of MS/MS spectra.基于化学规则的 MS/MS 光谱过滤。
Bioinformatics. 2013 Apr 1;29(7):925-32. doi: 10.1093/bioinformatics/btt061. Epub 2013 Feb 15.
6
Expert system for computer-assisted annotation of MS/MS spectra.用于 MS/MS 谱图计算机辅助注释的专家系统。
Mol Cell Proteomics. 2012 Nov;11(11):1500-9. doi: 10.1074/mcp.M112.020271. Epub 2012 Aug 10.
7
pClean: An Algorithm To Preprocess High-Resolution Tandem Mass Spectra for Database Searching.pClean:一种用于为数据库搜索预处理高分辨率串联质谱的算法。
J Proteome Res. 2019 Sep 6;18(9):3235-3244. doi: 10.1021/acs.jproteome.9b00141. Epub 2019 Aug 14.
8
PeakSelect: preprocessing tandem mass spectra for better peptide identification.PeakSelect:用于更好地鉴定肽段的串联质谱预处理方法。
Rapid Commun Mass Spectrom. 2008 Apr;22(8):1203-12. doi: 10.1002/rcm.3488.
9
Protein identification by spectral networks analysis.通过光谱网络分析进行蛋白质鉴定。
Proc Natl Acad Sci U S A. 2007 Apr 10;104(15):6140-5. doi: 10.1073/pnas.0701130104. Epub 2007 Apr 2.
10
Better score function for peptide identification with ETD MS/MS spectra.具有 ETD MS/MS 谱的肽鉴定的更好得分函数。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2105-11-S1-S4.

引用本文的文献

1
Improving Spectral Similarity and Molecular Network Reliability through Noise Signal Filtering in MS/MS Spectra.通过对MS/MS谱图中的噪声信号进行滤波来提高谱图相似度和分子网络可靠性
Anal Chem. 2025 Jul 29;97(29):15873-15882. doi: 10.1021/acs.analchem.5c02109. Epub 2025 Jul 17.
2
The impact of noise and missing fragmentation cleavages on peptide identification algorithms.噪声和缺失片段裂解对肽段鉴定算法的影响。
Comput Struct Biotechnol J. 2022 Mar 19;20:1402-1412. doi: 10.1016/j.csbj.2022.03.008. eCollection 2022.
3
Methods for Proteogenomics Data Analysis, Challenges, and Scalability Bottlenecks: A Survey.蛋白质基因组学数据分析方法、挑战及可扩展性瓶颈:一项综述。
IEEE Access. 2021;9:5497-5516. doi: 10.1109/ACCESS.2020.3047588. Epub 2020 Dec 25.
4
GPU-DAEMON: GPU algorithm design, data management & optimization template for array based big omics data.GPU-DAEMON:基于数组的大型组学数据的 GPU 算法设计、数据管理和优化模板。
Comput Biol Med. 2018 Oct 1;101:163-173. doi: 10.1016/j.compbiomed.2018.08.015. Epub 2018 Aug 16.
5
An Out-of-Core GPU based dimensionality reduction algorithm for Big Mass Spectrometry Data and its application in bottom-up Proteomics.一种基于外核GPU的用于海量质谱数据的降维算法及其在自下而上蛋白质组学中的应用。
ACM BCB. 2017 Aug;2017:550-555. doi: 10.1145/3107411.3107466.
6
Peppy: proteogenomic search software.Peppy:蛋白质基因组搜索软件。
J Proteome Res. 2013 Jun 7;12(6):3019-25. doi: 10.1021/pr400208w. Epub 2013 May 6.
7
A mass spectrometry-based method to screen for α-amidated peptides.基于质谱的α-酰胺化肽筛选方法。
Proteomics. 2012 Jan;12(2):173-82. doi: 10.1002/pmic.201100327. Epub 2011 Dec 14.
8
Practical 4'-phosphopantetheine active site discovery from proteomic samples.从蛋白质组学样本中发现实际的 4'-磷酸泛酰巯基乙胺活性位点。
J Proteome Res. 2011 Jan 7;10(1):320-9. doi: 10.1021/pr100953b. Epub 2010 Dec 13.

本文引用的文献

1
Electrospray ionization mass spectrometry: Deconvolution by an Entropy-Based algorithm.电喷雾电离质谱法:基于熵算法的去卷积
J Am Soc Mass Spectrom. 1992 Mar;3(3):207-15. doi: 10.1016/1044-0305(92)87004-I.
2
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.一种将肽的串联质谱数据与蛋白质数据库中氨基酸序列相关联的方法。
J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.
3
Interferences and contaminants encountered in modern mass spectrometry.现代质谱分析中遇到的干扰和污染物。
Anal Chim Acta. 2008 Oct 3;627(1):71-81. doi: 10.1016/j.aca.2008.04.043. Epub 2008 Apr 25.
4
A support vector machine model for the prediction of proteotypic peptides for accurate mass and time proteomics.一种用于预测精确质量和时间蛋白质组学中蛋白型肽段的支持向量机模型。
Bioinformatics. 2008 Jul 1;24(13):1503-9. doi: 10.1093/bioinformatics/btn218. Epub 2008 May 3.
5
Algorithm for peptide sequencing by tandem mass spectrometry based on better preprocessing and anti-symmetric computational model.基于更好的预处理和反对称计算模型的串联质谱肽段测序算法
Comput Syst Bioinformatics Conf. 2007;6:19-30.
6
Analysis and validation of proteomic data generated by tandem mass spectrometry.串联质谱法产生的蛋白质组学数据的分析与验证
Nat Methods. 2007 Oct;4(10):787-97. doi: 10.1038/nmeth1088.
7
Protein identification by spectral networks analysis.通过光谱网络分析进行蛋白质鉴定。
Proc Natl Acad Sci U S A. 2007 Apr 10;104(15):6140-5. doi: 10.1073/pnas.0701130104. Epub 2007 Apr 2.
8
Cleaning of raw peptide MS/MS spectra: improved protein identification following deconvolution of multiply charged peaks, isotope clusters, and removal of background noise.原始肽段质谱/质谱谱图的清洗:多重电荷峰、同位素簇解卷积以及背景噪声去除后蛋白质鉴定的改进。
Proteomics. 2006 Oct;6(19):5117-31. doi: 10.1002/pmic.200500928.
9
Quality classification of tandem mass spectrometry data.串联质谱数据的质量分类
Bioinformatics. 2006 Feb 15;22(4):400-6. doi: 10.1093/bioinformatics/bti829. Epub 2005 Dec 13.
10
InsPecT: identification of posttranslationally modified peptides from tandem mass spectra.InsPecT:从串联质谱中鉴定翻译后修饰的肽段。
Anal Chem. 2005 Jul 15;77(14):4626-39. doi: 10.1021/ac050102d.