• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过光谱比较鉴定肽段的方法。

Methods for peptide identification by spectral comparison.

作者信息

Liu Jian, Bell Alexander W, Bergeron John J M, Yanofsky Corey M, Carrillo Brian, Beaudrie Christian E H, Kearney Robert E

机构信息

Center for Cellular & Biomolecular Research, University of Toronto, Toronto, Canada.

出版信息

Proteome Sci. 2007 Jan 16;5:3. doi: 10.1186/1477-5956-5-3.

DOI:10.1186/1477-5956-5-3
PMID:17227583
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1783643/
Abstract

BACKGROUND

Tandem mass spectrometry followed by database search is currently the predominant technology for peptide sequencing in shotgun proteomics experiments. Most methods compare experimentally observed spectra to the theoretical spectra predicted from the sequences in protein databases. There is a growing interest, however, in comparing unknown experimental spectra to a library of previously identified spectra. This approach has the advantage of taking into account instrument-dependent factors and peptide-specific differences in fragmentation probabilities. It is also computationally more efficient for high-throughput proteomics studies.

RESULTS

This paper investigates computational issues related to this spectral comparison approach. Different methods have been empirically evaluated over several large sets of spectra. First, we illustrate that the peak intensities follow a Poisson distribution. This implies that applying a square root transform will optimally stabilize the peak intensity variance. Our results show that the square root did indeed outperform other transforms, resulting in improved accuracy of spectral matching. Second, different measures of spectral similarity were compared, and the results illustrated that the correlation coefficient was most robust. Finally, we examine how to assemble multiple spectra associated with the same peptide to generate a synthetic reference spectrum. Ensemble averaging is shown to provide the best combination of accuracy and efficiency.

CONCLUSION

Our results demonstrate that when combined, these methods can boost the sensitivity and specificity of spectral comparison. Therefore they are capable of enhancing and complementing existing tools for consistent and accurate peptide identification.

摘要

背景

在鸟枪法蛋白质组学实验中,串联质谱结合数据库搜索是目前肽段测序的主要技术。大多数方法将实验观测到的谱图与从蛋白质数据库序列预测的理论谱图进行比较。然而,将未知的实验谱图与先前鉴定的谱图库进行比较的兴趣日益浓厚。这种方法的优点是考虑了仪器相关因素以及肽段特异性的碎片化概率差异。对于高通量蛋白质组学研究,它在计算上也更高效。

结果

本文研究了与这种谱图比较方法相关的计算问题。在几组大量的谱图上对不同方法进行了实证评估。首先,我们表明峰强度遵循泊松分布。这意味着应用平方根变换将最佳地稳定峰强度方差。我们的结果表明,平方根变换确实优于其他变换,从而提高了谱图匹配的准确性。其次,比较了不同的谱图相似性度量,结果表明相关系数最为稳健。最后,我们研究了如何组装与同一肽段相关的多个谱图以生成合成参考谱图。结果表明,总体平均提供了准确性和效率的最佳组合。

结论

我们的结果表明,这些方法结合使用时,可以提高谱图比较的灵敏度和特异性。因此,它们能够增强和补充现有的工具,以实现一致且准确的肽段鉴定。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/7478269079c3/1477-5956-5-3-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/d02993058983/1477-5956-5-3-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/096773a19e86/1477-5956-5-3-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/a23f07a4b201/1477-5956-5-3-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/c551650923db/1477-5956-5-3-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/32fb2b8afdd9/1477-5956-5-3-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/7478269079c3/1477-5956-5-3-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/d02993058983/1477-5956-5-3-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/096773a19e86/1477-5956-5-3-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/a23f07a4b201/1477-5956-5-3-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/c551650923db/1477-5956-5-3-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/32fb2b8afdd9/1477-5956-5-3-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3c/1783643/7478269079c3/1477-5956-5-3-6.jpg

相似文献

1
Methods for peptide identification by spectral comparison.通过光谱比较鉴定肽段的方法。
Proteome Sci. 2007 Jan 16;5:3. doi: 10.1186/1477-5956-5-3.
2
Spectral library generating function for assessing spectrum-spectrum match significance.用于评估谱图-谱图匹配显著性的光谱库生成函数。
J Proteome Res. 2013 Sep 6;12(9):3944-51. doi: 10.1021/pr400230p. Epub 2013 Jul 31.
3
Denoising peptide tandem mass spectra for spectral libraries: a Bayesian approach.为谱库对肽串联质谱进行去噪:一种贝叶斯方法。
J Proteome Res. 2013 Jul 5;12(7):3223-32. doi: 10.1021/pr400080b. Epub 2013 Jun 6.
4
Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?评估蛋白质组学中的从头测序:是否已经成为数据库驱动肽鉴定的准确替代方法?
Brief Bioinform. 2018 Sep 28;19(5):954-970. doi: 10.1093/bib/bbx033.
5
Understanding the improved sensitivity of spectral library searching over sequence database searching in proteomics data analysis.理解在蛋白质组学数据分析中,光谱库搜索相对于序列数据库搜索的灵敏度提高。
Proteomics. 2011 Mar;11(6):1075-85. doi: 10.1002/pmic.201000492. Epub 2011 Feb 7.
6
MUMAL2: Improving sensitivity in shotgun proteomics using cost sensitive artificial neural networks and a threshold selector algorithm.MUMAL2:使用成本敏感型人工神经网络和阈值选择算法提高鸟枪法蛋白质组学的灵敏度
BMC Bioinformatics. 2016 Dec 15;17(Suppl 18):472. doi: 10.1186/s12859-016-1341-x.
7
Proteomics in non-human primates: utilizing RNA-Seq data to improve protein identification by mass spectrometry in vervet monkeys.非人类灵长类动物的蛋白质组学:利用 RNA-Seq 数据提高食蟹猴中质谱法的蛋白质鉴定水平。
BMC Genomics. 2017 Nov 13;18(1):877. doi: 10.1186/s12864-017-4279-0.
8
Statistical models for protein validation using tandem mass spectral data and protein amino acid sequence databases.使用串联质谱数据和蛋白质氨基酸序列数据库进行蛋白质验证的统计模型。
Anal Chem. 2004 Mar 15;76(6):1664-71. doi: 10.1021/ac035112y.
9
A semi-empirical approach for predicting unobserved peptide MS/MS spectra from spectral libraries.一种从光谱库中预测未观测到的肽 MS/MS 光谱的半经验方法。
Proteomics. 2011 Dec;11(24):4702-11. doi: 10.1002/pmic.201100316. Epub 2011 Nov 23.
10
Constructing a Tandem Mass Spectral Library for Forensic Ricin Identification.构建用于法医蓖麻毒素鉴定的串联质谱文库。
J Proteome Res. 2019 Nov 1;18(11):3926-3935. doi: 10.1021/acs.jproteome.9b00377. Epub 2019 Oct 14.

引用本文的文献

1
Statistical Framework for Identifying Differences in Similar Mass Spectra: Expanding Possibilities for Isomer Identification.用于识别相似质谱差异的统计框架:拓展同系物鉴定的可能性。
Anal Chem. 2023 May 2;95(17):6996-7005. doi: 10.1021/acs.analchem.3c00495. Epub 2023 Apr 17.
2
The Emerging Role of uORF-Encoded uPeptides and HLA uLigands in Cellular and Tumor Biology.uORF编码的u肽和HLA u配体在细胞和肿瘤生物学中的新兴作用
Cancers (Basel). 2022 Dec 7;14(24):6031. doi: 10.3390/cancers14246031.
3
Characterization of core fucosylation via sequential enzymatic treatments of intact glycopeptides and mass spectrometry analysis.

本文引用的文献

1
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.一种将肽的串联质谱数据与蛋白质数据库中氨基酸序列相关联的方法。
J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.
2
Quantitative proteomics analysis of the secretory pathway.分泌途径的定量蛋白质组学分析
Cell. 2006 Dec 15;127(6):1265-81. doi: 10.1016/j.cell.2006.10.036.
3
Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries.使用谱库对大规模蛋白质组学实验中的肽段MS/MS谱进行分析。
通过对完整糖肽进行顺序酶处理和质谱分析来表征核心岩藻糖基化。
Nat Commun. 2022 Jul 7;13(1):3910. doi: 10.1038/s41467-022-31472-4.
4
Review of Liquid Chromatography-Mass Spectrometry-Based Proteomic Analyses of Body Fluids to Diagnose Infectious Diseases.体液的基于液相色谱-质谱联用的蛋白质组学分析在感染性疾病诊断中的研究进展。
Int J Mol Sci. 2022 Feb 16;23(4):2187. doi: 10.3390/ijms23042187.
5
Proteogenomic characterization of pancreatic ductal adenocarcinoma.胰腺导管腺癌的蛋白质基因组学特征分析。
Cell. 2021 Sep 16;184(19):5031-5052.e26. doi: 10.1016/j.cell.2021.08.023.
6
Integrated Proteomic and Glycoproteomic Characterization of Human High-Grade Serous Ovarian Carcinoma.人类高级别浆液性卵巢癌的蛋白质组学和糖蛋白质组学综合分析。
Cell Rep. 2020 Oct 20;33(3):108276. doi: 10.1016/j.celrep.2020.108276.
7
Full-Spectrum Prediction of Peptides Tandem Mass Spectra using Deep Neural Network.使用深度神经网络进行肽串联质谱的全谱预测。
Anal Chem. 2020 Mar 17;92(6):4275-4283. doi: 10.1021/acs.analchem.9b04867. Epub 2020 Feb 25.
8
Reanalysis of Global Proteomic and Phosphoproteomic Data Identified a Large Number of Glycopeptides.重新分析全球蛋白质组学和磷酸化蛋白质组学数据鉴定出大量糖肽。
Anal Chem. 2018 Jul 3;90(13):8065-8071. doi: 10.1021/acs.analchem.8b01137. Epub 2018 Jun 11.
9
Large scale analysis of amino acid substitutions in bacterial proteomics.细菌蛋白质组学中氨基酸取代的大规模分析。
BMC Bioinformatics. 2016 Nov 8;17(1):450. doi: 10.1186/s12859-016-1301-5.
10
Quantitative Comparison of Tandem Mass Spectra Obtained on Various Instruments.各种仪器上获得的串联质谱的定量比较。
J Am Soc Mass Spectrom. 2016 Aug;27(8):1357-65. doi: 10.1007/s13361-016-1408-y. Epub 2016 May 20.
Anal Chem. 2006 Aug 15;78(16):5678-84. doi: 10.1021/ac060279n.
4
Using annotated peptide mass spectrum libraries for protein identification.使用带注释的肽质谱库进行蛋白质鉴定。
J Proteome Res. 2006 Aug;5(8):1843-9. doi: 10.1021/pr0602085.
5
PRIMA: peptide robust identification from MS/MS spectra.
J Bioinform Comput Biol. 2006 Feb;4(1):125-38. doi: 10.1142/s0219720006001746.
6
Peptide charge state determination for low-resolution tandem mass spectra.低分辨率串联质谱的肽电荷态测定
Proc IEEE Comput Syst Bioinform Conf. 2005:175-85. doi: 10.1109/csb.2005.44.
7
Identification of post-translational modifications by blind search of mass spectra.通过对质谱进行盲目搜索来鉴定翻译后修饰。
Nat Biotechnol. 2005 Dec;23(12):1562-7. doi: 10.1038/nbt1168. Epub 2005 Nov 27.
8
NovoHMM: a hidden Markov model for de novo peptide sequencing.NovoHMM:一种用于从头肽测序的隐马尔可夫模型。
Anal Chem. 2005 Nov 15;77(22):7265-73. doi: 10.1021/ac0508853.
9
Peptide sequence tags for fast database search in mass-spectrometry.用于质谱中快速数据库搜索的肽序列标签
J Proteome Res. 2005 Jul-Aug;4(4):1287-95. doi: 10.1021/pr050011x.
10
DeNovoID: a web-based tool for identifying peptides from sequence and mass tags deduced from de novo peptide sequencing by mass spectroscopy.DeNovoID:一种基于网络的工具,用于从通过质谱从头肽测序推导的序列和质量标签中鉴定肽段。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W376-81. doi: 10.1093/nar/gki461.