使用蒙特卡洛k置换诱饵数据库对神经肽鉴定结果进行准确的显著性赋值。

Accurate assignment of significance to neuropeptide identifications using Monte Carlo k-permuted decoy databases.

作者信息

Akhtar Malik N, Southey Bruce R, Andrén Per E, Sweedler Jonathan V, Rodriguez-Zas Sandra L

机构信息

Department of Animal Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America.

Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden.

出版信息

PLoS One. 2014 Oct 17;9(10):e111112. doi: 10.1371/journal.pone.0111112. eCollection 2014.

DOI:10.1371/journal.pone.0111112

PMID:25329667

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4201571/

Abstract

In support of accurate neuropeptide identification in mass spectrometry experiments, novel Monte Carlo permutation testing was used to compute significance values. Testing was based on k-permuted decoy databases, where k denotes the number of permutations. These databases were integrated with a range of peptide identification indicators from three popular open-source database search software (OMSSA, Crux, and X! Tandem) to assess the statistical significance of neuropeptide spectra matches. Significance p-values were computed as the fraction of the sequences in the database with match indicator value better than or equal to the true target spectra. When applied to a test-bed of all known manually annotated mouse neuropeptides, permutation tests with k-permuted decoy databases identified up to 100% of the neuropeptides at p-value < 10(-5). The permutation test p-values using hyperscore (X! Tandem), E-value (OMSSA) and Sp score (Crux) match indicators outperformed all other match indicators. The robust performance to detect peptides of the intuitive indicator "number of matched ions between the experimental and theoretical spectra" highlights the importance of considering this indicator when the p-value was borderline significant. Our findings suggest permutation decoy databases of size 1×105 are adequate to accurately detect neuropeptides and this can be exploited to increase the speed of the search. The straightforward Monte Carlo permutation testing (comparable to a zero order Markov model) can be easily combined with existing peptide identification software to enable accurate and effective neuropeptide detection. The source code is available at http://stagbeetle.animal.uiuc.edu/pepshop/MSMSpermutationtesting.

摘要

为支持质谱实验中神经肽的准确鉴定，采用了新型蒙特卡罗置换检验来计算显著性值。检验基于k重排列的诱饵数据库，其中k表示排列数。这些数据库与来自三种流行的开源数据库搜索软件（OMSSA、Crux和X! Tandem）的一系列肽段鉴定指标相结合，以评估神经肽谱匹配的统计显著性。显著性p值计算为数据库中匹配指标值优于或等于真实目标谱的序列比例。当应用于所有已知的手动注释小鼠神经肽测试平台时，使用k重排列诱饵数据库的置换检验在p值<10^(-5)时可鉴定出高达100%的神经肽。使用超得分（X! Tandem）、E值（OMSSA）和Sp得分（Crux）匹配指标的置换检验p值优于所有其他匹配指标。直观指标“实验光谱与理论光谱之间匹配离子数”检测肽段的稳健性能突出了在p值临界显著时考虑该指标的重要性。我们的研究结果表明，大小为1×10^5的置换诱饵数据库足以准确检测神经肽，并且可以利用这一点来提高搜索速度。直接的蒙特卡罗置换检验（类似于零阶马尔可夫模型）可以轻松地与现有的肽段鉴定软件相结合，以实现准确有效的神经肽检测。源代码可在http://stagbeetle.animal.uiuc.edu/pepshop/MSMSpermutationtesting获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5829/4201571/dcd066c34d71/pone.0111112.g001.jpg

相似文献

Accurate assignment of significance to neuropeptide identifications using Monte Carlo k-permuted decoy databases.

PLoS One. 2014 Oct 17;9(10):e111112. doi: 10.1371/journal.pone.0111112. eCollection 2014.

Identification of best indicators of peptide-spectrum match using a permutation resampling approach.

J Bioinform Comput Biol. 2014 Oct;12(5):1440001. doi: 10.1142/S0219720014400010.

Evaluation of database search programs for accurate detection of neuropeptides in tandem mass spectrometry experiments.

J Proteome Res. 2012 Dec 7;11(12):6044-55. doi: 10.1021/pr3007123. Epub 2012 Nov 6.

Rapid and accurate peptide identification from tandem mass spectra.

J Proteome Res. 2008 Jul;7(7):3022-7. doi: 10.1021/pr800127y. Epub 2008 May 28.

Analysis of the resolution limitations of peptide identification algorithms.

J Proteome Res. 2011 Dec 2;10(12):5555-61. doi: 10.1021/pr200913a. Epub 2011 Oct 26.

Assigning spectrum-specific P-values to protein identifications by mass spectrometry.

Bioinformatics. 2011 Apr 15;27(8):1128-34. doi: 10.1093/bioinformatics/btr089. Epub 2011 Feb 23.

EndoGenius: Optimized Neuropeptide Identification from Mass Spectrometry Datasets.

J Proteome Res. 2024 Aug 2;23(8):3041-3051. doi: 10.1021/acs.jproteome.3c00758. Epub 2024 Mar 1.

In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.

J Proteomics. 2017 Jan 6;150:170-182. doi: 10.1016/j.jprot.2016.08.002. Epub 2016 Aug 4.

Artificial decoy spectral libraries for false discovery rate estimation in spectral library searching in proteomics.

J Proteome Res. 2010 Jan;9(1):605-10. doi: 10.1021/pr900947u.

NeuroPedia: neuropeptide database and spectral library.

Bioinformatics. 2011 Oct 1;27(19):2772-3. doi: 10.1093/bioinformatics/btr445. Epub 2011 Aug 5.

引用本文的文献

Neuropeptidomics Mass Spectrometry Reveals Signaling Networks Generated by Distinct Protease Pathways in Human Systems.

J Am Soc Mass Spectrom. 2015 Dec;26(12):1970-80. doi: 10.1007/s13361-015-1251-6. Epub 2015 Oct 19.

Peptidomics for the discovery and characterization of neuropeptides and hormones.

Trends Pharmacol Sci. 2015 Sep;36(9):579-86. doi: 10.1016/j.tips.2015.05.009. Epub 2015 Jul 1.

本文引用的文献

Comparing label-free quantitative peptidomics approaches to characterize diurnal variation of peptides in the rat suprachiasmatic nucleus.

Anal Chem. 2014 Jan 7;86(1):443-52. doi: 10.1021/ac4023378. Epub 2013 Dec 16.

A multi-scale strategy for discovery of novel endogenous neuropeptides in the crustacean nervous system.

J Proteomics. 2013 Oct 8;91:1-12. doi: 10.1016/j.jprot.2013.06.021. Epub 2013 Jun 24.

Profiling of diet-induced neuropeptide changes in rat brain by quantitative mass spectrometry.

Anal Chem. 2013 May 7;85(9):4594-604. doi: 10.1021/ac400232y. Epub 2013 Apr 26.

Evaluation of database search programs for accurate detection of neuropeptides in tandem mass spectrometry experiments.

J Proteome Res. 2012 Dec 7;11(12):6044-55. doi: 10.1021/pr3007123. Epub 2012 Nov 6.

High identification rates of endogenous neuropeptides from mouse brain.

J Proteome Res. 2012 May 4;11(5):2819-27. doi: 10.1021/pr3001699. Epub 2012 Mar 30.

Probing the production of amidated peptides following genetic and dietary copper manipulations.

PLoS One. 2011;6(12):e28679. doi: 10.1371/journal.pone.0028679. Epub 2011 Dec 16.

RAId_aPS: MS/MS analysis with multiple scoring functions and spectrum-specific statistics.

PLoS One. 2010 Nov 16;5(11):e15438. doi: 10.1371/journal.pone.0015438.

The generating function of CID, ETD, and CID/ETD pairs of tandem mass spectra: applications to database search.

Mol Cell Proteomics. 2010 Dec;9(12):2840-52. doi: 10.1074/mcp.M110.003731. Epub 2010 Sep 9.

A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.

J Proteomics. 2010 Oct 10;73(11):2092-123. doi: 10.1016/j.jprot.2010.08.009. Epub 2010 Sep 8.

The zebra finch neuropeptidome: prediction, detection and expression.

BMC Biol. 2010 Apr 1;8:28. doi: 10.1186/1741-7007-8-28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用蒙特卡洛k置换诱饵数据库对神经肽鉴定结果进行准确的显著性赋值。

Accurate assignment of significance to neuropeptide identifications using Monte Carlo k-permuted decoy databases.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献