Suppr超能文献

通过分析多个搜索引擎的错误发现率提高蛋白质组学研究的灵敏度。

Improving sensitivity in proteome studies by analysis of false discovery rates for multiple search engines.

作者信息

Jones Andrew R, Siepen Jennifer A, Hubbard Simon J, Paton Norman W

机构信息

Department of Preclinical Veterinary Science, Faculty of Veterinary Science, University of Liverpool, Liverpool, UK.

出版信息

Proteomics. 2009 Mar;9(5):1220-9. doi: 10.1002/pmic.200800473.

Abstract

LC-MS experiments can generate large quantities of data, for which a variety of database search engines are available to make peptide and protein identifications. Decoy databases are becoming widely used to place statistical confidence in result sets, allowing the false discovery rate (FDR) to be estimated. Different search engines produce different identification sets so employing more than one search engine could result in an increased number of peptides (and proteins) being identified, if an appropriate mechanism for combining data can be defined. We have developed a search engine independent score, based on FDR, which allows peptide identifications from different search engines to be combined, called the FDR Score. The results demonstrate that the observed FDR is significantly different when analysing the set of identifications made by all three search engines, by each pair of search engines or by a single search engine. Our algorithm assigns identifications to groups according to the set of search engines that have made the identification, and re-assigns the score (combined FDR Score). The combined FDR Score can differentiate between correct and incorrect peptide identifications with high accuracy, allowing on average 35% more peptide identifications to be made at a fixed FDR than using a single search engine.

摘要

液相色谱-质谱联用(LC-MS)实验能够产生大量数据,针对这些数据有多种数据库搜索引擎可用于进行肽段和蛋白质鉴定。反向数据库正被广泛用于对结果集进行统计学置信度评估,从而能够估计错误发现率(FDR)。不同的搜索引擎会产生不同的鉴定集,因此如果能够定义一种合适的数据合并机制,使用多个搜索引擎可能会使鉴定出的肽段(和蛋白质)数量增加。我们基于错误发现率开发了一种独立于搜索引擎的评分方法,它能够将来自不同搜索引擎的肽段鉴定结果进行合并,称为错误发现率评分(FDR评分)。结果表明,在分析由所有三个搜索引擎、每对搜索引擎或单个搜索引擎做出的鉴定集时,观察到的错误发现率存在显著差异。我们的算法根据做出鉴定的搜索引擎集将鉴定结果分配到不同组,并重新分配评分(合并后的错误发现率评分)。合并后的错误发现率评分能够以高精度区分正确和错误的肽段鉴定结果,与使用单个搜索引擎相比,在固定的错误发现率下平均能够多鉴定出35%的肽段。

相似文献

5
Analysis of the resolution limitations of peptide identification algorithms.分析肽鉴定算法的分辨率限制。
J Proteome Res. 2011 Dec 2;10(12):5555-61. doi: 10.1021/pr200913a. Epub 2011 Oct 26.
9
False discovery rates in spectral identification.光谱识别中的假发现率。
BMC Bioinformatics. 2012;13 Suppl 16(Suppl 16):S2. doi: 10.1186/1471-2105-13-S16-S2. Epub 2012 Nov 5.

引用本文的文献

8
Enhancing Open Modification Searches via a Combined Approach Facilitated by Ursgal.通过 Ursgal 辅助的联合方法增强开放修饰搜索。
J Proteome Res. 2021 Apr 2;20(4):1986-1996. doi: 10.1021/acs.jproteome.0c00799. Epub 2021 Jan 29.

本文引用的文献

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验