• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于理论质谱比较的开放检索方法评估。

Evaluation of open search methods based on theoretical mass spectra comparison.

机构信息

CNRS, LS2N, Université de Nantes, 44000, Nantes, France.

UR BIA, INRAE, 44316, Nantes, France.

出版信息

BMC Bioinformatics. 2021 Apr 26;22(Suppl 2):65. doi: 10.1186/s12859-021-03963-6.

DOI:10.1186/s12859-021-03963-6
PMID:33902435
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8073971/
Abstract

BACKGROUND

Mass spectrometry remains the privileged method to characterize proteins. Nevertheless, most of the spectra generated by an experiment remain unidentified after their analysis, mostly because of the modifications they carry. Open Modification Search (OMS) methods offer a promising answer to this problem. However, assessing the quality of OMS identifications remains a difficult task.

METHODS

Aiming at better understanding the relationship between (1) similarity of pairs of spectra provided by OMS methods and (2) relevance of their corresponding peptide sequences, we used a dataset composed of theoretical spectra only, on which we applied two OMS strategies. We also introduced two appropriately defined measures for evaluating the above mentioned spectra/sequence relevance in this context: one is a color classification representing the level of difficulty to retrieve the proper sequence of the peptide that generated the identified spectrum ; the other, called LIPR, is the proportion of common masses, in a given Peptide Spectrum Match (PSM), that represent dissimilar sequences. These two measures were also considered in conjunction with the False Discovery Rate (FDR).

RESULTS

According to our measures, the strategy that selects the best candidate by taking the mass difference between two spectra into account yields better quality results. Besides, although the FDR remains an interesting indicator in OMS methods (as shown by LIPR), it is questionable: indeed, our color classification shows that a non negligible proportion of relevant spectra/sequence interpretations corresponds to PSMs coming from the decoy database.

CONCLUSIONS

The three above mentioned measures allowed us to clearly determine which of the two studied OMS strategies outperformed the other, both in terms of number of identifications and of accuracy of these identifications. Even though quality evaluation of PSMs in OMS methods remains challenging, the study of theoretical spectra is a favorable framework for going further in this direction.

摘要

背景

质谱仍然是鉴定蛋白质的首选方法。然而,大多数实验生成的光谱在分析后仍然无法识别,主要是因为它们携带的修饰。开放修饰搜索 (OMS) 方法为此问题提供了一个有前途的答案。然而,评估 OMS 鉴定的质量仍然是一项艰巨的任务。

方法

为了更好地理解 (1) OMS 方法提供的一对光谱之间的相似性和 (2) 它们对应的肽序列的相关性,我们仅使用理论光谱数据集,在该数据集中应用了两种 OMS 策略。我们还引入了两个适当定义的度量标准来评估上述光谱/序列相关性:一个是颜色分类,代表检索生成鉴定光谱的肽的正确序列的难度级别;另一个称为 LIPR,是给定肽谱匹配 (PSM) 中代表不同序列的共同质量的比例。这两个度量标准也与错误发现率 (FDR) 一起考虑。

结果

根据我们的度量标准,考虑两个光谱之间质量差异来选择最佳候选的策略产生了更好质量的结果。此外,尽管 FDR 仍然是 OMS 方法中的一个有趣指标(如 LIPR 所示),但它值得怀疑:实际上,我们的颜色分类表明,大量相关的光谱/序列解释对应于来自诱饵数据库的 PSM。

结论

上述三个度量标准使我们能够清楚地确定两种研究 OMS 策略中的哪一种优于另一种,无论是在鉴定数量还是鉴定准确性方面。尽管 OMS 方法中 PSM 的质量评估仍然具有挑战性,但对理论光谱的研究是朝着这个方向进一步发展的有利框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/17f468110500/12859_2021_3963_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/aa5f69427242/12859_2021_3963_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/f188e6567251/12859_2021_3963_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/23f9443996fa/12859_2021_3963_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/b71e2030a0e1/12859_2021_3963_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/726075a4b894/12859_2021_3963_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/a5eaf877730f/12859_2021_3963_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/77173134df8c/12859_2021_3963_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/17f468110500/12859_2021_3963_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/aa5f69427242/12859_2021_3963_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/f188e6567251/12859_2021_3963_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/23f9443996fa/12859_2021_3963_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/b71e2030a0e1/12859_2021_3963_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/726075a4b894/12859_2021_3963_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/a5eaf877730f/12859_2021_3963_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/77173134df8c/12859_2021_3963_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd9/8073971/17f468110500/12859_2021_3963_Fig8_HTML.jpg

相似文献

1
Evaluation of open search methods based on theoretical mass spectra comparison.基于理论质谱比较的开放检索方法评估。
BMC Bioinformatics. 2021 Apr 26;22(Suppl 2):65. doi: 10.1186/s12859-021-03963-6.
2
Fast alignment of mass spectra in large proteomics datasets, capturing dissimilarities arising from multiple complex modifications of peptides.在大型蛋白质组学数据集快速对齐质谱,捕获肽的多种复杂修饰引起的差异。
BMC Bioinformatics. 2023 Nov 8;24(1):421. doi: 10.1186/s12859-023-05555-y.
3
Two-dimensional target decoy strategy for shotgun proteomics. shotgun 蛋白质组学的二维靶标诱饵策略。
J Proteome Res. 2011 Dec 2;10(12):5296-301. doi: 10.1021/pr200780j. Epub 2011 Nov 7.
4
False discovery rate estimation using candidate peptides for each spectrum.使用每个谱图的候选肽进行错误发现率估计。
BMC Bioinformatics. 2022 Nov 1;23(1):454. doi: 10.1186/s12859-022-05002-4.
5
An improved method for the construction of decoy peptide MS/MS spectra suitable for the accurate estimation of false discovery rates.一种改进的适用于准确估计假发现率的诱饵肽 MS/MS 谱构建方法。
Proteomics. 2011 Oct;11(20):4085-95. doi: 10.1002/pmic.201000665. Epub 2011 Sep 7.
6
False Discovery Rate Estimation for Hybrid Mass Spectral Library Search Identifications in Bottom-up Proteomics.用于 Bottom-up 蛋白质组学中混合质谱文库搜索鉴定的假发现率估计。
J Proteome Res. 2019 Sep 6;18(9):3223-3234. doi: 10.1021/acs.jproteome.8b00863. Epub 2019 Aug 14.
7
Common Decoy Distributions Simplify False Discovery Rate Estimation in Shotgun Proteomics.通用诱饵分布简化了鸟枪法蛋白质组学中的错误发现率估计
J Proteome Res. 2022 Feb 4;21(2):339-348. doi: 10.1021/acs.jproteome.1c00600. Epub 2022 Jan 6.
8
Fast Open Modification Spectral Library Searching through Approximate Nearest Neighbor Indexing.快速开放修改谱库搜索通过近似最近邻索引。
J Proteome Res. 2018 Oct 5;17(10):3463-3474. doi: 10.1021/acs.jproteome.8b00359. Epub 2018 Sep 13.
9
MUMAL2: Improving sensitivity in shotgun proteomics using cost sensitive artificial neural networks and a threshold selector algorithm.MUMAL2:使用成本敏感型人工神经网络和阈值选择算法提高鸟枪法蛋白质组学的灵敏度
BMC Bioinformatics. 2016 Dec 15;17(Suppl 18):472. doi: 10.1186/s12859-016-1341-x.
10
Crescendo: A Protein Sequence Database Search Engine for Tandem Mass Spectra.Crescendo:一种用于串联质谱的蛋白质序列数据库搜索引擎。
J Am Soc Mass Spectrom. 2015 Jul;26(7):1077-84. doi: 10.1007/s13361-015-1120-3. Epub 2015 Apr 21.

引用本文的文献

1
LibGen: Generating High Quality Spectral Libraries of Natural Products for EAD-, UVPD-, and HCD-High Resolution Mass Spectrometers.LibGen:为电子捕获解离(EAD)、紫外光解离(UVPD)和高能碰撞解离(HCD)高分辨率质谱仪生成高质量天然产物光谱库。
Anal Chem. 2023 Nov 21;95(46):16810-16818. doi: 10.1021/acs.analchem.3c02263. Epub 2023 Nov 8.
2
Spatially Resolved Top-Down Proteomics of Tissue Sections Based on a Microfluidic Nanodroplet Sample Preparation Platform.基于微流控纳升级样制备平台的组织切片的空间分辨自上而下蛋白质组学。
Mol Cell Proteomics. 2023 Feb;22(2):100491. doi: 10.1016/j.mcpro.2022.100491. Epub 2023 Jan 2.

本文引用的文献

1
Ensembl 2020.Ensembl 2020.
Nucleic Acids Res. 2020 Jan 8;48(D1):D682-D688. doi: 10.1093/nar/gkz966.
2
Extremely Fast and Accurate Open Modification Spectral Library Searching of High-Resolution Mass Spectra Using Feature Hashing and Graphics Processing Units.使用特征哈希和图形处理单元进行高分辨率质谱的极快速准确开放修饰谱库搜索。
J Proteome Res. 2019 Oct 4;18(10):3792-3799. doi: 10.1021/acs.jproteome.9b00291. Epub 2019 Aug 30.
3
MODplus: Robust and Unrestrictive Identification of Post-Translational Modifications Using Mass Spectrometry.
MODplus:使用质谱法对翻译后修饰进行稳健且无限制的鉴定
Anal Chem. 2019 Sep 3;91(17):11324-11333. doi: 10.1021/acs.analchem.9b02445. Epub 2019 Aug 14.
4
TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry datasets.TagGraph 揭示了来自大型串联质谱数据集的广泛蛋白质修饰图谱。
Nat Biotechnol. 2019 Apr;37(4):469-479. doi: 10.1038/s41587-019-0067-5. Epub 2019 Apr 1.
5
Comprehensive identification of peptides in tandem mass spectra using an efficient open search engine.使用高效的开放式搜索引擎对串联质谱中的肽段进行全面鉴定。
Nat Biotechnol. 2018 Oct 8. doi: 10.1038/nbt.4236.
6
Fast Open Modification Spectral Library Searching through Approximate Nearest Neighbor Indexing.快速开放修改谱库搜索通过近似最近邻索引。
J Proteome Res. 2018 Oct 5;17(10):3463-3474. doi: 10.1021/acs.jproteome.8b00359. Epub 2018 Sep 13.
7
Enhanced Global Post-translational Modification Discovery with MetaMorpheus.MetaMorpheus 增强全局翻译后修饰发现。
J Proteome Res. 2018 May 4;17(5):1844-1851. doi: 10.1021/acs.jproteome.7b00873. Epub 2018 Apr 2.
8
Mass spectrometrists should search for all peptides, but assess only the ones they care about.质谱分析人员应搜索所有肽段,但只评估他们关注的那些。
Nat Methods. 2017 Jun 29;14(7):643-644. doi: 10.1038/nmeth.4338.
9
SpecOMS: A Full Open Modification Search Method Performing All-to-All Spectra Comparisons within Minutes.SpecOMS:一种全开放式的修饰谱检索方法,可在数分钟内完成全谱比较。
J Proteome Res. 2017 Aug 4;16(8):3030-3038. doi: 10.1021/acs.jproteome.7b00308. Epub 2017 Jul 11.
10
MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics.MSFragger:基于质谱的蛋白质组学中实现超快速且全面的肽段鉴定
Nat Methods. 2017 May;14(5):513-520. doi: 10.1038/nmeth.4256. Epub 2017 Apr 10.