用于 MS/MS 谱图计算机辅助注释的专家系统。

Expert system for computer-assisted annotation of MS/MS spectra.

机构信息

Department of Proteomics and Signal Transduction, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, D-82152 Martinsried, Germany.

出版信息

Mol Cell Proteomics. 2012 Nov;11(11):1500-9. doi: 10.1074/mcp.M112.020271. Epub 2012 Aug 10.

DOI:10.1074/mcp.M112.020271

PMID:22888147

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3494176/

Abstract

An important step in mass spectrometry (MS)-based proteomics is the identification of peptides by their fragment spectra. Regardless of the identification score achieved, almost all tandem-MS (MS/MS) spectra contain remaining peaks that are not assigned by the search engine. These peaks may be explainable by human experts but the scale of modern proteomics experiments makes this impractical. In computer science, Expert Systems are a mature technology to implement a list of rules generated by interviews with practitioners. We here develop such an Expert System, making use of literature knowledge as well as a large body of high mass accuracy and pure fragmentation spectra. Interestingly, we find that even with high mass accuracy data, rule sets can quickly become too complex, leading to over-annotation. Therefore we establish a rigorous false discovery rate, calculated by random insertion of peaks from a large collection of other MS/MS spectra, and use it to develop an optimized knowledge base. This rule set correctly annotates almost all peaks of medium or high abundance. For high resolution HCD data, median intensity coverage of fragment peaks in MS/MS spectra increases from 58% by search engine annotation alone to 86%. The resulting annotation performance surpasses a human expert, especially on complex spectra such as those of larger phosphorylated peptides. Our system is also applicable to high resolution collision-induced dissociation data. It is available both as a part of MaxQuant and via a webserver that only requires an MS/MS spectrum and the corresponding peptides sequence, and which outputs publication quality, annotated MS/MS spectra (www.biochem.mpg.de/mann/tools/). It provides expert knowledge to beginners in the field of MS-based proteomics and helps advanced users to focus on unusual and possibly novel types of fragment ions.

摘要

基于质谱（MS）的蛋白质组学的一个重要步骤是通过其片段谱鉴定肽。无论达到的鉴定分数如何，几乎所有串联-MS（MS/MS）谱都包含未被搜索引擎分配的剩余峰。这些峰可能可以由人类专家解释，但现代蛋白质组学实验的规模使得这变得不切实际。在计算机科学中，专家系统是一种实现由从业者访谈生成的规则列表的成熟技术。我们在此开发了这样的专家系统，利用文献知识以及大量高质量和纯片段谱。有趣的是，我们发现，即使使用高质量精度数据，规则集也可能很快变得过于复杂，导致过度注释。因此，我们建立了一个严格的错误发现率，通过从大量其他 MS/MS 谱中随机插入峰来计算，并使用它来开发一个优化的知识库。这个规则集可以正确注释中等或高丰度的几乎所有峰。对于高分辨率 HCD 数据，仅通过搜索引擎注释，MS/MS 谱中片段峰的中值强度覆盖率从 58%增加到 86%。由此产生的注释性能超过了人类专家，尤其是对于较大磷酸化肽等复杂谱。我们的系统也适用于高分辨率碰撞诱导解离数据。它既可以作为 MaxQuant 的一部分，也可以通过一个仅需要 MS/MS 谱和相应肽序列的网络服务器使用，该服务器输出具有出版质量的注释 MS/MS 谱（www.biochem.mpg.de/mann/tools/）。它为基于 MS 的蛋白质组学领域的初学者提供了专家知识，并帮助高级用户专注于不寻常且可能是新型的片段离子。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e4/3494176/1c431546facc/zjw0111242830001.jpg

相似文献

Expert system for computer-assisted annotation of MS/MS spectra.用于 MS/MS 谱图计算机辅助注释的专家系统。

Mol Cell Proteomics. 2012 Nov;11(11):1500-9. doi: 10.1074/mcp.M112.020271. Epub 2012 Aug 10.

A systematic investigation into the nature of tryptic HCD spectra.系统研究胰蛋白酶 HCD 谱的本质。

J Proteome Res. 2012 Nov 2;11(11):5479-91. doi: 10.1021/pr3007045. Epub 2012 Oct 10.

Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?评估蛋白质组学中的从头测序：是否已经成为数据库驱动肽鉴定的准确替代方法？

Brief Bioinform. 2018 Sep 28;19(5):954-970. doi: 10.1093/bib/bbx033.

MS Amanda, a universal identification algorithm optimized for high accuracy tandem mass spectra.阿曼达质谱（MS Amanda），一种针对高精度串联质谱进行优化的通用识别算法。

J Proteome Res. 2014 Aug 1;13(8):3679-84. doi: 10.1021/pr500202e. Epub 2014 Jun 26.

pNovo: de novo peptide sequencing and identification using HCD spectra.pNovo：利用 HCD 谱进行从头多肽测序和鉴定。

J Proteome Res. 2010 May 7;9(5):2713-24. doi: 10.1021/pr100182k.

Chemical rule-based filtering of MS/MS spectra.基于化学规则的 MS/MS 光谱过滤。

Bioinformatics. 2013 Apr 1;29(7):925-32. doi: 10.1093/bioinformatics/btt061. Epub 2013 Feb 15.

Andromeda: a peptide search engine integrated into the MaxQuant environment.Andromeda：集成到 MaxQuant 环境中的肽搜索引擎。

J Proteome Res. 2011 Apr 1;10(4):1794-805. doi: 10.1021/pr101065j. Epub 2011 Feb 22.

A novel approach for untargeted post-translational modification identification using integer linear optimization and tandem mass spectrometry.一种利用整数线性优化和串联质谱进行非靶向翻译后修饰鉴定的新方法。

Mol Cell Proteomics. 2010 May;9(5):764-79. doi: 10.1074/mcp.M900487-MCP200. Epub 2010 Jan 26.

Optimization of Search Engines and Postprocessing Approaches to Maximize Peptide and Protein Identification for High-Resolution Mass Data.优化搜索引擎和后处理方法以最大化高分辨率质谱数据的肽段和蛋白质鉴定

J Proteome Res. 2015 Nov 6;14(11):4662-73. doi: 10.1021/acs.jproteome.5b00536. Epub 2015 Sep 30.

Unassigned MS/MS Spectra: Who Am I?未分配的串联质谱图：我是谁？

Methods Mol Biol. 2017;1549:67-74. doi: 10.1007/978-1-4939-6740-7_6.

引用本文的文献

MS Ana: Improving Sensitivity in Peptide Identification with Spectral Library Search.MS Ana：通过谱库检索提高肽段鉴定的灵敏度。

J Proteome Res. 2023 Feb 3;22(2):462-470. doi: 10.1021/acs.jproteome.2c00658. Epub 2023 Jan 23.

Effect of Insulin and Pioglitazone on Protein Phosphatase 2A Interaction Partners in Primary Human Skeletal Muscle Cells Derived from Obese Insulin-Resistant Participants.胰岛素和吡格列酮对源自肥胖胰岛素抵抗参与者的原代人骨骼肌细胞中蛋白磷酸酶2A相互作用伙伴的影响。

ACS Omega. 2022 Nov 15;7(47):42763-42773. doi: 10.1021/acsomega.2c04473. eCollection 2022 Nov 29.

Identification of Isopeptides Between Human Tissue Transglutaminase and Wheat, Rye, and Barley Gluten Peptides.人组织转谷氨酰胺酶与小麦、黑麦和大麦麸质肽之间异肽的鉴定。

Sci Rep. 2020 May 4;10(1):7426. doi: 10.1038/s41598-020-64143-9.

Comprehensive Detection of Isopeptides between Human Tissue Transglutaminase and Gluten Peptides.全面检测人组织转谷氨酰胺酶与谷胶肽之间的同型肽。

Nutrients. 2019 Sep 20;11(10):2263. doi: 10.3390/nu11102263.

MaxQuant.Live Enables Global Targeting of More Than 25,000 Peptides.MaxQuant.Live 实现了全球范围内 25000 多种肽段的靶向分析。

Mol Cell Proteomics. 2019 May;18(5):982-994. doi: 10.1074/mcp.TIR118.001131. Epub 2019 Feb 12.

In-depth proteomic analyses of (greenlip abalone) nacre and prismatic organic shell matrix.对（绿唇鲍）珍珠层和棱柱形有机贝壳基质进行深入的蛋白质组学分析。

Proteome Sci. 2018 Jun 15;16:11. doi: 10.1186/s12953-018-0139-3. eCollection 2018.

ProteomicsDB.蛋白质组数据库。

Nucleic Acids Res. 2018 Jan 4;46(D1):D1271-D1281. doi: 10.1093/nar/gkx1029.

Increased serotransferrin and ceruloplasmin turnover in diet-controlled patients with type 2 diabetes.2 型糖尿病患者经饮食控制后，转铁蛋白和铜蓝蛋白周转率增加。

Free Radic Biol Med. 2017 Dec;113:461-469. doi: 10.1016/j.freeradbiomed.2017.10.373. Epub 2017 Oct 25.

Multiplexed Temporal Quantification of the Exercise-regulated Plasma Peptidome.多指标时间分辨的运动调节血浆肽组学研究。

Mol Cell Proteomics. 2017 Dec;16(12):2055-2068. doi: 10.1074/mcp.RA117.000020. Epub 2017 Oct 5.

The MaxQuant computational platform for mass spectrometry-based shotgun proteomics.MaxQuant 计算平台用于基于质谱的鸟枪法蛋白质组学。

Nat Protoc. 2016 Dec;11(12):2301-2319. doi: 10.1038/nprot.2016.136. Epub 2016 Oct 27.

本文引用的文献

Ultra high resolution linear ion trap Orbitrap mass spectrometer (Orbitrap Elite) facilitates top down LC MS/MS and versatile peptide fragmentation modes.超高分辨率线性离子阱轨道阱质谱仪（Orbitrap Elite）可实现自上而下的 LC-MS/MS 和多种肽片段化模式。

Mol Cell Proteomics. 2012 Mar;11(3):O111.013698. doi: 10.1074/mcp.O111.013698. Epub 2011 Dec 9.

De novo sequencing and homology searching.从头测序和同源搜索。

Mol Cell Proteomics. 2012 Feb;11(2):O111.014902. doi: 10.1074/mcp.O111.014902. Epub 2011 Nov 16.

Mass spectrometry-based proteomics using Q Exactive, a high-performance benchtop quadrupole Orbitrap mass spectrometer.基于 Q Exactive 的质谱蛋白质组学，Q Exactive 是一种高性能台式四极轨道阱质谱仪。

Mol Cell Proteomics. 2011 Sep;10(9):M111.011015. doi: 10.1074/mcp.M111.011015. Epub 2011 Jun 3.

Pinpointing phosphorylation sites: Quantitative filtering and a novel site-specific x-ion fragment.精确定位磷酸化位点：定量过滤和新型的位点特异性 x 离子片段。

J Proteome Res. 2011 Jul 1;10(7):2937-48. doi: 10.1021/pr200154t. Epub 2011 Apr 28.

Quality assessments of peptide-spectrum matches in shotgun proteomics.肽谱匹配在鸟枪法蛋白质组学中的质量评估。

Proteomics. 2011 Mar;11(6):1086-93. doi: 10.1002/pmic.201000432. Epub 2011 Feb 7.

More than 100,000 detectable peptide species elute in single shotgun proteomics runs but the majority is inaccessible to data-dependent LC-MS/MS.在单次鸟枪法蛋白质组学运行中可洗脱超过 100,000 种可检测肽，但大多数肽是无法通过基于数据的 LC-MS/MS 获得的。

J Proteome Res. 2011 Apr 1;10(4):1785-93. doi: 10.1021/pr101060v. Epub 2011 Feb 28.

Andromeda: a peptide search engine integrated into the MaxQuant environment.Andromeda：集成到 MaxQuant 环境中的肽搜索引擎。

J Proteome Res. 2011 Apr 1;10(4):1794-805. doi: 10.1021/pr101065j. Epub 2011 Feb 22.

Quantifying the impact of chimera MS/MS spectra on peptide identification in large-scale proteomics studies.定量分析嵌合体 MS/MS 谱图对大规模蛋白质组学研究中肽段鉴定的影响。

J Proteome Res. 2010 Aug 6;9(8):4152-60. doi: 10.1021/pr1003856.

Deconvolution of mixture spectra from ion-trap data-independent-acquisition tandem mass spectrometry.从离子阱数据非依赖采集串联质谱中解卷积混合物谱。

Anal Chem. 2010 Feb 1;82(3):833-41. doi: 10.1021/ac901801b.

A dual pressure linear ion trap Orbitrap instrument with very high sequencing speed.一款具有极高测序速度的双压线性离子阱轨道阱仪器。

Mol Cell Proteomics. 2009 Dec;8(12):2759-69. doi: 10.1074/mcp.M900375-MCP200. Epub 2009 Oct 14.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于 MS/MS 谱图计算机辅助注释的专家系统。

Expert system for computer-assisted annotation of MS/MS spectra.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献