• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PAnalyzer:一种用于鸟枪法蛋白质组学中蛋白质推断的软件工具。

PAnalyzer: a software tool for protein inference in shotgun proteomics.

机构信息

Department of Communications Engineering, University of the Basque Country (UPV/EHU), Alda, Urquijo s/n, Bilbao, 48013, Spain.

出版信息

BMC Bioinformatics. 2012 Nov 5;13:288. doi: 10.1186/1471-2105-13-288.

DOI:10.1186/1471-2105-13-288
PMID:23126499
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3548767/
Abstract

BACKGROUND

Protein inference from peptide identifications in shotgun proteomics must deal with ambiguities that arise due to the presence of peptides shared between different proteins, which is common in higher eukaryotes. Recently data independent acquisition (DIA) approaches have emerged as an alternative to the traditional data dependent acquisition (DDA) in shotgun proteomics experiments. MSE is the term used to name one of the DIA approaches used in QTOF instruments. MSE data require specialized software to process acquired spectra and to perform peptide and protein identifications. However the software available at the moment does not group the identified proteins in a transparent way by taking into account peptide evidence categories. Furthermore the inspection, comparison and report of the obtained results require tedious manual intervention. Here we report a software tool to address these limitations for MSE data.

RESULTS

In this paper we present PAnalyzer, a software tool focused on the protein inference process of shotgun proteomics. Our approach considers all the identified proteins and groups them when necessary indicating their confidence using different evidence categories. PAnalyzer can read protein identification files in the XML output format of the ProteinLynx Global Server (PLGS) software provided by Waters Corporation for their MSE data, and also in the mzIdentML format recently standardized by HUPO-PSI. Multiple files can also be read simultaneously and are considered as technical replicates. Results are saved to CSV, HTML and mzIdentML (in the case of a single mzIdentML input file) files. An MSE analysis of a real sample is presented to compare the results of PAnalyzer and ProteinLynx Global Server.

CONCLUSIONS

We present a software tool to deal with the ambiguities that arise in the protein inference process. Key contributions are support for MSE data analysis by ProteinLynx Global Server and technical replicates integration. PAnalyzer is an easy to use multiplatform and free software tool.

摘要

背景

在 shotgun 蛋白质组学中,从肽鉴定推断蛋白质时必须处理由于不同蛋白质之间存在共享肽而产生的歧义,这在高等真核生物中很常见。最近,数据独立采集 (DIA) 方法已成为 shotgun 蛋白质组学实验中传统数据依赖采集 (DDA) 的替代方法。MSE 是用于命名 QTOF 仪器中使用的 DIA 方法之一的术语。MSE 数据需要专门的软件来处理采集的光谱并执行肽和蛋白质鉴定。然而,目前可用的软件没有考虑肽证据类别,以透明的方式将鉴定的蛋白质分组。此外,获得的结果的检查、比较和报告需要繁琐的手动干预。在这里,我们报告了一种软件工具,用于解决 MSE 数据的这些限制。

结果

在本文中,我们介绍了 PAnalyzer,这是一种专注于 shotgun 蛋白质组学蛋白质推断过程的软件工具。我们的方法考虑了所有鉴定的蛋白质,并在必要时使用不同的证据类别对其进行分组并指示其置信度。PAnalyzer 可以读取 Waters 公司提供的 ProteinLynx Global Server (PLGS) 软件的 XML 输出格式的蛋白质鉴定文件,以及最近由 HUPO-PSI 标准化的 mzIdentML 格式。也可以同时读取多个文件,并将其视为技术重复。结果保存到 CSV、HTML 和 mzIdentML(在单个 mzIdentML 输入文件的情况下)文件中。还呈现了一个真实样品的 MSE 分析,以比较 PAnalyzer 和 ProteinLynx Global Server 的结果。

结论

我们提出了一种软件工具来处理蛋白质推断过程中出现的歧义。主要贡献是支持 ProteinLynx Global Server 的 MSE 数据分析和技术重复集成。PAnalyzer 是一个易于使用的跨平台免费软件工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d18/3548767/4af0ca5a9ceb/1471-2105-13-288-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d18/3548767/605e8d4ca20c/1471-2105-13-288-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d18/3548767/4f36f1dc918a/1471-2105-13-288-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d18/3548767/627ea752647b/1471-2105-13-288-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d18/3548767/a58c1d1a069a/1471-2105-13-288-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d18/3548767/4af0ca5a9ceb/1471-2105-13-288-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d18/3548767/605e8d4ca20c/1471-2105-13-288-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d18/3548767/4f36f1dc918a/1471-2105-13-288-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d18/3548767/627ea752647b/1471-2105-13-288-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d18/3548767/a58c1d1a069a/1471-2105-13-288-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d18/3548767/4af0ca5a9ceb/1471-2105-13-288-5.jpg

相似文献

1
PAnalyzer: a software tool for protein inference in shotgun proteomics.PAnalyzer:一种用于鸟枪法蛋白质组学中蛋白质推断的软件工具。
BMC Bioinformatics. 2012 Nov 5;13:288. doi: 10.1186/1471-2105-13-288.
2
A standardized framing for reporting protein identifications in mzIdentML 1.2.mzIdentML 1.2中用于报告蛋白质鉴定结果的标准化框架。
Proteomics. 2014 Nov;14(21-22):2389-99. doi: 10.1002/pmic.201400080. Epub 2014 Sep 23.
3
Calculation of False Discovery Rate for Peptide and Protein Identification.肽段和蛋白质鉴定的错误发现率计算
Methods Mol Biol. 2020;2051:145-159. doi: 10.1007/978-1-4939-9744-2_6.
4
In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.使用多个搜索引擎和明确的指标对蛋白质推断算法进行深入分析。
J Proteomics. 2017 Jan 6;150:170-182. doi: 10.1016/j.jprot.2016.08.002. Epub 2016 Aug 4.
5
mzIdentML 1.3.0 - Essential progress on the support of crosslinking and other identifications based on multiple spectra.mzIdentML 1.3.0 - 在支持基于多个光谱的交联和其他鉴定方面取得重要进展。
Proteomics. 2024 Sep;24(17):e2300385. doi: 10.1002/pmic.202300385. Epub 2024 Jul 12.
6
jmzReader: A Java parser library to process and visualize multiple text and XML-based mass spectrometry data formats.jmzReader:一个用于处理和可视化多种基于文本和 XML 的质谱数据格式的 Java 解析库。
Proteomics. 2012 Mar;12(6):795-8. doi: 10.1002/pmic.201100578.
7
IPeak: An open source tool to combine results from multiple MS/MS search engines.IPeak:一个用于整合多个串联质谱搜索引擎结果的开源工具。
Proteomics. 2015 Sep;15(17):2916-20. doi: 10.1002/pmic.201400208. Epub 2015 Aug 6.
8
mzIdentML: an open community-built standard format for the results of proteomics spectrum identification algorithms.mzIdentML:一种由社区构建的用于蛋白质组学谱图鉴定算法结果的开放标准格式。
Methods Mol Biol. 2011;696:161-77. doi: 10.1007/978-1-60761-987-1_10.
9
Tools (Viewer, Library and Validator) that facilitate use of the peptide and protein identification standard format, termed mzIdentML.有助于使用肽和蛋白质鉴定标准格式(称为 mzIdentML)的工具(查看器、库和验证器)。
Mol Cell Proteomics. 2013 Nov;12(11):3026-35. doi: 10.1074/mcp.O113.029777. Epub 2013 Jun 28.
10
ProCon - PROteomics CONversion tool.ProCon - 蛋白质组学转换工具。
J Proteomics. 2015 Nov 3;129:56-62. doi: 10.1016/j.jprot.2015.06.015. Epub 2015 Jul 13.

引用本文的文献

1
Metaproteomics of the human gut microbiota: Challenges and contributions to other OMICS.人类肠道微生物群的宏蛋白质组学:挑战及对其他组学的贡献
Clin Mass Spectrom. 2019 Jun 4;14 Pt A:18-30. doi: 10.1016/j.clinms.2019.06.001. eCollection 2019 Sep.
2
Hijacking the Mustard-Oil Bomb: How a Glucosinolate-Sequestering Flea Beetle Copes With Plant Myrosinases.劫持芥子油炸弹:一种积累硫代葡萄糖苷的跳甲如何应对植物黑芥子酶
Front Plant Sci. 2021 May 20;12:645030. doi: 10.3389/fpls.2021.645030. eCollection 2021.
3
Impact of the Identification Strategy on the Reproducibility of the DDA and DIA Results.

本文引用的文献

1
The mzIdentML data standard for mass spectrometry-based proteomics results.基于质谱的蛋白质组学结果的 mzIdentML 数据标准。
Mol Cell Proteomics. 2012 Jul;11(7):M111.014381. doi: 10.1074/mcp.M111.014381. Epub 2012 Feb 27.
2
Protein inference: a review.蛋白质推断:综述。
Brief Bioinform. 2012 Sep;13(5):586-614. doi: 10.1093/bib/bbs004. Epub 2012 Feb 28.
3
IsoformResolver: A peptide-centric algorithm for protein inference.IsoformResolver:一种基于肽段的蛋白质推断算法。
鉴定策略对数据依赖性采集(DDA)和数据非依赖性采集(DIA)结果可重复性的影响。
J Proteome Res. 2020 Aug 7;19(8):3153-3161. doi: 10.1021/acs.jproteome.0c00153. Epub 2020 Jun 19.
4
Bioinformatics Resources for Plant Abiotic Stress Responses: State of the Art and Opportunities in the Fast Evolving -Omics Era.植物非生物胁迫响应的生物信息学资源:快速发展的组学时代的现状与机遇
Plants (Basel). 2020 May 6;9(5):591. doi: 10.3390/plants9050591.
5
Improving Proteomics Data Reproducibility with a Dual-Search Strategy.采用双重搜索策略提高蛋白质组学数据的可重复性
Anal Chem. 2020 Jan 21;92(2):1697-1701. doi: 10.1021/acs.analchem.9b04955. Epub 2020 Jan 9.
6
Proteomics INTegrator (PINT): An Online Tool To Store, Query, and Visualize Large Proteomics Experiment Results.蛋白质组学整合器(PINT):一个用于存储、查询和可视化大型蛋白质组学实验结果的在线工具。
J Proteome Res. 2019 Aug 2;18(8):2999-3008. doi: 10.1021/acs.jproteome.8b00711. Epub 2019 Jul 1.
7
Plant Defensive β-Glucosidases Resist Digestion and Sustain Activity in the Gut of a Lepidopteran Herbivore.植物防御性β-葡萄糖苷酶在鳞翅目食草动物肠道中抗消化并维持活性。
Front Plant Sci. 2018 Oct 8;9:1389. doi: 10.3389/fpls.2018.01389. eCollection 2018.
8
Deducing the presence of proteins and proteoforms in quantitative proteomics.定量蛋白质组学中蛋白质和蛋白质异构体的推断。
Nat Commun. 2018 Jun 13;9(1):2320. doi: 10.1038/s41467-018-04411-5.
9
In-Depth Proteomic Characterization of Classical and Non-Classical Monocyte Subsets.经典和非经典单核细胞亚群的深度蛋白质组学表征
Proteomes. 2018 Feb 5;6(1):8. doi: 10.3390/proteomes6010008.
10
Detection of Missing Proteins Using the PRIDE Database as a Source of Mass Spectrometry Evidence.以PRIDE数据库作为质谱证据来源检测缺失蛋白质
J Proteome Res. 2016 Nov 4;15(11):4101-4115. doi: 10.1021/acs.jproteome.6b00437. Epub 2016 Sep 15.
J Proteome Res. 2011 Jul 1;10(7):3060-75. doi: 10.1021/pr200039p. Epub 2011 Jun 7.
4
PeptideClassifier for protein inference and targeted quantitative proteomics.用于蛋白质推断和靶向定量蛋白质组学的肽段分类器
Nat Biotechnol. 2010 Jul;28(7):647-50. doi: 10.1038/nbt0710-647.
5
Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function.鸟枪法蛋白质组学数据的确定性蛋白质推断为拟南芥花粉发育和功能提供了新见解。
Genome Res. 2009 Oct;19(10):1786-800. doi: 10.1101/gr.089060.108. Epub 2009 Jun 22.
6
Database searching and accounting of multiplexed precursor and product ion spectra from the data independent analysis of simple and complex peptide mixtures.对简单和复杂肽混合物进行数据独立分析时,对多重前体和产物离子光谱的数据库搜索与核算。
Proteomics. 2009 Mar;9(6):1696-719. doi: 10.1002/pmic.200800564.
7
The detection, correlation, and comparison of peptide precursor and product ions from data independent LC-MS with data dependant LC-MS/MS.来自数据非依赖型液相色谱-质谱法(LC-MS)的肽前体离子和产物离子与数据依赖型液相色谱-串联质谱法(LC-MS/MS)的检测、关联及比较。
Proteomics. 2009 Mar;9(6):1683-95. doi: 10.1002/pmic.200800562.
8
Absolute quantification of proteins by LCMSE: a virtue of parallel MS acquisition.通过低分辨质谱成像进行蛋白质的绝对定量:平行质谱采集的一个优点。
Mol Cell Proteomics. 2006 Jan;5(1):144-56. doi: 10.1074/mcp.M500230-MCP200. Epub 2005 Oct 11.
9
Interpretation of shotgun proteomic data: the protein inference problem.鸟枪法蛋白质组学数据的解读:蛋白质推断问题。
Mol Cell Proteomics. 2005 Oct;4(10):1419-40. doi: 10.1074/mcp.R500012-MCP200. Epub 2005 Jul 11.
10
A statistical model for identifying proteins by tandem mass spectrometry.一种通过串联质谱法鉴定蛋白质的统计模型。
Anal Chem. 2003 Sep 1;75(17):4646-58. doi: 10.1021/ac0341261.