• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于串联质谱数据的通用聚类工具的实现与应用

Implementation and application of a versatile clustering tool for tandem mass spectrometry data.

作者信息

Flikka Kristian, Meukens Jeroen, Helsens Kenny, Vandekerckhove Joël, Eidhammer Ingvar, Gevaert Kris, Martens Lennart

机构信息

Computational Biology Unit, Bergen Center for Computational Science, University of Bergen, Bergen, Norway.

出版信息

Proteomics. 2007 Sep;7(18):3245-58. doi: 10.1002/pmic.200700160.

DOI:10.1002/pmic.200700160
PMID:17708593
Abstract

High-throughput proteomics experiments typically generate large amounts of peptide fragmentation mass spectra during a single experiment. There is often a substantial amount of redundant fragmentation of the same precursors among these spectra, which is usually considered a nuisance. We here discuss the potential of clustering and merging redundant spectra to turn this redundancy into a useful property of the dataset. To this end, we have created the first general-purpose, freely available open-source software application for clustering and merging MS/MS spectra. The application also introduces a novel approach to calculating the similarity of fragmentation mass spectra that takes into account the increased precision of modern mass spectrometers, and we suggest a simple but effective improvement to single-linkage clustering. The application and the novel algorithms are applied to several real-life proteomic datasets and the results are discussed. An analysis of the influence of the different algorithms available and their parameters is given, as well as a number of important applications of the overall approach.

摘要

高通量蛋白质组学实验通常在单次实验中产生大量肽段碎裂质谱图。在这些质谱图中,同一前体往往存在大量冗余碎裂,这通常被视为一种麻烦。我们在此讨论对冗余质谱图进行聚类和合并的潜力,以便将这种冗余转化为数据集的一个有用特性。为此,我们创建了首个用于聚类和合并串联质谱(MS/MS)谱图的通用、免费开源软件应用程序。该应用程序还引入了一种计算碎裂质谱图相似度的新方法,该方法考虑了现代质谱仪提高的精度,并且我们提出了对单链聚类的一个简单而有效的改进。该应用程序和新算法被应用于多个实际蛋白质组学数据集,并对结果进行了讨论。给出了对可用的不同算法及其参数影响的分析,以及该整体方法的一些重要应用。

相似文献

1
Implementation and application of a versatile clustering tool for tandem mass spectrometry data.一种用于串联质谱数据的通用聚类工具的实现与应用
Proteomics. 2007 Sep;7(18):3245-58. doi: 10.1002/pmic.200700160.
2
A spectral clustering approach to MS/MS identification of post-translational modifications.一种用于质谱/质谱鉴定翻译后修饰的谱聚类方法。
J Proteome Res. 2008 Nov;7(11):4614-22. doi: 10.1021/pr800226w. Epub 2008 Sep 19.
3
Grid-based analysis of tandem mass spectrometry data in clinical proteomics.临床蛋白质组学中串联质谱数据的基于网格的分析
Stud Health Technol Inform. 2007;126:13-22.
4
Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries.使用谱库对大规模蛋白质组学实验中的肽段MS/MS谱进行分析。
Anal Chem. 2006 Aug 15;78(16):5678-84. doi: 10.1021/ac060279n.
5
An accurate and efficient algorithm for Peptide and ptm identification by tandem mass spectrometry.一种用于通过串联质谱法进行肽段和翻译后修饰鉴定的准确高效算法。
Genome Inform. 2007;19:119-30.
6
Sequence similarity-driven proteomics in organisms with unknown genomes by LC-MS/MS and automated de novo sequencing.通过液相色谱-串联质谱法(LC-MS/MS)和自动从头测序,对基因组未知的生物体进行序列相似性驱动的蛋白质组学研究。
Proteomics. 2007 Jul;7(14):2318-29. doi: 10.1002/pmic.200700003.
7
Filtering strategies for improving protein identification in high-throughput MS/MS studies.用于在高通量串联质谱研究中提高蛋白质鉴定的过滤策略。
Proteomics. 2009 Feb;9(4):848-60. doi: 10.1002/pmic.200800517.
8
Two dimensional mass mapping as a general method of data representation in comprehensive analysis of complex molecular mixtures.二维质量图谱作为复杂分子混合物综合分析中数据表示的通用方法。
Anal Chem. 2009 May 15;81(10):3738-45. doi: 10.1021/ac802532j.
9
swissPIT: a novel approach for pipelined analysis of mass spectrometry data.瑞士蛋白质组学工具包(swissPIT):一种用于质谱数据分析流水线式分析的新方法。
Bioinformatics. 2008 Jun 1;24(11):1416-7. doi: 10.1093/bioinformatics/btn139. Epub 2008 Apr 23.
10
CIFTER: automated charge-state determination for peptide tandem mass spectra.CIFTER:用于肽串联质谱的自动电荷态测定
Anal Chem. 2008 Mar 1;80(5):1520-8. doi: 10.1021/ac702038q. Epub 2008 Feb 2.

引用本文的文献

1
CHICKN: extraction of peptide chromatographic elution profiles from large scale mass spectrometry data by means of Wasserstein compressive hierarchical cluster analysis.CHICKN:通过 Wasserstein 压缩分层聚类分析从大规模质谱数据中提取肽色谱洗脱曲线。
BMC Bioinformatics. 2021 Feb 12;22(1):68. doi: 10.1186/s12859-021-03969-0.
2
Exploring the potential of public proteomics data.探索公共蛋白质组学数据的潜力。
Proteomics. 2016 Jan;16(2):214-25. doi: 10.1002/pmic.201500295. Epub 2015 Dec 15.
3
CAMS-RS: Clustering Algorithm for Large-Scale Mass Spectrometry Data Using Restricted Search Space and Intelligent Random Sampling.
CAMS-RS:一种使用受限搜索空间和智能随机采样的大规模质谱数据聚类算法。
IEEE/ACM Trans Comput Biol Bioinform. 2014 Jan-Feb;11(1):128-41. doi: 10.1109/TCBB.2013.152.
4
Spectral archives: extending spectral libraries to analyze both identified and unidentified spectra.光谱档案:扩展光谱库以分析已识别和未识别的光谱。
Nat Methods. 2011 May 15;8(7):587-91. doi: 10.1038/nmeth.1609.
5
OMSSAGUI: An open-source user interface component to configure and run the OMSSA search engine.OMSSAGUI:一个用于配置和运行OMSSA搜索引擎的开源用户界面组件。
Proteomics. 2008 Jun;8(12):2376-8. doi: 10.1002/pmic.200701126.