Improving large-scale proteomics by clustering of mass spectrometry data.

Authors

Beer Ilan, Barnea Eilon, Ziv Tamar, Admon Arie

Affiliation

IBM Haifa Research Lab, Haifa, Israel.

Publication

Proteomics. 2004 Apr;4(4):950-60. doi: 10.1002/pmic.200300652.

DOI: 10.1002/pmic.200300652
PMID: 15048977
Abstract

Tandem mass spectrometry (MS/MS), coupled with liquid chromatography (LC), is a powerful tool for the analysis and comparison of complex protein and peptide mixtures. However, the extremely large amounts of data that result from the process are very complex and difficult to analyze. We show how the clustering of similar spectra from multiple LC-MS/MS runs can help in data management and improve the analysis of complex peptide mixtures. The major effect of spectrum clustering is the reduction of the huge amounts of data to a manageable size. As a result, analysis time is shorter and more data can be stored for further analysis. Furthermore, spectrum quality improvement allows the identification of more peptides with greater confidence, the comparison of complex peptide mixtures is facilitated, and the entire proteomics project is presented in concise form. Pep-Miner is an advanced software tool that implements these clustering-based applications. It proved useful in several comparative proteomics projects involving lung cancer cells and various other cell types. In one of these projects, Pep-Miner reduced 517 000 spectra to 20 900 clusters and identified 2518 peptides derived from 830 proteins. Clustering and identification lasted less than two hours on an IBM Thinkpad T23 computer (laptop). Pep-Miner's unique properties make it a very useful tool for large-scale shotgun proteomics projects.
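
The abstract does not describe how Pep-Miner groups similar spectra, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes each spectrum is a precursor m/z plus a list of (m/z, intensity) peaks, bins peaks into 1 m/z-wide bins, and greedily merges a spectrum into the first cluster whose precursor is within a tolerance and whose consensus spectrum has a cosine similarity above a threshold. The function names, the data layout, and the `mz_tol` and `min_similarity` defaults are all illustrative assumptions.

```python
import math


def bin_peaks(peaks, bin_width=1.0):
    """Sum the intensities of (mz, intensity) pairs into fixed-width m/z bins."""
    binned = {}
    for mz, intensity in peaks:
        key = int(mz // bin_width)
        binned[key] = binned.get(key, 0.0) + intensity
    return binned


def cosine_similarity(a, b):
    """Cosine similarity between two sparse binned spectra (dicts bin -> intensity)."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster_spectra(spectra, mz_tol=0.5, min_similarity=0.8):
    """Greedy single-pass clustering (illustrative, not Pep-Miner's algorithm).

    spectra: list of (precursor_mz, [(mz, intensity), ...]) tuples.
    Returns a list of clusters; each holds the first member's precursor m/z,
    a summed consensus spectrum, and the indices of its member spectra.
    """
    clusters = []
    for index, (precursor_mz, peaks) in enumerate(spectra):
        binned = bin_peaks(peaks)
        for cluster in clusters:
            close = abs(cluster["precursor_mz"] - precursor_mz) <= mz_tol
            if close and cosine_similarity(cluster["consensus"], binned) >= min_similarity:
                # Merge: add intensities into the consensus spectrum.
                for key, intensity in binned.items():
                    cluster["consensus"][key] = cluster["consensus"].get(key, 0.0) + intensity
                cluster["members"].append(index)
                break
        else:
            # No matching cluster was found, so this spectrum starts a new one.
            clusters.append(
                {"precursor_mz": precursor_mz, "consensus": dict(binned), "members": [index]}
            )
    return clusters


# Made-up example data: two near-identical spectra and one unrelated spectrum.
example = [
    (500.2, [(100.1, 10.0), (200.2, 50.0), (300.3, 20.0)]),
    (500.3, [(100.2, 12.0), (200.1, 48.0), (300.4, 22.0)]),
    (700.0, [(150.0, 30.0), (250.0, 5.0)]),
]
for cluster in cluster_spectra(example):
    print(cluster["precursor_mz"], cluster["members"])
```

Summing member intensities into the consensus is a simple way to improve signal over any single noisy spectrum, which is the kind of quality gain the abstract attributes to clustering. For scale, the project cited in the abstract collapsed 517 000 spectra into 20 900 clusters, roughly a 25-fold reduction in the data passed on to identification.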

Similar articles

1
Improving large-scale proteomics by clustering of mass spectrometry data.
Proteomics. 2004 Apr;4(4):950-60. doi: 10.1002/pmic.200300652.
2
MultiAlign: a multiple LC-MS analysis tool for targeted omics analysis.
BMC Bioinformatics. 2013 Feb 12;14:49. doi: 10.1186/1471-2105-14-49.
3
LC-MALDI-TOF/TOF for shotgun proteomics.
Methods Mol Biol. 2014;1156:27-38. doi: 10.1007/978-1-4939-0685-7_2.
4
Enhanced characterization of complex proteomic samples using LC-MALDI MS/MS: exclusion of redundant peptides from MS/MS analysis in replicate runs.
Anal Chem. 2005 Dec 1;77(23):7816-25. doi: 10.1021/ac050956y.
5
Post alignment clustering procedure for comparative quantitative proteomics LC-MS data.
Proteomics. 2008 Jan;8(1):32-6. doi: 10.1002/pmic.200700707.
6
Shotgun proteomics: tools for the analysis of complex biological systems.
Curr Opin Mol Ther. 2002 Jun;4(3):242-50.
7
Improved reporter ion assignment of raw isobaric stable isotope labeled liquid chromatography/matrix-assisted laser desorption/ionization tandem time-of-flight mass spectral data for quantitative proteomics.
Rapid Commun Mass Spectrom. 2012 Dec 15;26(23):2777-85. doi: 10.1002/rcm.6403.
8
msCRUSH: Fast Tandem Mass Spectral Clustering Using Locality Sensitive Hashing.
J Proteome Res. 2019 Jan 4;18(1):147-158. doi: 10.1021/acs.jproteome.8b00448. Epub 2018 Dec 14.
9
MSQ: a tool for quantification of proteomics data generated by a liquid chromatography/matrix-assisted laser desorption/ionization time-of-flight tandem mass spectrometry based targeted quantitative proteomics platform.
Rapid Commun Mass Spectrom. 2010 Feb;24(4):403-8. doi: 10.1002/rcm.4407.
10
CHASE, a charge-assisted sequencing algorithm for automated homology-based protein identifications with matrix-assisted laser desorption/ionization time-of-flight post-source decay fragmentation data.
J Mass Spectrom. 2005 Apr;40(4):475-88. doi: 10.1002/jms.817.

Cited by

1
Mining Small Molecules from Strains Isolated from Philippine Teredinidae.
Metabolites. 2022 Nov 21;12(11):1152. doi: 10.3390/metabo12111152.
2
Analytical Considerations of Large-Scale Aptamer-Based Datasets for Translational Applications.
Cancers (Basel). 2022 Apr 29;14(9):2227. doi: 10.3390/cancers14092227.
3
Polyglutamylation: biology and analysis.
Amino Acids. 2022 Apr;54(4):529-542. doi: 10.1007/s00726-022-03146-4. Epub 2022 Mar 31.
4
Comprehensive Two-Dimensional Gas Chromatography Mass Spectrometry-Based Metabolomics.
Adv Exp Med Biol. 2021;1280:57-67. doi: 10.1007/978-3-030-51652-9_4.
5
CHICKN: extraction of peptide chromatographic elution profiles from large scale mass spectrometry data by means of Wasserstein compressive hierarchical cluster analysis.
BMC Bioinformatics. 2021 Feb 12;22(1):68. doi: 10.1186/s12859-021-03969-0.
6
Methods for Proteogenomics Data Analysis, Challenges, and Scalability Bottlenecks: A Survey.
IEEE Access. 2021;9:5497-5516. doi: 10.1109/ACCESS.2020.3047588. Epub 2020 Dec 25.
7
Possible proteomic biomarkers for the detection of pancreatic cancer in oral fluids.
Sci Rep. 2020 Dec 15;10(1):21995. doi: 10.1038/s41598-020-78922-x.
8
Future Prospects of Spectral Clustering Approaches in Proteomics.
Proteomics. 2018 Jul;18(14):e1700454. doi: 10.1002/pmic.201700454.
9
A novel quantification-driven proteomic strategy identifies an endogenous peptide of pleiotrophin as a new biomarker of Alzheimer's disease.
Sci Rep. 2017 Oct 17;7(1):13333. doi: 10.1038/s41598-017-13831-0.
10
DISMS2: A flexible algorithm for direct proteome-wide distance calculation of LC-MS/MS runs.
BMC Bioinformatics. 2017 Mar 3;18(1):148. doi: 10.1186/s12859-017-1514-2.