Spectral archives: extending spectral libraries to analyze both identified and unidentified spectra.

Affiliation

Department of Computer Science and Engineering, University of California, San Diego, La Jolla, California, USA.

Publication information

Nat Methods. 2011 May 15;8(7):587-91. doi: 10.1038/nmeth.1609.

DOI: 10.1038/nmeth.1609
PMID: 21572408
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3128193/
Abstract

Tandem mass spectrometry (MS/MS) experiments yield multiple, nearly identical spectra of the same peptide in various laboratories, but proteomics researchers typically do not leverage the unidentified spectra produced in other labs to decode spectra they generate. We propose a spectral archives approach that clusters MS/MS datasets, representing similar spectra by a single consensus spectrum. Spectral archives extend spectral libraries by analyzing both identified and unidentified spectra in the same way and maintaining information about peptide spectra that are common across species and conditions. Thus archives offer both traditional library spectrum similarity-based search capabilities along with new ways to analyze the data. By developing a clustering tool, MS-Cluster, we generated a spectral archive from ∼1.18 billion spectra that greatly exceeds the size of existing spectral repositories. We advocate that publicly available data should be organized into spectral archives rather than be analyzed as disparate datasets, as is mostly the case today.
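The core idea in the abstract, grouping similar MS/MS spectra and representing each group by a single consensus spectrum, can be illustrated with a small sketch. This is not the MS-Cluster algorithm, which the abstract does not describe in detail. It is a simple greedy clustering on cosine similarity, and the bin width, precursor tolerance, similarity threshold, and all function names are assumptions chosen for illustration.

```python
import math
from collections import defaultdict

BIN_WIDTH = 1.0  # assumed m/z bin width in Da (illustrative, not from the paper)


def bin_spectrum(peaks, bin_width=BIN_WIDTH):
    """Turn (m/z, intensity) peaks into a unit-length sparse vector keyed by bin."""
    vec = defaultdict(float)
    for mz, intensity in peaks:
        vec[int(mz / bin_width)] += intensity
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {k: v / norm for k, v in vec.items()} if norm > 0 else {}


def cosine(a, b):
    """Dot product of two unit-length sparse vectors (their cosine similarity)."""
    if len(a) > len(b):
        a, b = b, a
    return sum(v * b.get(k, 0.0) for k, v in a.items())


def consensus(members):
    """Sum the member vectors bin by bin and renormalize to unit length."""
    total = defaultdict(float)
    for vec in members:
        for k, v in vec.items():
            total[k] += v
    norm = math.sqrt(sum(v * v for v in total.values()))
    return {k: v / norm for k, v in total.items()} if norm > 0 else {}


def cluster_spectra(spectra, precursor_tol=2.0, min_similarity=0.7):
    """Greedy clustering: each spectrum joins the most similar existing cluster
    within the precursor m/z tolerance, or starts a new cluster.

    `spectra` is a list of (precursor_mz, peaks) pairs. Identified and
    unidentified spectra are treated the same way, since no peptide
    annotation is needed. Thresholds are hypothetical defaults.
    """
    clusters = []
    for precursor_mz, peaks in spectra:
        vec = bin_spectrum(peaks)
        best, best_sim = None, min_similarity
        for c in clusters:
            if abs(c["precursor_mz"] - precursor_mz) > precursor_tol:
                continue
            sim = cosine(vec, c["consensus"])
            if sim >= best_sim:
                best, best_sim = c, sim
        if best is None:
            clusters.append(
                {"precursor_mz": precursor_mz, "members": [vec], "consensus": vec}
            )
        else:
            best["members"].append(vec)
            best["consensus"] = consensus(best["members"])
    return clusters
```

Searching a new spectrum against such an archive would then compare it to one consensus spectrum per cluster rather than to every member, which is what makes a similarity search over very large collections plausible.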

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb62/3128193/3b2c81a465ca/nihms-291694-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb62/3128193/5afd3ea64824/nihms-291694-f0001.jpg