• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用特征哈希和图形处理单元进行高分辨率质谱的极快速准确开放修饰谱库搜索。

Extremely Fast and Accurate Open Modification Spectral Library Searching of High-Resolution Mass Spectra Using Feature Hashing and Graphics Processing Units.

机构信息

Department of Mathematics and Computer Science , University of Antwerp , 2020 Antwerp , Belgium.

Biomedical Informatics Network Antwerpen (biomina) , 2020 Antwerp , Belgium.

出版信息

J Proteome Res. 2019 Oct 4;18(10):3792-3799. doi: 10.1021/acs.jproteome.9b00291. Epub 2019 Aug 30.

DOI:10.1021/acs.jproteome.9b00291
PMID:31448616
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6886738/
Abstract

Open modification searching (OMS) is a powerful search strategy to identify peptides with any type of modification. OMS works by using a very wide precursor mass window to allow modified spectra to match against their unmodified variants, after which the modification types can be inferred from the corresponding precursor mass differences. A disadvantage of this strategy, however, is the large computational cost, because each query spectrum has to be compared against a multitude of candidate peptides. We have previously introduced the ANN-SoLo tool for fast and accurate open spectral library searching. ANN-SoLo uses approximate nearest neighbor indexing to speed up OMS by selecting only a limited number of the most relevant library spectra to compare to an unknown query spectrum. Here we demonstrate how this candidate selection procedure can be further optimized using graphics processing units. Additionally, we introduce a feature hashing scheme to convert high-resolution spectra to low-dimensional vectors. On the basis of these algorithmic advances, along with low-level code optimizations, the new version of ANN-SoLo is up to an order of magnitude faster than its initial version. This makes it possible to efficiently perform open searches on a large scale to gain a deeper understanding about the protein modification landscape. We demonstrate the computational efficiency and identification performance of ANN-SoLo based on a large data set of the draft human proteome. ANN-SoLo is implemented in Python and C++. It is freely available under the Apache 2.0 license at https://github.com/bittremieux/ANN-SoLo .

摘要

开放修饰搜索(Open modification searching,OMS)是一种强大的搜索策略,可用于识别具有任何类型修饰的肽。OMS 通过使用非常宽的前体质量窗口来实现,允许修饰后的谱与未修饰的变体匹配,然后可以从相应的前体质量差异推断出修饰类型。然而,这种策略的一个缺点是计算成本高,因为每个查询谱都必须与大量候选肽进行比较。我们之前介绍了用于快速准确的开放光谱库搜索的 ANN-SoLo 工具。ANN-SoLo 使用近似最近邻索引来通过仅选择数量有限的最相关的库谱与未知查询谱进行比较来加速 OMS。在这里,我们展示了如何使用图形处理单元进一步优化这种候选选择过程。此外,我们引入了一种特征哈希方案,将高分辨率谱转换为低维向量。基于这些算法上的改进以及低级代码优化,新版本的 ANN-SoLo 的速度比其初始版本快一个数量级。这使得可以有效地进行大规模的开放搜索,以更深入地了解蛋白质修饰景观。我们基于人类蛋白质组草案的大型数据集展示了 ANN-SoLo 的计算效率和识别性能。ANN-SoLo 是用 Python 和 C++实现的。它在 Apache 2.0 许可证下可在 https://github.com/bittremieux/ANN-SoLo 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b977/6886738/918971340173/nihms-1059683-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b977/6886738/0a426d1cebfa/nihms-1059683-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b977/6886738/b28b10ff3968/nihms-1059683-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b977/6886738/1e6fbdb3e690/nihms-1059683-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b977/6886738/6bd1a40ef3f2/nihms-1059683-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b977/6886738/918971340173/nihms-1059683-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b977/6886738/0a426d1cebfa/nihms-1059683-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b977/6886738/b28b10ff3968/nihms-1059683-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b977/6886738/1e6fbdb3e690/nihms-1059683-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b977/6886738/6bd1a40ef3f2/nihms-1059683-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b977/6886738/918971340173/nihms-1059683-f0005.jpg

相似文献

1
Extremely Fast and Accurate Open Modification Spectral Library Searching of High-Resolution Mass Spectra Using Feature Hashing and Graphics Processing Units.使用特征哈希和图形处理单元进行高分辨率质谱的极快速准确开放修饰谱库搜索。
J Proteome Res. 2019 Oct 4;18(10):3792-3799. doi: 10.1021/acs.jproteome.9b00291. Epub 2019 Aug 30.
2
Fast Open Modification Spectral Library Searching through Approximate Nearest Neighbor Indexing.快速开放修改谱库搜索通过近似最近邻索引。
J Proteome Res. 2018 Oct 5;17(10):3463-3474. doi: 10.1021/acs.jproteome.8b00359. Epub 2018 Sep 13.
3
Semisupervised Machine Learning for Sensitive Open Modification Spectral Library Searching.基于半监督学习的敏感开放修饰谱库检索方法。
J Proteome Res. 2023 Feb 3;22(2):585-593. doi: 10.1021/acs.jproteome.2c00616. Epub 2023 Jan 23.
4
Accelerating open modification spectral library searching on tensor core in high-dimensional space.在高维空间的张量核上加速开放修改谱库搜索。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad404.
5
QuickMod: A tool for open modification spectrum library searches.QuickMod:用于开放修饰谱库检索的工具。
J Proteome Res. 2011 Jul 1;10(7):2913-21. doi: 10.1021/pr200152g. Epub 2011 May 2.
6
Large-scale tandem mass spectrum clustering using fast nearest neighbor searching.使用快速最近邻搜索的大规模串联质谱聚类
Rapid Commun Mass Spectrom. 2025 May;39 Suppl 1(Suppl 1):e9153. doi: 10.1002/rcm.9153. Epub 2021 Jul 20.
7
A Fast and Memory-Efficient Spectral Library Search Algorithm Using Locality-Sensitive Hashing.基于局部敏感哈希的快速、低内存消耗光谱库搜索算法
Proteomics. 2020 Nov;20(21-22):e2000002. doi: 10.1002/pmic.202000002. Epub 2020 Jun 29.
8
Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate.开放 MS/MS 谱库检索以鉴定意料外的翻译后修饰,并提高谱图鉴定率。
Bioinformatics. 2010 Jun 15;26(12):i399-406. doi: 10.1093/bioinformatics/btq185.
9
Hunting for unexpected post-translational modifications by spectral library searching with tier-wise scoring.通过分层评分的谱库搜索寻找意外的翻译后修饰。
J Proteome Res. 2014 May 2;13(5):2262-71. doi: 10.1021/pr401006g. Epub 2014 Apr 2.
10
Sensitive and Specific Spectral Library Searching with CompOmics Spectral Library Searching Tool and Percolator.使用 CompOmics 光谱库检索工具和 percolator 进行敏感和特异的光谱库检索。
J Proteome Res. 2022 May 6;21(5):1365-1370. doi: 10.1021/acs.jproteome.2c00075. Epub 2022 Apr 21.

引用本文的文献

1
Quality Control in the Mass Spectrometry Proteomics Core: A Practical Primer.质谱蛋白质组学核心实验室的质量控制:实用入门指南。
J Biomol Tech. 2024 Sep 12;35(3). doi: 10.7171/3fc1f5fe.42308a9a. eCollection 2024 Sep 30.
2
Comprehensive Proteomics and Machine Learning Analysis to Distinguish Follicular Adenoma and Follicular Thyroid Carcinoma from Indeterminate Thyroid Nodules.综合蛋白质组学与机器学习分析以区分不确定甲状腺结节中的滤泡性腺瘤和滤泡状甲状腺癌。
Endocrinol Metab (Seoul). 2025 Aug;40(4):623-636. doi: 10.3803/EnM.2024.2208. Epub 2025 Apr 10.
3
Proteome-wide non-cleavable crosslink identification with MS Annika 3.0 reveals the structure of the C. elegans Box C/D complex.

本文引用的文献

1
spectrum_utils: A Python Package for Mass Spectrometry Data Processing and Visualization.spectrum_utils:一个用于质谱数据分析和可视化的 Python 包。
Anal Chem. 2020 Jan 7;92(1):659-661. doi: 10.1021/acs.analchem.9b04884. Epub 2019 Dec 20.
2
TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry datasets.TagGraph 揭示了来自大型串联质谱数据集的广泛蛋白质修饰图谱。
Nat Biotechnol. 2019 Apr;37(4):469-479. doi: 10.1038/s41587-019-0067-5. Epub 2019 Apr 1.
3
DeltaMass: Automated Detection and Visualization of Mass Shifts in Proteomic Open-Search Results.
利用MS Annika 3.0进行全蛋白质组不可裂解交联鉴定揭示了秀丽隐杆线虫Box C/D复合物的结构。
Commun Chem. 2024 Dec 19;7(1):300. doi: 10.1038/s42004-024-01386-x.
4
Proteomics Can Rise to the Challenge of Pseudogenes' Coding Nature.蛋白质组学能够应对假基因编码特性带来的挑战。
J Proteome Res. 2024 Dec 6;23(12):5233-5249. doi: 10.1021/acs.jproteome.4c00116. Epub 2024 Nov 1.
5
Demystifying PTM Identification Using MODplus: Best Practices and Pitfalls.使用 MODplus 揭开 PTM 鉴定的神秘面纱:最佳实践和陷阱。
Methods Mol Biol. 2024;2836:37-55. doi: 10.1007/978-1-0716-4007-4_3.
6
Multi-Omics integration can be used to rescue metabolic information for some of the dark region of the Pseudomonas putida proteome.多组学整合可用于挽救恶臭假单胞菌蛋白质组某些未知区域的代谢信息。
BMC Genomics. 2024 Mar 11;25(1):267. doi: 10.1186/s12864-024-10082-y.
7
Open access repository-scale propagated nearest neighbor suspect spectral library for untargeted metabolomics.开放获取存储库规模传播的近邻可疑光谱库,用于无目标代谢组学。
Nat Commun. 2023 Dec 20;14(1):8488. doi: 10.1038/s41467-023-44035-y.
8
Spectroscape enables real-time query and visualization of a spectral archive in proteomics.Spectroscape 能够实时查询和可视化蛋白质组学中的光谱档案。
Nat Commun. 2023 Oct 7;14(1):6267. doi: 10.1038/s41467-023-42006-x.
9
Flash entropy search to query all mass spectral libraries in real time.实时查询所有质谱文库的 Flash 熵搜索。
Nat Methods. 2023 Oct;20(10):1475-1478. doi: 10.1038/s41592-023-02012-9. Epub 2023 Sep 21.
10
Accelerating open modification spectral library searching on tensor core in high-dimensional space.在高维空间的张量核上加速开放修改谱库搜索。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad404.
DeltaMass:蛋白质组学开放式检索结果中质量漂移的自动检测和可视化。
J Proteome Res. 2019 Feb 1;18(2):715-720. doi: 10.1021/acs.jproteome.8b00728. Epub 2018 Dec 17.
4
Comprehensive identification of peptides in tandem mass spectra using an efficient open search engine.使用高效的开放式搜索引擎对串联质谱中的肽段进行全面鉴定。
Nat Biotechnol. 2018 Oct 8. doi: 10.1038/nbt.4236.
5
Fast Open Modification Spectral Library Searching through Approximate Nearest Neighbor Indexing.快速开放修改谱库搜索通过近似最近邻索引。
J Proteome Res. 2018 Oct 5;17(10):3463-3474. doi: 10.1021/acs.jproteome.8b00359. Epub 2018 Sep 13.
6
Enhanced Global Post-translational Modification Discovery with MetaMorpheus.MetaMorpheus 增强全局翻译后修饰发现。
J Proteome Res. 2018 May 4;17(5):1844-1851. doi: 10.1021/acs.jproteome.7b00873. Epub 2018 Apr 2.
7
How many human proteoforms are there?人类蛋白异构体有多少?
Nat Chem Biol. 2018 Feb 14;14(3):206-214. doi: 10.1038/nchembio.2576.
8
Quality control in mass spectrometry-based proteomics.基于质谱的蛋白质组学中的质量控制。
Mass Spectrom Rev. 2018 Sep;37(5):697-711. doi: 10.1002/mas.21544. Epub 2017 Sep 7.
9
SpecOMS: A Full Open Modification Search Method Performing All-to-All Spectra Comparisons within Minutes.SpecOMS:一种全开放式的修饰谱检索方法,可在数分钟内完成全谱比较。
J Proteome Res. 2017 Aug 4;16(8):3030-3038. doi: 10.1021/acs.jproteome.7b00308. Epub 2017 Jul 11.
10
MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics.MSFragger:基于质谱的蛋白质组学中实现超快速且全面的肽段鉴定
Nat Methods. 2017 May;14(5):513-520. doi: 10.1038/nmeth.4256. Epub 2017 Apr 10.