• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过预筛选加速气相色谱质谱的内存高效搜索

Memory-Efficient Searching of Gas-Chromatography Mass Spectra Accelerated by Prescreening.

作者信息

Smirnov Aleksandr, Liao Yunfei, Du Xiuxia

机构信息

Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA.

出版信息

Metabolites. 2022 May 29;12(6):491. doi: 10.3390/metabo12060491.

DOI:10.3390/metabo12060491
PMID:35736424
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9229217/
Abstract

The number of metabolomics studies and spectral libraries for compound annotation (i.e., assigning possible compound identities to a fragmentation spectrum) has been growing steadily in recent years. Accompanying this growth is the number of mass spectra available for searching through those libraries. As the size of spectral libraries grows, accurate and fast compound annotation becomes more challenging. We herein report a prescreening algorithm that was developed to address the speed of spectral search under the constraint of low memory requirements. This prescreening has been incorporated into the Automated Data Analysis Pipeline Spectral Knowledgebase (ADAP-KDB) and can be applied to compound annotation by searching other spectral libraries as well. Performance of the prescreening algorithm was evaluated for different sets of parameters and compared to the original ADAP-KDB spectral search and the MSSearch software. The comparison has demonstrated that the new algorithm is about four-times faster than the original when searching for low-resolution mass spectra, and about as fast as the original when searching for high-resolution mass spectra. However, the new algorithm is still slower than MSSearch due to the relational database design of the former. The new search workflow can be tried out at the ADAP-KDB web portal.

摘要

近年来,用于化合物注释(即给碎片谱图确定可能的化合物身份)的代谢组学研究数量和光谱库一直在稳步增长。随着这种增长,可用于在这些库中进行搜索的质谱数量也在增加。随着光谱库规模的扩大,准确且快速的化合物注释变得更具挑战性。我们在此报告一种预筛选算法,该算法是为在低内存需求的约束下解决光谱搜索速度问题而开发的。这种预筛选已被纳入自动数据分析管道光谱知识库(ADAP-KDB),并且通过搜索其他光谱库也可应用于化合物注释。针对不同参数集对预筛选算法的性能进行了评估,并与原始的ADAP-KDB光谱搜索和MSSearch软件进行了比较。比较结果表明,在搜索低分辨率质谱时,新算法比原始算法快约四倍,在搜索高分辨率质谱时与原始算法速度相当。然而,由于新算法的关系数据库设计,它仍然比MSSearch慢。新的搜索工作流程可在ADAP-KDB门户网站上试用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/199e/9229217/21536a145a5b/metabolites-12-00491-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/199e/9229217/39459e883ef2/metabolites-12-00491-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/199e/9229217/4557e742d52e/metabolites-12-00491-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/199e/9229217/21536a145a5b/metabolites-12-00491-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/199e/9229217/39459e883ef2/metabolites-12-00491-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/199e/9229217/4557e742d52e/metabolites-12-00491-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/199e/9229217/21536a145a5b/metabolites-12-00491-g003.jpg

相似文献

1
Memory-Efficient Searching of Gas-Chromatography Mass Spectra Accelerated by Prescreening.通过预筛选加速气相色谱质谱的内存高效搜索
Metabolites. 2022 May 29;12(6):491. doi: 10.3390/metabo12060491.
2
ADAP-KDB: A Spectral Knowledgebase for Tracking and Prioritizing Unknown GC-MS Spectra in the NIH's Metabolomics Data Repository.ADAP-KDB:一个用于在 NIH 代谢组学数据存储库中跟踪和优先处理未知 GC-MS 谱的谱知识库。
Anal Chem. 2021 Sep 14;93(36):12213-12220. doi: 10.1021/acs.analchem.1c00355. Epub 2021 Aug 29.
3
ADAP-GC 3.2: Graphical Software Tool for Efficient Spectral Deconvolution of Gas Chromatography-High-Resolution Mass Spectrometry Metabolomics Data.ADAP-GC 3.2:用于高效解析气相色谱-高分辨率质谱代谢组学数据的图形化软件工具。
J Proteome Res. 2018 Jan 5;17(1):470-478. doi: 10.1021/acs.jproteome.7b00633. Epub 2017 Nov 7.
4
A Fast and Memory-Efficient Spectral Library Search Algorithm Using Locality-Sensitive Hashing.基于局部敏感哈希的快速、低内存消耗光谱库搜索算法
Proteomics. 2020 Nov;20(21-22):e2000002. doi: 10.1002/pmic.202000002. Epub 2020 Jun 29.
5
compMS2Miner: An Automatable Metabolite Identification, Visualization, and Data-Sharing R Package for High-Resolution LC-MS Data Sets.compMS2Miner:一个用于高分辨 LC-MS 数据集的自动化代谢物鉴定、可视化和数据共享 R 包。
Anal Chem. 2017 Apr 4;89(7):3919-3928. doi: 10.1021/acs.analchem.6b02394. Epub 2017 Mar 27.
6
ADAP-GC 4.0: Application of Clustering-Assisted Multivariate Curve Resolution to Spectral Deconvolution of Gas Chromatography-Mass Spectrometry Metabolomics Data.ADAP-GC 4.0:基于聚类辅助多元曲线分辨的气相色谱-质谱代谢组学数据光谱解卷积应用。
Anal Chem. 2019 Jul 16;91(14):9069-9077. doi: 10.1021/acs.analchem.9b01424. Epub 2019 Jul 5.
7
Metabolomic spectral libraries for data-independent SWATH liquid chromatography mass spectrometry acquisition.用于数据非依赖型SWATH液相色谱质谱采集的代谢组学光谱库。
Anal Bioanal Chem. 2018 Mar;410(7):1873-1884. doi: 10.1007/s00216-018-0860-x. Epub 2018 Feb 6.
8
MIDAS: a database-searching algorithm for metabolite identification in metabolomics.MIDAS:一种用于代谢组学中代谢物鉴定的数据库搜索算法。
Anal Chem. 2014 Oct 7;86(19):9496-503. doi: 10.1021/ac5014783. Epub 2014 Sep 11.
9
COSS: A Fast and User-Friendly Tool for Spectral Library Searching.COSS:一款快速易用的光谱库搜索工具。
J Proteome Res. 2020 Jul 2;19(7):2786-2793. doi: 10.1021/acs.jproteome.9b00743. Epub 2020 May 28.
10
The critical role that spectral libraries play in capturing the metabolomics community knowledge.光谱库在捕获代谢组学领域知识方面的关键作用。
Metabolomics. 2022 Nov 19;18(12):94. doi: 10.1007/s11306-022-01947-y.

引用本文的文献

1
Transforming Big Data into AI-ready data for nutrition and obesity research.将大数据转化为适用于营养与肥胖研究的人工智能可用数据。
Obesity (Silver Spring). 2024 May;32(5):857-870. doi: 10.1002/oby.23989. Epub 2024 Mar 1.

本文引用的文献

1
MS2DeepScore: a novel deep learning similarity measure to compare tandem mass spectra.MS2DeepScore:一种用于比较串联质谱的新型深度学习相似性度量方法。
J Cheminform. 2021 Oct 29;13(1):84. doi: 10.1186/s13321-021-00558-4.
2
ADAP-KDB: A Spectral Knowledgebase for Tracking and Prioritizing Unknown GC-MS Spectra in the NIH's Metabolomics Data Repository.ADAP-KDB:一个用于在 NIH 代谢组学数据存储库中跟踪和优先处理未知 GC-MS 谱的谱知识库。
Anal Chem. 2021 Sep 14;93(36):12213-12220. doi: 10.1021/acs.analchem.1c00355. Epub 2021 Aug 29.
3
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships.
Spec2Vec:通过学习结构关系提高质谱相似性评分。
PLoS Comput Biol. 2021 Feb 16;17(2):e1008724. doi: 10.1371/journal.pcbi.1008724. eCollection 2021 Feb.
4
Deep learning embedder method and tool for mass spectra similarity search.用于质谱相似性搜索的深度学习嵌入器方法和工具。
J Proteomics. 2021 Feb 10;232:104070. doi: 10.1016/j.jprot.2020.104070. Epub 2020 Dec 8.
5
ADAP-GC 4.0: Application of Clustering-Assisted Multivariate Curve Resolution to Spectral Deconvolution of Gas Chromatography-Mass Spectrometry Metabolomics Data.ADAP-GC 4.0:基于聚类辅助多元曲线分辨的气相色谱-质谱代谢组学数据光谱解卷积应用。
Anal Chem. 2019 Jul 16;91(14):9069-9077. doi: 10.1021/acs.analchem.9b01424. Epub 2019 Jul 5.
6
SPLASH, a hashed identifier for mass spectra.SPLASH,一种用于质谱的哈希标识符。
Nat Biotechnol. 2016 Nov 8;34(11):1099-1101. doi: 10.1038/nbt.3689.
7
Optimization and testing of mass spectral library search algorithms for compound identification.化合物鉴定的质谱文库搜索算法的优化和测试。
J Am Soc Mass Spectrom. 1994 Sep;5(9):859-66. doi: 10.1016/1044-0305(94)87009-8.
8
Wavelet- and Fourier-transform-based spectrum similarity approaches to compound identification in gas chromatography/mass spectrometry.基于小波和傅里叶变换的光谱相似性方法在气相色谱/质谱联用中的化合物鉴定。
Anal Chem. 2011 Jul 15;83(14):5631-8. doi: 10.1021/ac200740w. Epub 2011 Jun 28.