• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChemDistiller:用于质谱代谢物注释的引擎。

ChemDistiller: an engine for metabolite annotation in mass spectrometry.

机构信息

Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, London, UK.

出版信息

Bioinformatics. 2018 Jun 15;34(12):2096-2102. doi: 10.1093/bioinformatics/bty080.

DOI:10.1093/bioinformatics/bty080
PMID:29447341
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9881669/
Abstract

MOTIVATION

High-resolution mass spectrometry permits simultaneous detection of thousands of different metabolites in biological samples; however, their automated annotation still presents a challenge due to the limited number of tailored computational solutions freely available to the scientific community.

RESULTS

Here, we introduce ChemDistiller, a customizable engine that combines automated large-scale annotation of metabolites using tandem MS data with a compiled database containing tens of millions of compounds with pre-calculated 'fingerprints' and fragmentation patterns. Our tests using publicly and commercially available tandem MS spectra for reference compounds show retrievals rates comparable to or exceeding the ones obtainable by the current state-of-the-art solutions in the field while offering higher throughput, scalability and processing speed.

AVAILABILITY AND IMPLEMENTATION

Source code freely available for download at https://bitbucket.org/iAnalytica/chemdistillerpython.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

高分辨率质谱允许同时检测生物样本中数千种不同的代谢物;然而,由于科学界可自由获得的定制计算解决方案数量有限,其自动化注释仍然是一个挑战。

结果

在这里,我们介绍了 ChemDistiller,这是一个可定制的引擎,它将使用串联质谱数据对代谢物进行自动大规模注释与一个包含数千万种化合物的编译数据库相结合,这些化合物具有预先计算的“指纹”和碎片模式。我们使用公开和商业上可用的串联质谱谱图对参考化合物进行测试,结果表明,我们的检索率与该领域当前最先进的解决方案相当,甚至更高,同时提供更高的吞吐量、可扩展性和处理速度。

可用性和实现

可在 https://bitbucket.org/iAnalytica/chemdistillerpython 下载源代码。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
ChemDistiller: an engine for metabolite annotation in mass spectrometry.ChemDistiller:用于质谱代谢物注释的引擎。
Bioinformatics. 2018 Jun 15;34(12):2096-2102. doi: 10.1093/bioinformatics/bty080.
2
MetFID: artificial neural network-based compound fingerprint prediction for metabolite annotation.MetFID:基于人工神经网络的化合物指纹预测代谢物注释。
Metabolomics. 2020 Sep 30;16(10):104. doi: 10.1007/s11306-020-01726-7.
3
Annotation of Specialized Metabolites from High-Throughput and High-Resolution Mass Spectrometry Metabolomics.高通量和高分辨率质谱代谢组学中特异性代谢物的注释。
Methods Mol Biol. 2020;2104:209-225. doi: 10.1007/978-1-0716-0239-3_12.
4
Quality evaluation of metabolite annotation based on comprehensive simulation of MS/MS data from high-resolution mass spectrometry (HRMS) and similarity scoring.基于高分辨率质谱(HRMS)的MS/MS数据综合模拟和相似性评分的代谢物注释质量评估
Anal Bioanal Chem. 2025 Jun;417(14):3061-3077. doi: 10.1007/s00216-025-05847-7. Epub 2025 Apr 18.
5
Automated Annotation of Untargeted All-Ion Fragmentation LC-MS Metabolomics Data with MetaboAnnotatoR.使用 MetaboAnnotatoR 对非靶向全离子碎裂 LC-MS 代谢组学数据进行自动注释。
Anal Chem. 2022 Mar 1;94(8):3446-3455. doi: 10.1021/acs.analchem.1c03032. Epub 2022 Feb 18.
6
Metabolomic spectral libraries for data-independent SWATH liquid chromatography mass spectrometry acquisition.用于数据非依赖型SWATH液相色谱质谱采集的代谢组学光谱库。
Anal Bioanal Chem. 2018 Mar;410(7):1873-1884. doi: 10.1007/s00216-018-0860-x. Epub 2018 Feb 6.
7
Autonomous METLIN-Guided In-source Fragment Annotation for Untargeted Metabolomics.自主 METLIN 引导的内源性碎片注释用于非靶向代谢组学。
Anal Chem. 2019 Mar 5;91(5):3246-3253. doi: 10.1021/acs.analchem.8b03126. Epub 2019 Feb 11.
8
[A novel method for efficient screening and annotation of important pathway-associated metabolites based on the modified metabolome and probe molecules].一种基于改良代谢组和探针分子的重要通路相关代谢物高效筛选与注释新方法
Se Pu. 2022 Sep;40(9):788-796. doi: 10.3724/SP.J.1123.2022.03025.
9
Addressing big data challenges in mass spectrometry-based metabolomics.解决基于质谱的代谢组学中的大数据挑战。
Chem Commun (Camb). 2022 Sep 8;58(72):9979-9990. doi: 10.1039/d2cc03598g.
10
Proposal for a chemically consistent way to annotate ions arising from the analysis of reference compounds under ESI conditions: A prerequisite to proper mass spectral database constitution in metabolomics.关于在电喷雾电离(ESI)条件下对参考化合物分析产生的离子进行化学一致注释的方法建议:代谢组学中构建合适质谱数据库的先决条件。
J Mass Spectrom. 2019 Jun;54(6):567-582. doi: 10.1002/jms.4372.

引用本文的文献

1
Introducing "Identification Probability" for Automated and Transferable Assessment of Metabolite Identification Confidence in Metabolomics and Related Studies.介绍用于代谢组学及相关研究中代谢物鉴定置信度的自动化和可转移评估的“鉴定概率”。
Anal Chem. 2025 Jan 14;97(1):1-11. doi: 10.1021/acs.analchem.4c04060. Epub 2024 Dec 19.
2
Introducing 'identification probability' for automated and transferable assessment of metabolite identification confidence in metabolomics and related studies.引入“识别概率”用于代谢组学及相关研究中代谢物识别可信度的自动化和可转移评估。
bioRxiv. 2024 Jul 31:2024.07.30.605945. doi: 10.1101/2024.07.30.605945.
3

本文引用的文献

1
High spatial resolution mass spectrometry imaging reveals the genetically programmed, developmental modification of the distribution of thylakoid membrane lipids among individual cells of maize leaf.高空间分辨率质谱成像揭示了玉米叶片单个细胞中类囊体膜脂分布的基因编程发育修饰。
Plant J. 2017 Feb;89(4):825-838. doi: 10.1111/tpj.13422. Epub 2017 Feb 3.
2
FDR-controlled metabolite annotation for high-resolution imaging mass spectrometry.FDR 控制的代谢物注释用于高分辨率成像质谱。
Nat Methods. 2017 Jan;14(1):57-60. doi: 10.1038/nmeth.4072. Epub 2016 Nov 14.
3
MetFrag relaunched: incorporating strategies beyond in silico fragmentation.
Recent advances in proteomics and metabolomics in plants.
植物蛋白质组学和代谢组学的最新进展。
Mol Hortic. 2022 Jul 23;2(1):17. doi: 10.1186/s43897-022-00038-9.
4
Comparison of Compound Identification Tools Using Data Dependent and Data Independent High-Resolution Mass Spectrometry Spectra.使用数据依赖型和数据独立型高分辨率质谱光谱的化合物鉴定工具比较
Metabolites. 2023 Jun 21;13(7):777. doi: 10.3390/metabo13070777.
5
Strategies for structure elucidation of small molecules based on LC-MS/MS data from complex biological samples.基于复杂生物样品的液相色谱-串联质谱数据解析小分子结构的策略。
Comput Struct Biotechnol J. 2022 Sep 7;20:5085-5097. doi: 10.1016/j.csbj.2022.09.004. eCollection 2022.
6
How Well Can We Predict Mass Spectra from Structures? Benchmarking Competitive Fragmentation Modeling for Metabolite Identification on Untrained Tandem Mass Spectra.从结构上预测质谱的能力如何?在未经训练的串联质谱上对代谢物鉴定进行竞争碎片建模的基准测试。
J Chem Inf Model. 2022 Sep 12;62(17):4049-4056. doi: 10.1021/acs.jcim.2c00936. Epub 2022 Aug 31.
7
Convolutional Neural Network-Based Compound Fingerprint Prediction for Metabolite Annotation.基于卷积神经网络的代谢物注释复合指纹预测
Metabolites. 2022 Jun 29;12(7):605. doi: 10.3390/metabo12070605.
8
Deep kernel learning improves molecular fingerprint prediction from tandem mass spectra.深度学习提高串联质谱分子指纹预测。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i342-i349. doi: 10.1093/bioinformatics/btac260.
9
MINE 2.0: enhanced biochemical coverage for peak identification in untargeted metabolomics.MINE 2.0:增强的生物化学覆盖范围,用于非靶向代谢组学中的峰识别。
Bioinformatics. 2022 Jun 27;38(13):3484-3487. doi: 10.1093/bioinformatics/btac331.
10
Automated Annotation of Untargeted All-Ion Fragmentation LC-MS Metabolomics Data with MetaboAnnotatoR.使用 MetaboAnnotatoR 对非靶向全离子碎裂 LC-MS 代谢组学数据进行自动注释。
Anal Chem. 2022 Mar 1;94(8):3446-3455. doi: 10.1021/acs.analchem.1c03032. Epub 2022 Feb 18.
MetFrag重新推出:纳入计算机辅助碎片化之外的策略。
J Cheminform. 2016 Jan 29;8:3. doi: 10.1186/s13321-016-0115-9. eCollection 2016.
4
PubChem Substance and Compound databases.美国国立医学图书馆化学物质数据库和化合物数据库。
Nucleic Acids Res. 2016 Jan 4;44(D1):D1202-13. doi: 10.1093/nar/gkv951. Epub 2015 Sep 22.
5
Searching molecular structure databases with tandem mass spectra using CSI:FingerID.使用CSI:FingerID通过串联质谱搜索分子结构数据库。
Proc Natl Acad Sci U S A. 2015 Oct 13;112(41):12580-5. doi: 10.1073/pnas.1509788112. Epub 2015 Sep 21.
6
MINEs: open access databases of computationally predicted enzyme promiscuity products for untargeted metabolomics.MINEs:用于非靶向代谢组学的计算预测酶多底物催化产物的开放获取数据库。
J Cheminform. 2015 Aug 28;7:44. doi: 10.1186/s13321-015-0087-1. eCollection 2015.
7
CFM-ID: a web server for annotation, spectrum prediction and metabolite identification from tandem mass spectra.CFM-ID:一个用于串联质谱注释、谱预测和代谢物鉴定的网络服务器。
Nucleic Acids Res. 2014 Jul;42(Web Server issue):W94-9. doi: 10.1093/nar/gku436. Epub 2014 Jun 3.
8
InChI - the worldwide chemical structure identifier standard.InChI - 全球化学结构标识符标准。
J Cheminform. 2013 Jan 24;5(1):7. doi: 10.1186/1758-2946-5-7.
9
HMDB 3.0--The Human Metabolome Database in 2013.HMDB 3.0——2013 年的人类代谢物数据库。
Nucleic Acids Res. 2013 Jan;41(Database issue):D801-7. doi: 10.1093/nar/gks1065. Epub 2012 Nov 17.
10
ECMDB: the E. coli Metabolome Database.ECMDB:大肠杆菌代谢组数据库。
Nucleic Acids Res. 2013 Jan;41(Database issue):D625-30. doi: 10.1093/nar/gks992. Epub 2012 Oct 29.