• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于气相色谱-质谱联用代谢组学中相似性差异的错误识别发现

Discovery of False Identification Using Similarity Difference in GC-MS based Metabolomics.

作者信息

Kim Seongho, Zhang Xiang

机构信息

Biostatistics Core, Karmanos Cancer Institute, Department of Oncology, Wayne State University, Detroit, MI, 48201, USA.

Department of Chemistry, University of Louisville, Louisville, KY, 40292, USA.

出版信息

J Chemom. 2015 Feb 1;29(2):80-86. doi: 10.1002/cem.2665.

DOI:10.1002/cem.2665
PMID:25937705
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4414261/
Abstract

Compound identification is a critical process in metabolomics. The widely used approach for compound identification in gas chromatography-mass spectrometry (GC-MS) based metabolomics is the spectrum matching, in which the mass spectral similarity between an experimental mass spectrum and each mass spectrum in a reference library is calculated. While various similarity measures have been developed to improve the overall accuracy of compound identification, little attention has been paid to reducing the false discovery rate. We, therefore, develop an approach for controlling false identification rate using the distribution of the difference between the first and the second highest spectral similarity scores. We further propose a model-based approach to achieving a desired true positive rate. The developed method is applied to the NIST mass spectral library and its performance is compared with the conventional approach that uses only the maximum spectral similarity score. The results show that the developed method achieves a significantly higher 1 score and positive predictive value than those of the conventional approach.

摘要

化合物鉴定是代谢组学中的一个关键过程。在基于气相色谱-质谱联用(GC-MS)的代谢组学中,广泛使用的化合物鉴定方法是谱图匹配,即计算实验质谱与参考库中每个质谱之间的质谱相似性。虽然已经开发了各种相似性度量方法来提高化合物鉴定的整体准确性,但对于降低错误发现率却很少有人关注。因此,我们开发了一种利用第一和第二高谱图相似性分数之间差异的分布来控制错误鉴定率的方法。我们还进一步提出了一种基于模型的方法来实现所需的真阳性率。将所开发的方法应用于美国国家标准与技术研究院(NIST)质谱库,并将其性能与仅使用最高谱图相似性分数的传统方法进行比较。结果表明,所开发的方法比传统方法具有显著更高的F1分数和阳性预测值。

相似文献

1
Discovery of False Identification Using Similarity Difference in GC-MS based Metabolomics.基于气相色谱-质谱联用代谢组学中相似性差异的错误识别发现
J Chemom. 2015 Feb 1;29(2):80-86. doi: 10.1002/cem.2665.
2
Gaussian Mixture Modeling Extensions for Improved False Discovery Rate Estimation in GC-MS Metabolomics.高斯混合模型扩展在 GC-MS 代谢组学中提高错误发现率估计的应用。
J Am Soc Mass Spectrom. 2023 Jun 7;34(6):1096-1104. doi: 10.1021/jasms.3c00039. Epub 2023 Apr 21.
3
Compound identification using partial and semipartial correlations for gas chromatography-mass spectrometry data.采用气相色谱-质谱联用数据的偏相关和半偏相关进行化合物鉴定。
Anal Chem. 2012 Aug 7;84(15):6477-87. doi: 10.1021/ac301350n. Epub 2012 Jul 26.
4
A method of finding optimal weight factors for compound identification in gas chromatography-mass spectrometry.一种在气相色谱-质谱联用中用于寻找化合物鉴定的最优权重因子的方法。
Bioinformatics. 2012 Apr 15;28(8):1158-63. doi: 10.1093/bioinformatics/bts083. Epub 2012 Feb 13.
5
Comparative analysis of mass spectral matching-based compound identification in gas chromatography-mass spectrometry.基于质谱匹配的气相色谱-质谱联用化合物鉴定的比较分析。
J Chromatogr A. 2013 Jul 12;1298:132-8. doi: 10.1016/j.chroma.2013.05.021. Epub 2013 May 13.
6
Isotope cluster-based compound matching in gas chromatography/mass spectrometry for non-targeted metabolomics.基于同位素簇的气相色谱/质谱联用化合物匹配在非靶向代谢组学中的应用。
Anal Chem. 2013 Apr 16;85(8):4030-7. doi: 10.1021/ac303774z. Epub 2013 Apr 5.
7
A Novel Approach of Matching Mass-to-Charge Ratio for Compound Identification in Gas Chromatography-Mass Spectrometry.气相色谱-质谱联用中用于化合物鉴定的质荷比匹配新方法。
J AOAC Int. 2019 Mar 1;102(2):638-645. doi: 10.5740/jaoacint.18-0261. Epub 2018 Nov 16.
8
Spectral entropy outperforms MS/MS dot product similarity for small-molecule compound identification.光谱熵在小分子化合物鉴定方面优于 MS/MS 点积相似度。
Nat Methods. 2021 Dec;18(12):1524-1531. doi: 10.1038/s41592-021-01331-z. Epub 2021 Dec 2.
9
Comparative analysis of mass spectral similarity measures on peak alignment for comprehensive two-dimensional gas chromatography mass spectrometry.全面二维气相色谱质谱联用峰对齐的质谱相似性度量的比较分析。
Comput Math Methods Med. 2013;2013:509761. doi: 10.1155/2013/509761. Epub 2013 Sep 16.
10
Compound identification in GC-MS by simultaneously evaluating the mass spectrum and retention index.通过同时评估质谱图和保留指数进行气相色谱-质谱联用中的化合物鉴定。
Analyst. 2014 May 21;139(10):2507-14. doi: 10.1039/c3an02171h.

引用本文的文献

1
Characterizing Families of Spectral Similarity Scores and Their Use Cases for Gas Chromatography-Mass Spectrometry Small Molecule Identification.表征光谱相似性得分家族及其在气相色谱-质谱小分子鉴定中的应用案例
Metabolites. 2023 Oct 21;13(10):1101. doi: 10.3390/metabo13101101.
2
Missing data in multi-omics integration: Recent advances through artificial intelligence.多组学整合中的缺失数据:通过人工智能取得的最新进展
Front Artif Intell. 2023 Feb 9;6:1098308. doi: 10.3389/frai.2023.1098308. eCollection 2023.
3
Potential of the Acetone Leaf Extract and Fractions of (Eckl.) Vatke (Rubiaceae) to Combat Co-Infection of Tuberculosis and Helminthiasis.

本文引用的文献

1
Compound identification in GC-MS by simultaneously evaluating the mass spectrum and retention index.通过同时评估质谱图和保留指数进行气相色谱-质谱联用中的化合物鉴定。
Analyst. 2014 May 21;139(10):2507-14. doi: 10.1039/c3an02171h.
2
Estimating probabilities of correct identification from results of mass spectral library searches.从质谱文库检索结果估计正确识别的概率。
J Am Soc Mass Spectrom. 1994 Apr;5(4):316-23. doi: 10.1016/1044-0305(94)85022-4.
3
Optimization and testing of mass spectral library search algorithms for compound identification.
丙酮叶提取物及(Eckl.)Vatke(茜草科)各馏分对抗结核病和蠕虫病合并感染的潜力。
Front Pharmacol. 2022 Jan 11;12:744137. doi: 10.3389/fphar.2021.744137. eCollection 2021.
4
Optimization of the Data Treatment Steps of a Non-targeted LC-MS-Based Workflow for the Identification of Trace Chemical Residues in Honey.基于非靶向液相色谱-质谱联用工作流程的数据处理步骤优化,用于鉴定蜂蜜中的痕量化学残留物。
J Am Soc Mass Spectrom. 2019 May;30(5):765-777. doi: 10.1007/s13361-019-02157-y. Epub 2019 Mar 14.
化合物鉴定的质谱文库搜索算法的优化和测试。
J Am Soc Mass Spectrom. 1994 Sep;5(9):859-66. doi: 10.1016/1044-0305(94)87009-8.
4
RAMSY: ratio analysis of mass spectrometry to improve compound identification.RAMSY:质谱比分析提高化合物鉴定能力。
Anal Chem. 2013 Nov 19;85(22):10771-9. doi: 10.1021/ac4019268. Epub 2013 Oct 29.
5
Method for assessing the statistical significance of mass spectral similarities using basic local alignment search tool statistics.使用基本局部比对搜索工具统计学评估质谱相似度的统计显著性的方法。
Anal Chem. 2013 Sep 3;85(17):8291-7. doi: 10.1021/ac401564v. Epub 2013 Aug 14.
6
Comparative analysis of mass spectral matching-based compound identification in gas chromatography-mass spectrometry.基于质谱匹配的气相色谱-质谱联用化合物鉴定的比较分析。
J Chromatogr A. 2013 Jul 12;1298:132-8. doi: 10.1016/j.chroma.2013.05.021. Epub 2013 May 13.
7
Isotope cluster-based compound matching in gas chromatography/mass spectrometry for non-targeted metabolomics.基于同位素簇的气相色谱/质谱联用化合物匹配在非靶向代谢组学中的应用。
Anal Chem. 2013 Apr 16;85(8):4030-7. doi: 10.1021/ac303774z. Epub 2013 Apr 5.
8
Compound identification using partial and semipartial correlations for gas chromatography-mass spectrometry data.采用气相色谱-质谱联用数据的偏相关和半偏相关进行化合物鉴定。
Anal Chem. 2012 Aug 7;84(15):6477-87. doi: 10.1021/ac301350n. Epub 2012 Jul 26.
9
A method of finding optimal weight factors for compound identification in gas chromatography-mass spectrometry.一种在气相色谱-质谱联用中用于寻找化合物鉴定的最优权重因子的方法。
Bioinformatics. 2012 Apr 15;28(8):1158-63. doi: 10.1093/bioinformatics/bts083. Epub 2012 Feb 13.
10
An empirical Bayes model using a competition score for metabolite identification in gas chromatography mass spectrometry.基于竞争得分的气相色谱-质谱代谢物鉴定的经验贝叶斯模型。
BMC Bioinformatics. 2011 Oct 10;12:392. doi: 10.1186/1471-2105-12-392.