• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用高斯混合模型的天然有机物高分辨率质谱噪声滤波算法

Noise Filtering Algorithm Using Gaussian Mixture Models for High-Resolution Mass Spectra of Natural Organic Matter.

作者信息

Potemkin Alexander A, Proskurnin Mikhail A, Volkov Dmitry S

机构信息

Chemistry Department of M.V. Lomonosov Moscow State University, Leninskie Gory, 1-3, GSP-1, Moscow 119991, Russia.

出版信息

Anal Chem. 2024 Apr 9;96(14):5455-5461. doi: 10.1021/acs.analchem.3c05453. Epub 2024 Mar 26.

DOI:10.1021/acs.analchem.3c05453
PMID:38530650
Abstract

High-resolution mass spectra of natural organic matter (NOM) contain a large number of noise signals. These signals interfere with the correct molecular composition estimation during nontargeted analysis because formula-assignment programs find empirical formulas for such peaks as well. Previously proposed noise filtering methods that utilize the profile of the intensity distribution of mass spectrum peaks rely on a histogram to calculate the intensity threshold value. However, the histogram profile can vary depending on the user settings. In addition, these algorithms are not automated, so they are handled manually. To overcome the mentioned drawbacks, we propose a new algorithm for noise filtering in mass spectra. This filter is based on Gaussian Mixture Models (GMMs), a machine learning method to find the intensity threshold value. The algorithm is completely data-driven and eliminates the need to work with a histogram. It has no customizable parameters and automatically determines the noise level for each individual mass spectrum. The algorithm performance was tested on mass spectra of natural organic matter obtained by averaging a different number of microscans (transients), and the results were compared with other noise filters proposed in the literature. Finally, the effect of this noise filtering approach on the fraction of peaks with assigned formulas was investigated. It was shown that there is always an increase in the identification rate, but the magnitude of the effect changes with the number of microscans averaged. The increase can be as high as 15%.

摘要

天然有机物(NOM)的高分辨率质谱包含大量噪声信号。在非靶向分析过程中,这些信号会干扰正确的分子组成估计,因为分子式分配程序也会为这类峰找到经验分子式。先前提出的利用质谱峰强度分布轮廓的噪声过滤方法依赖直方图来计算强度阈值。然而,直方图轮廓可能会因用户设置而有所不同。此外,这些算法不是自动化的,因此需要手动处理。为了克服上述缺点,我们提出了一种用于质谱噪声过滤的新算法。该滤波器基于高斯混合模型(GMM),这是一种用于找到强度阈值的机器学习方法。该算法完全由数据驱动,无需使用直方图。它没有可定制的参数,并能自动为每个单独的质谱确定噪声水平。我们在通过平均不同数量的微扫描(瞬态)获得的天然有机物质谱上测试了该算法的性能,并将结果与文献中提出的其他噪声滤波器进行了比较。最后,研究了这种噪声过滤方法对已分配分子式的峰比例的影响。结果表明,识别率总是会提高,但影响的程度会随着平均微扫描次数的变化而变化。提高幅度可达15%。

相似文献

1
Noise Filtering Algorithm Using Gaussian Mixture Models for High-Resolution Mass Spectra of Natural Organic Matter.使用高斯混合模型的天然有机物高分辨率质谱噪声滤波算法
Anal Chem. 2024 Apr 9;96(14):5455-5461. doi: 10.1021/acs.analchem.3c05453. Epub 2024 Mar 26.
2
Total mass difference statistics algorithm: a new approach to identification of high-mass building blocks in electrospray ionization Fourier transform ion cyclotron mass spectrometry data of natural organic matter.总质量差异统计算法:一种鉴定天然有机物电喷雾电离傅里叶变换离子回旋共振质谱数据中高分子量构建块的新方法。
Anal Chem. 2009 Dec 15;81(24):10106-15. doi: 10.1021/ac901476u.
3
MFAssignR: Molecular formula assignment software for ultrahigh resolution mass spectrometry analysis of environmental complex mixtures.MFAssignR:用于环境复杂混合物超高分辨率质谱分析的分子式分配软件。
Environ Res. 2020 Dec;191:110114. doi: 10.1016/j.envres.2020.110114. Epub 2020 Aug 28.
4
Development of a Gaussian-Based Alignment Algorithm for the Ultrahigh-Resolution Mass Spectra of Dissolved Organic Matter.基于高斯分布的溶解有机物超高分辨率质谱对齐算法的开发。
Anal Chem. 2023 Feb 7;95(5):2796-2803. doi: 10.1021/acs.analchem.2c04113. Epub 2023 Jan 23.
5
Filtering of MS/MS data for peptide identification.用于肽段鉴定的MS/MS数据过滤
BMC Genomics. 2013;14 Suppl 7(Suppl 7):S2. doi: 10.1186/1471-2164-14-S7-S2. Epub 2013 Nov 5.
6
Machine-learning assisted molecular formula assignment to high-resolution mass spectrometry data of dissolved organic matter.机器学习辅助将分子公式分配给溶解有机物的高分辨率质谱数据。
Talanta. 2023 Jul 1;259:124484. doi: 10.1016/j.talanta.2023.124484. Epub 2023 Mar 24.
7
Sulfur organic compounds in bottom sediments of the eastern Gulf of Finland.芬兰湾东部底部沉积物中的硫有机化合物。
Environ Sci Pollut Res Int. 2007 Sep;14(6):366-76. doi: 10.1065/espr2006.08.334.
8
A method detection limit for the analysis of natural organic matter via Fourier transform ion cyclotron resonance mass spectrometry.一种通过傅里叶变换离子回旋共振质谱法分析天然有机物的方法检测限。
Anal Chem. 2014 Aug 19;86(16):8376-82. doi: 10.1021/ac501946m. Epub 2014 Aug 6.
9
Development and comparison of formula assignment algorithms for ultrahigh-resolution mass spectra of natural organic matter.发展和比较天然有机物超高分辨率质谱公式赋值算法。
Anal Chim Acta. 2020 Aug 15;1125:247-257. doi: 10.1016/j.aca.2020.05.048. Epub 2020 May 24.
10
Applications of fractional lower order S transform time frequency filtering algorithm to machine fault diagnosis.分数低阶S变换时频滤波算法在机械故障诊断中的应用
PLoS One. 2017 Apr 13;12(4):e0175202. doi: 10.1371/journal.pone.0175202. eCollection 2017.

引用本文的文献

1
Simplistic Software for Analyzing Mass Spectra and a Mixed Experimental-Theoretical Database for Identifying Poisonous and Explosive Substances.用于质谱分析的简易软件以及用于识别有毒和爆炸物质的实验与理论混合数据库。
J Comput Chem. 2025 Jun 30;46(17):e70148. doi: 10.1002/jcc.70148.
2
Evaluation of the current methods for assigning molecular formulas to dissolved organic matter using high resolution mass spectrometry.使用高分辨率质谱法评估当前为溶解有机物分配分子式的方法。
Sci Rep. 2025 Mar 8;15(1):8105. doi: 10.1038/s41598-025-87539-x.