一种非负矩阵分解相关方法的数学比较及其对质谱成像数据分析的实际意义。

A mathematical comparison of non-negative matrix factorization related methods with practical implications for the analysis of mass spectrometry imaging data.

机构信息

STADIUS Center for Dynamical Systems, Signal Processing, and Data Analytics, Department of Electrical Engineering (ESAT), KU Leuven, Leuven, Belgium.

Department of Cellular and Molecular Medicine, KU Leuven Campus Gasthuisberg O&N 2, Leuven, Belgium.

出版信息

Rapid Commun Mass Spectrom. 2021 Nov 15;35(21):e9181. doi: 10.1002/rcm.9181.

DOI:10.1002/rcm.9181

PMID:34374141

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9285509/

Abstract

RATIONALE

Non-negative matrix factorization (NMF) has been used extensively for the analysis of mass spectrometry imaging (MSI) data, visualizing simultaneously the spatial and spectral distributions present in a slice of tissue. The statistical framework offers two related NMF methods: probabilistic latent semantic analysis (PLSA) and latent Dirichlet allocation (LDA), which is a generative model. This work offers a mathematical comparison between NMF, PLSA, and LDA, and includes a detailed evaluation of Kullback-Leibler NMF (KL-NMF) for MSI for the first time. We will inspect the results for MSI data analysis as these different mathematical approaches impose different characteristics on the data and the resulting decomposition.

METHODS

The four methods (NMF, KL-NMF, PLSA, and LDA) are compared on seven different samples: three originated from mice pancreas and four from human-lymph-node tissues, all obtained using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS).

RESULTS

Where matrix factorization methods are often used for the analysis of MSI data, we find that each method has different implications on the exactness and interpretability of the results. We have discovered promising results using KL-NMF, which has only rarely been used for MSI so far, improving both NMF and PLSA, and have shown that the hitherto stated equivalent KL-NMF and PLSA algorithms do differ in the case of MSI data analysis. LDA, assumed to be the better method in the field of text mining, is shown to be outperformed by PLSA in the setting of MALDI-MSI. Additionally, the molecular results of the human-lymph-node data have been thoroughly analyzed for better assessment of the methods under investigation.

CONCLUSIONS

We present an in-depth comparison of multiple NMF-related factorization methods for MSI. We aim to provide fellow researchers in the field of MSI a clear understanding of the mathematical implications using each of these analytical techniques, which might affect the exactness and interpretation of the results.

摘要

原理

非负矩阵分解（NMF）已被广泛用于质谱成像（MSI）数据分析，同时可视化组织切片中的空间和光谱分布。该统计框架提供了两种相关的 NMF 方法：概率潜在语义分析（PLSA）和潜在狄利克雷分配（LDA），这是一种生成模型。这项工作对 NMF、PLSA 和 LDA 进行了数学比较，并首次详细评估了用于 MSI 的柯尔莫哥洛夫-莱布勒 NMF（KL-NMF）。我们将检查这些不同数学方法对 MSI 数据分析的结果，因为这些方法对数据和由此产生的分解施加了不同的特征。

方法

在七种不同的样本上比较了四种方法（NMF、KL-NMF、PLSA 和 LDA）：三种来自小鼠胰腺，四种来自人类淋巴结组织，均使用基质辅助激光解吸/电离飞行时间质谱（MALDI-TOF MS）获得。

结果

在矩阵分解方法常用于 MSI 数据分析的情况下，我们发现每种方法对结果的准确性和可解释性都有不同的影响。我们使用 KL-NMF 发现了有希望的结果，迄今为止，KL-NMF 很少用于 MSI，它提高了 NMF 和 PLSA 的性能，并表明在 MSI 数据分析中，迄今为止所陈述的等效 KL-NMF 和 PLSA 算法确实不同。在 MALDI-MSI 中，在文本挖掘领域被认为是更好的方法的 LDA 被证明不如 PLSA。此外，对人类淋巴结数据的分子结果进行了彻底分析，以便更好地评估所研究的方法。

结论

我们对多种与 NMF 相关的 MSI 因子分解方法进行了深入比较。我们旨在为 MSI 领域的研究人员提供对使用这些分析技术的数学含义的清晰理解，这可能会影响结果的准确性和解释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b75d/9285509/7e01bbf797d6/RCM-35-0-g008.jpg

相似文献

A mathematical comparison of non-negative matrix factorization related methods with practical implications for the analysis of mass spectrometry imaging data.

Rapid Commun Mass Spectrom. 2021 Nov 15;35(21):e9181. doi: 10.1002/rcm.9181.

Nitrogen and Sulfur Co-doped Carbon-Dot-Assisted Laser Desorption/Ionization Time-of-Flight Mass Spectrometry Imaging for Profiling Bisphenol S Distribution in Mouse Tissues.

Anal Chem. 2018 Sep 18;90(18):10872-10880. doi: 10.1021/acs.analchem.8b02362. Epub 2018 Sep 5.

Supervised non-negative matrix factorization methods for MALDI imaging applications.

Bioinformatics. 2019 Jun 1;35(11):1940-1947. doi: 10.1093/bioinformatics/bty909.

Interpretable dimensionality reduction and classification of mass spectrometry imaging data in a visceral pain model via non-negative matrix factorization.

PLoS One. 2024 Oct 10;19(10):e0300526. doi: 10.1371/journal.pone.0300526. eCollection 2024.

Revisiting Species Identification within the Enterobacter cloacae Complex by Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry.

Microbiol Spectr. 2021 Sep 3;9(1):e0066121. doi: 10.1128/Spectrum.00661-21. Epub 2021 Aug 11.

Application of chemometric algorithms to MALDI mass spectrometry imaging of pharmaceutical tablets.

J Pharm Biomed Anal. 2015 Feb;105:91-100. doi: 10.1016/j.jpba.2014.11.047. Epub 2014 Dec 9.

Optimization and evaluation of MALDI TOF mass spectrometric imaging for quantification of orally dosed octreotide in mouse tissues.

Talanta. 2017 Apr 1;165:128-135. doi: 10.1016/j.talanta.2016.12.049. Epub 2016 Dec 21.

Matrix-free mass spectrometry imaging of mouse brain tissue sections on silicon nanopost arrays.

J Comp Neurol. 2019 Sep 1;527(13):2101-2121. doi: 10.1002/cne.24566. Epub 2018 Dec 5.

omniSpect: an open MATLAB-based tool for visualization and analysis of matrix-assisted laser desorption/ionization and desorption electrospray ionization mass spectrometry images.

J Am Soc Mass Spectrom. 2013 Apr;24(4):646-9. doi: 10.1007/s13361-012-0572-y. Epub 2013 Feb 26.

Mass spectrometry imaging of triglycerides in biological tissues by laser desorption ionization from silicon nanopost arrays.

J Mass Spectrom. 2020 Apr;55(4):e4443. doi: 10.1002/jms.4443. Epub 2019 Dec 2.

引用本文的文献

NMFProfiler: a multi-omics integration method for samples stratified in groups.

Bioinformatics. 2025 Feb 4;41(2). doi: 10.1093/bioinformatics/btaf066.

Predicting Protein Pathways Associated to Tumor Heterogeneity by Correlating Spatial Lipidomics and Proteomics: The Dry Proteomic Concept.

Mol Cell Proteomics. 2025 Jan;24(1):100891. doi: 10.1016/j.mcpro.2024.100891. Epub 2024 Dec 5.

Interpretable dimensionality reduction and classification of mass spectrometry imaging data in a visceral pain model via non-negative matrix factorization.

PLoS One. 2024 Oct 10;19(10):e0300526. doi: 10.1371/journal.pone.0300526. eCollection 2024.

Adipose tissue composition determines its computed tomography radiodensity.

Eur Radiol. 2024 Mar;34(3):1635-1644. doi: 10.1007/s00330-023-09911-7. Epub 2023 Sep 1.

Spatiochemical Characterization of the Pancreas Using Mass Spectrometry Imaging and Topological Data Analysis.

Anal Chem. 2023 Jul 18;95(28):10550-10556. doi: 10.1021/acs.analchem.2c05606. Epub 2023 Jul 4.

Automated Library Generation and Serendipity Quantification Enables Diverse Discovery in Coordination Chemistry.

J Am Chem Soc. 2023 Feb 1;145(4):2332-2341. doi: 10.1021/jacs.2c11066. Epub 2023 Jan 17.

Emerging Computational Methods in Mass Spectrometry Imaging.

Adv Sci (Weinh). 2022 Dec;9(34):e2203339. doi: 10.1002/advs.202203339. Epub 2022 Oct 17.

本文引用的文献

Spatial Metabolomics and Imaging Mass Spectrometry in the Age of Artificial Intelligence.

Annu Rev Biomed Data Sci. 2020 Jul;3:61-87. doi: 10.1146/annurev-biodatasci-011420-031537. Epub 2020 Apr 13.

Prioritization of /-Values in Mass Spectrometry Imaging Profiles Obtained Using Uniform Manifold Approximation and Projection for Dimensionality Reduction.

Anal Chem. 2020 Apr 7;92(7):5240-5248. doi: 10.1021/acs.analchem.9b05764. Epub 2020 Mar 20.

Unsupervised machine learning for exploratory data analysis in imaging mass spectrometry.

Mass Spectrom Rev. 2020 May;39(3):245-291. doi: 10.1002/mas.21602. Epub 2019 Oct 11.

Evaluation of Distance Metrics and Spatial Autocorrelation in Uniform Manifold Approximation and Projection Applied to Mass Spectrometry Imaging Data.

Anal Chem. 2019 May 7;91(9):5706-5714. doi: 10.1021/acs.analchem.8b05827. Epub 2019 Apr 25.

Supervised non-negative matrix factorization methods for MALDI imaging applications.

Bioinformatics. 2019 Jun 1;35(11):1940-1947. doi: 10.1093/bioinformatics/bty909.

Quantifying biological samples using Linear Poisson Independent Component Analysis for MALDI-ToF mass spectra.

Bioinformatics. 2018 Mar 15;34(6):1001-1008. doi: 10.1093/bioinformatics/btx630.

Unsupervised Discovery and Comparison of Structural Families Across Multiple Samples in Untargeted Metabolomics.

Anal Chem. 2017 Jul 18;89(14):7569-7577. doi: 10.1021/acs.analchem.7b01391. Epub 2017 Jul 5.

Matrix Factorization Techniques for Analysis of Imaging Mass Spectrometry Data.

Proc IEEE Int Symp Bioinformatics Bioeng. 2008 Oct;2008. doi: 10.1109/BIBE.2008.4696797. Epub 2008 Dec 8.

Regularized Non-Negative Matrix Factorization for Identifying Differentially Expressed Genes and Clustering Samples: A Survey.

IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):974-987. doi: 10.1109/TCBB.2017.2665557. Epub 2017 Feb 7.

Topic modeling for untargeted substructure exploration in metabolomics.

Proc Natl Acad Sci U S A. 2016 Nov 29;113(48):13738-13743. doi: 10.1073/pnas.1608041113. Epub 2016 Nov 16.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种非负矩阵分解相关方法的数学比较及其对质谱成像数据分析的实际意义。

A mathematical comparison of non-negative matrix factorization related methods with practical implications for the analysis of mass spectrometry imaging data.

机构信息

出版信息

RATIONALE

METHODS

RESULTS

CONCLUSIONS

原理

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献