• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

超越主成分分析:化学中子空间建模的因子分析方法的批判性比较。

Beyond principal components: a critical comparison of factor analysis methods for subspace modelling in chemistry.

机构信息

Trace Analysis Research Centre, Department of Chemistry, Dalhousie University, PO Box 15000, Halifax, NS, B3H 4R2, Canada.

Institute for Advanced Study in Basic Sciences, GavaZang, Zanjan, 45137-66731, Iran.

出版信息

Anal Methods. 2021 Sep 30;13(37):4188-4219. doi: 10.1039/d1ay01124c.

DOI:10.1039/d1ay01124c
PMID:34473142
Abstract

Multivariate data analysis tools have become an integral part of modern analytical chemistry, and principal component analysis (PCA) is perhaps foremost among these. PCA is central in approaching many problems in data exploration, classification, calibration, modelling, and curve resolution. However, PCA is only one form of a broader group of factor analysis (FA) methods that are rarely employed by chemists. The dominance of PCA in chemistry is primarily a consequence of history and convenience, but this has obscured the potential advantages of other FA tools that are widely used in other fields. The purpose of this article, which is intended for those who are already familiar with the mathematical foundations and applications of PCA, is to develop a framework to relate PCA to other commonly used FA methods from the perspective of chemical applications. Specifically, PCA is compared to maximum likelihood factor analysis (MLFA), principal axis factorization (PAF) and maximum likelihood PCA (MLPCA). Similarities and differences are highlighted with regard to the assumptions and constraints of the models, algorithms employed, and calculation of scores and loadings. Practical aspects such as data dimensionality, preprocessing, rank estimation, improper solutions (Heywood cases), and software implementation are considered. The performance of the four methods is compared using both simulated and experimental data sets. While PCA provides the most reliable estimates when measurement error variance is uniform (homoscedastic noise) and MLPCA works best when the error covariance matrix is explicitly known, MLFA and PAF have the distinct advantage of providing information about measurement uncertainty and adapting to situations of unknown heteroscedastic errors, eliminating the need for scaling. Moreover, MLFA in particular is shown to be tolerant to deviations from model linearity. These results make a strong case for increased application of other FA methods in chemistry.

摘要

多元数据分析工具已成为现代分析化学不可或缺的一部分,而主成分分析(PCA)可能是其中最重要的一种。PCA 在数据探索、分类、校准、建模和曲线解析等许多问题中处于核心地位。然而,PCA 只是更广泛的因子分析(FA)方法组中的一种形式,化学家很少使用。PCA 在化学中的主导地位主要是历史和便利性的结果,但这掩盖了其他 FA 工具在其他领域广泛应用的潜在优势。本文旨在为那些已经熟悉 PCA 的数学基础和应用的人提供一个框架,从化学应用的角度来比较 PCA 与其他常用 FA 方法。具体而言,将 PCA 与最大似然因子分析(MLFA)、主轴因子分解(PAF)和最大似然 PCA(MLPCA)进行比较。在模型的假设和约束、所采用的算法以及得分和载荷的计算方面,突出了它们之间的异同。还考虑了数据维度、预处理、秩估计、不适当的解(Heywood 情况)和软件实现等实际方面。使用模拟和实验数据集比较了这四种方法的性能。当测量误差方差均匀(同方差噪声)时,PCA 提供最可靠的估计,而当明确知道误差协方差矩阵时,MLPCA 效果最佳,MLFA 和 PAF 具有明显的优势,可提供有关测量不确定性的信息,并适应未知异方差误差的情况,无需缩放。此外,MLFA 尤其显示出对模型线性度偏差的容忍性。这些结果强烈表明应在化学中更多地应用其他 FA 方法。

相似文献

1
Beyond principal components: a critical comparison of factor analysis methods for subspace modelling in chemistry.超越主成分分析:化学中子空间建模的因子分析方法的批判性比较。
Anal Methods. 2021 Sep 30;13(37):4188-4219. doi: 10.1039/d1ay01124c.
2
Asymptotic performance of PCA for high-dimensional heteroscedastic data.主成分分析(PCA)对高维异方差数据的渐近性能
J Multivar Anal. 2018 Sep;167:435-452. doi: 10.1016/j.jmva.2018.06.002. Epub 2018 Jun 19.
3
Matrix factorization algorithms for the identification of muscle synergies: evaluation on simulated and experimental data sets.用于识别肌肉协同作用的矩阵分解算法:对模拟数据集和实验数据集的评估
J Neurophysiol. 2006 Apr;95(4):2199-212. doi: 10.1152/jn.00222.2005. Epub 2006 Jan 4.
4
Multivariate analysis of neuronal interactions in the generalized partial least squares framework: simulations and empirical studies.广义偏最小二乘框架下神经元相互作用的多变量分析:模拟与实证研究
Neuroimage. 2003 Oct;20(2):625-42. doi: 10.1016/S1053-8119(03)00333-1.
5
Principal Component Analysis and Factor Analysis: differences and similarities in Nutritional Epidemiology application.主成分分析和因子分析:营养流行病学应用中的差异与相似之处
Rev Bras Epidemiol. 2019 Jul 29;22:e190041. doi: 10.1590/1980-549720190041.
6
Maximum likelihood multivariate calibration.
Anal Chem. 1997 Jul 1;69(13):2299-311. doi: 10.1021/ac961029h.
7
On applicability of PCA, voxel-wise variance normalization and dimensionality assumptions for sliding temporal window sICA in resting-state fMRI.关于 PCA、体素方差归一化和滑动时间窗口独立成分分析在静息态 fMRI 中的维度假设的适用性。
Magn Reson Imaging. 2013 Oct;31(8):1338-48. doi: 10.1016/j.mri.2013.06.002. Epub 2013 Jul 8.
8
Differential analysis of FA-NNC, PCA-MLR, and PMF methods applied in source apportionment of PAHs in street dust.FA-NNC、PCA-MLR 和 PMF 方法在街道灰尘中多环芳烃源解析中的应用的差异分析。
Environ Monit Assess. 2020 Oct 24;192(11):727. doi: 10.1007/s10661-020-08679-3.
9
Principal component-based feature selection for tumor classification.基于主成分的肿瘤分类特征选择
Biomed Mater Eng. 2015;26 Suppl 1:S2011-7. doi: 10.3233/BME-151505.
10
Principal Component Analysis based on Nuclear norm Minimization.基于核范数最小化的主成分分析。
Neural Netw. 2019 Oct;118:1-16. doi: 10.1016/j.neunet.2019.05.020. Epub 2019 Jun 8.

引用本文的文献

1
Prospective Harmonization, Common Data Elements, and Sharing Strategies for Multicenter Pre-Clinical Traumatic Brain Injury Research in the Translational Outcomes Project in Neurotrauma Consortium.神经创伤联盟转化结果项目中多中心临床前创伤性脑损伤研究的前瞻性协调、通用数据元素及共享策略
J Neurotrauma. 2025 May;42(9-10):877-897. doi: 10.1089/neu.2023.0653. Epub 2025 Jan 20.
2
On the Importance of Investigating Data Structure in Miniaturized NIR Spectroscopy Measurements of Food: The Case Study of Sugar.论食品近红外光谱小型化测量中数据结构研究的重要性:以糖为例
Foods. 2023 Jan 20;12(3):493. doi: 10.3390/foods12030493.