Exploration of Principal Component Analysis: Deriving Principal Component Analysis Visually Using Spectra.

Affiliations

J Renwick Beattie Consulting, Ballycastle, UK.

Esmonde-White Technologies, Ann Arbor, MI, USA.

Publication information

Appl Spectrosc. 2021 Apr;75(4):361-375. doi: 10.1177/0003702820987847. Epub 2021 Jan 22.

DOI:10.1177/0003702820987847

PMID:33393349

Abstract

Spectroscopy rapidly captures a large amount of data that is not directly interpretable. Principal component analysis is widely used to simplify complex spectral datasets into comprehensible information by identifying recurring patterns in the data with minimal loss of information. The linear algebra underpinning principal component analysis is not well understood by many applied analytical scientists and spectroscopists who use principal component analysis. The meaning of features identified through principal component analysis is often unclear. This manuscript traces the journey of the spectra themselves through the operations behind principal component analysis, with each step illustrated by simulated spectra. Principal component analysis relies solely on the information within the spectra; consequently, the mathematical model depends on the nature of the data itself. The direct links between model and spectra allow concrete spectroscopic explanation of principal component analysis, such as the scores representing "concentration" or "weights". The principal components (loadings) are by definition hidden, repeated and uncorrelated spectral shapes that linearly combine to generate the observed spectra. They can be visualized as subtraction spectra between extreme differences within the dataset. Each PC is shown to be a successive refinement of the estimated spectra, improving the fit between PC-reconstructed data and the original data. Understanding the data-led development of a principal component analysis model shows how to interpret the application-specific chemical meaning of the principal component analysis loadings and how to analyze scores. A critical benefit of principal component analysis is its simplicity and the succinctness of its description of a dataset, making it powerful and flexible.
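The ideas in the abstract can be tried out on toy data. The following is a minimal sketch, not the authors' procedure: it assumes two hypothetical Gaussian bands as "pure" component spectra, mixes them with random concentrations plus noise, and computes principal component analysis through a singular value decomposition of the mean-centered spectra. All function names, band positions, and noise levels are illustrative assumptions.

```python
import numpy as np


def gaussian_band(x, center, width):
    """A single Gaussian-shaped spectral band (hypothetical pure-component shape)."""
    return np.exp(-0.5 * ((x - center) / width) ** 2)


def simulate_spectra(n_spectra=50, n_points=300, noise=0.01, seed=0):
    """Simulated spectra: random mixtures of two assumed bands plus white noise."""
    rng = np.random.default_rng(seed)
    x = np.linspace(0.0, 100.0, n_points)
    pure = np.vstack([gaussian_band(x, 35.0, 5.0), gaussian_band(x, 60.0, 8.0)])
    conc = rng.uniform(0.2, 1.0, size=(n_spectra, 2))  # "concentrations"
    spectra = conc @ pure + noise * rng.standard_normal((n_spectra, n_points))
    return x, spectra


def pca(spectra):
    """PCA via SVD of the mean-centered data matrix (rows are spectra)."""
    mean = spectra.mean(axis=0)
    centered = spectra - mean
    u, s, vt = np.linalg.svd(centered, full_matrices=False)
    scores = u * s   # one weight per spectrum per PC
    loadings = vt    # each row is a spectral shape (one PC)
    return mean, scores, loadings


def reconstruction_rmse(spectra, mean, scores, loadings, k):
    """RMS error of the reconstruction that uses the mean plus the first k PCs."""
    approx = mean + scores[:, :k] @ loadings[:k]
    return float(np.sqrt(np.mean((spectra - approx) ** 2)))


x, spectra = simulate_spectra()
mean, scores, loadings = pca(spectra)

# Each additional PC refines the estimate, so the error never increases with k.
rmse_by_k = [reconstruction_rmse(spectra, mean, scores, loadings, k) for k in range(4)]
for k, err in enumerate(rmse_by_k):
    print(f"PCs used: {k}  reconstruction RMSE: {err:.5f}")
```

In this sketch the rows of `loadings` are mutually orthogonal spectral shapes and the columns of `scores` are the per-spectrum weights on those shapes. Because only two bands vary in the simulated data, the error should fall to roughly the noise level after two PCs, and later PCs mostly fit noise.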

Similar articles

Exploration of Principal Component Analysis: Deriving Principal Component Analysis Visually Using Spectra.

Appl Spectrosc. 2021 Apr;75(4):361-375. doi: 10.1177/0003702820987847. Epub 2021 Jan 22.

Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification.

Principal component analysis or kernel principal component analysis based joint spectral subspace method for calibration transfer.

Spectrochim Acta A Mol Biomol Spectrosc. 2020 Feb 15;227:117653. doi: 10.1016/j.saa.2019.117653. Epub 2019 Oct 18.

PC 2D-COS: A Principal Component Base Approach to Two-Dimensional Correlation Spectroscopy.

Appl Spectrosc. 2020 Apr;74(4):460-472. doi: 10.1177/0003702819891194. Epub 2020 Feb 19.

Understanding the molecular information contained in principal component analysis of vibrational spectra of biological systems.

Analyst. 2012 Jan 21;137(2):322-32. doi: 10.1039/c1an15821j. Epub 2011 Nov 24.

Laser-Induced Breakdown Spectroscopy and Principal Component Analysis for the Classification of Spectra from Gold-Bearing Ores.

Appl Spectrosc. 2020 Jan;74(1):42-54. doi: 10.1177/0003702819881444. Epub 2019 Nov 7.

Kernel principal component analysis residual diagnosis (KPCARD): An automated method for cosmic ray artifact removal in Raman spectra.

Anal Chim Acta. 2016 Mar 24;913:111-20. doi: 10.1016/j.aca.2016.01.042. Epub 2016 Jan 27.

Comparing patterns of component loadings: principal component analysis (PCA) versus independent component analysis (ICA) in analyzing multivariate non-normal data.

Behav Res Methods. 2012 Dec;44(4):1239-43. doi: 10.3758/s13428-012-0193-1.

Effect of Principal Component Analysis Centering and Scaling on Classification of Mycobacteria from Raman Spectra.

Appl Spectrosc. 2017 Jun;71(6):1249-1255. doi: 10.1177/0003702816678867. Epub 2016 Nov 25.

Quantitation of resonances in biological 31P NMR spectra via principal component analysis: potential and limitations.

NMR Biomed. 1996 May;9(3):93-104. doi: 10.1002/(SICI)1099-1492(199605)9:3<93::AID-NBM410>3.0.CO;2-D.

Cited by

Prediction of Soil Properties Using Vis-NIR Spectroscopy Combined with Machine Learning: A Review.

Sensors (Basel). 2025 Aug 14;25(16):5045. doi: 10.3390/s25165045.

Bidirectional decision analysis of online Ride-hailing enterprises based on fuzzy theory and cloud model.

Sci Rep. 2025 Aug 20;15(1):30544. doi: 10.1038/s41598-025-15908-7.

Integrative bioinformatics and machine learning approaches reveal oxidative stress and glucose metabolism related genes as therapeutic targets and drug candidates in Alzheimer's disease.

Front Immunol. 2025 Jun 26;16:1572468. doi: 10.3389/fimmu.2025.1572468. eCollection 2025.

Purslane-Fortified Yogurt: In-Line Process Control by FT-NIR Spectroscopy and Storage Monitoring.

Foods. 2025 Jun 11;14(12):2053. doi: 10.3390/foods14122053.

Exploring Generative Artificial Intelligence and Data Augmentation Techniques for Spectroscopy Analysis.

Chem Rev. 2025 Jul 9;125(13):6130-6155. doi: 10.1021/acs.chemrev.4c00815. Epub 2025 Jun 23.

Spatial data intelligence and city metaverse: A review.

Fundam Res. 2023 Dec 28;5(3):1169-1193. doi: 10.1016/j.fmre.2023.10.014. eCollection 2025 May.

HSQC-NMR spectroscopy and exploratory data analysis of crude oil residue in relation to the time of spill.

RSC Adv. 2025 Jun 4;15(24):18910-18919. doi: 10.1039/d5ra00826c.

Unraveling Polymorphic Control in the Solid-State [2 + 2] Cycloaddition of Vitamin K: Insights from Single-Crystal Irradiation.

J Am Chem Soc. 2025 Jun 18;147(24):21109-21120. doi: 10.1021/jacs.5c06303. Epub 2025 Jun 4.

Identifying the Geographical Origin of Wolfberry Using Near-Infrared Spectroscopy and Stacking-Orthogonal Linear Discriminant Analysis.

Foods. 2025 May 9;14(10):1684. doi: 10.3390/foods14101684.

Study on Detection Method of Sulfamethazine Residues in Duck Blood Based on Surface-Enhanced Raman Spectroscopy.

Biosensors (Basel). 2025 May 1;15(5):286. doi: 10.3390/bios15050286.
