Gao Hong-Tao, Li Tong-Hua, Chen Kai, Li Wei-Guang, Bi Xian
Department of Chemistry, Tongji University, China; Department of Chemistry, Jining Teachers College, Jining Shandong, China.
Talanta. 2005 Mar 31;66(1):65-73. doi: 10.1016/j.talanta.2004.09.017.
Non-negative matrix factorization (NMF), with the constraints of non-negativity, has been recently proposed for multi-variate data analysis. Because it allows only additive, not subtractive, combinations of the original data, NMF is capable of producing region or parts-based representation of objects. It has been used for image analysis and text processing. Unlike PCA, the resolutions of NMF are non-negative and can be easily interpreted and understood directly. Due to multiple solutions, the original algorithm of NMF [D.D. Lee, H.S. Seung, Nature 401 (1999) 788] is not suitable for resolving chemical mixed signals. In reality, NMF has never been applied to resolving chemical mixed signals. It must be modified according to the characteristics of the chemical signals, such as smoothness of spectra, unimodality of chromatograms, sparseness of mass spectra, etc. We have used the modified NMF algorithm to narrow the feasible solution region for resolving chemical signals, and found that it could produce reasonable and acceptable results for certain experimental errors, especially for overlapping chromatograms and sparse mass spectra. Simulated two-dimensional (2-D) data and real GUJINGGONG alcohol liquor GC-MS data have been resolved soundly by NMF technique. Butyl caproate and its isomeric compound (butyric acid, hexyl ester) have been identified from the overlapping spectra. The result of NMF is preferable to that of Heuristic evolving latent projections (HELP). It shows that NMF is a promising chemometric resolution method for complex samples.
非负矩阵分解(NMF),在非负性约束下,最近被提出用于多变量数据分析。由于它只允许原始数据进行相加组合,而不是相减组合,NMF能够生成基于区域或部分的对象表示。它已被用于图像分析和文本处理。与主成分分析(PCA)不同,NMF的分辨率是非负的,并且可以直接轻松地解释和理解。由于存在多个解,NMF的原始算法[D.D. Lee, H.S. Seung, Nature 401 (1999) 788]不适用于解析化学混合信号。实际上,NMF从未被应用于解析化学混合信号。必须根据化学信号的特征对其进行修改,例如光谱的平滑性、色谱图的单峰性、质谱的稀疏性等。我们使用修改后的NMF算法来缩小解析化学信号的可行解区域,发现对于一定的实验误差,它可以产生合理且可接受的结果,特别是对于重叠色谱图和稀疏质谱。通过NMF技术已成功解析了模拟二维(2-D)数据和实际古井贡酒气相色谱 - 质谱(GC-MS)数据。从重叠光谱中鉴定出了己酸丁酯及其同分异构体化合物(丁酸己酯)。NMF的结果优于启发式演化潜投影(HELP)的结果。这表明NMF是一种用于复杂样品的有前途的化学计量学解析方法。