College of Information Science and Engineering, Northeastern University, Shenyang 110819, China; Centre of Simulation, Shenyang Institute of Engineering, Shenyang 110136, China.
Centre of Simulation, Shenyang Institute of Engineering, Shenyang 110136, China.
Anal Chim Acta. 2018 Feb 13;1000:109-122. doi: 10.1016/j.aca.2017.11.028. Epub 2017 Nov 22.
This paper presents a novel spectrum analysis tool named synergy adaptive moving window modeling based on immune clone algorithm (SA-MWM-ICA) considering the tedious and inconvenient labor involved in the selection of pre-processing methods and spectral variables by prior experience. In this work, immune clone algorithm is first introduced into the spectrum analysis field as a new optimization strategy, covering the shortage of the relative traditional methods. Based on the working principle of the human immune system, the performance of the quantitative model is regarded as antigen, and a special vector corresponding to the above mentioned antigen is regarded as antibody. The antibody contains a pre-processing method optimization region which is created by 11 decimal digits, and a spectrum variable optimization region which is formed by some moving windows with changeable width and position. A set of original antibodies are created by modeling with this algorithm. After calculating the affinity of these antibodies, those with high affinity will be selected to clone. The regulation for cloning is that the higher the affinity, the more copies will be. In the next step, another import operation named hyper-mutation is applied to the antibodies after cloning. Moreover, the regulation for hyper-mutation is that the lower the affinity, the more possibility will be. Several antibodies with high affinity will be created on the basis of these steps. Groups of simulated dataset, gasoline near-infrared spectra dataset, and soil near-infrared spectra dataset are employed to verify and illustrate the performance of SA-MWM-ICA. Analysis results show that the performance of the quantitative models adopted by SA-MWM-ICA are better especially for structures with relatively complex spectra than traditional models such as partial least squares (PLS), moving window PLS (MWPLS), genetic algorithm PLS (GAPLS), and pretreatment method classification and adjustable parameter changeable size moving window PLS (CA-CSMWPLS). The selected pre-processing methods and spectrum variables are easily explained. The proposed method will converge in few generations and can be used not only for near-infrared spectroscopy analysis but also for other similar spectral analysis, such as infrared spectroscopy.
本文提出了一种新的谱分析工具,名为基于免疫克隆算法的协同自适应移动窗口建模(SA-MWM-ICA),考虑到通过先验经验选择预处理方法和光谱变量的繁琐和不便。在这项工作中,免疫克隆算法首先被引入到光谱分析领域,作为一种新的优化策略,弥补了相对传统方法的不足。基于人体免疫系统的工作原理,将定量模型的性能视为抗原,与上述抗原相对应的特殊向量视为抗体。抗体包含一个由 11 个十进制数字创建的预处理方法优化区域,以及一个由具有可变宽度和位置的一些移动窗口组成的光谱变量优化区域。通过该算法进行建模会创建一组原始抗体。在计算这些抗体的亲和力之后,选择具有高亲和力的抗体进行克隆。克隆的调节规则是,亲和力越高,克隆的副本越多。在下一步中,对克隆后的抗体应用另一个名为超突变的导入操作。此外,超突变的调节规则是,亲和力越低,可能性越大。基于这些步骤,会创建几个具有高亲和力的抗体。模拟数据集、汽油近红外光谱数据集和土壤近红外光谱数据集被用来验证和说明 SA-MWM-ICA 的性能。分析结果表明,SA-MWM-ICA 采用的定量模型的性能更好,特别是对于结构比较复杂的光谱,优于传统模型,如偏最小二乘法(PLS)、移动窗口偏最小二乘法(MWPLS)、遗传算法偏最小二乘法(GAPLS)和预处理方法分类和可调参数变化大小移动窗口偏最小二乘法(CA-CSMWPLS)。选择的预处理方法和光谱变量易于解释。所提出的方法将在几轮迭代中收敛,不仅可用于近红外光谱分析,还可用于其他类似的光谱分析,如红外光谱分析。