Ni Hong-Fei, Si Le-Ting, Huang Jia-Peng, Zan Qiong, Chen Yong, Luan Lian-Jun, Wu Yong-Jiang, Liu Xue-Song
College of Pharmaceutical Sciences, Zhejiang University Hangzhou 310058, China.
Suzhou Zeda Xingbang Pharmaceutical Technology Co., Ltd. Suzhou 215163, China.
Zhongguo Zhong Yao Za Zhi. 2021 Jan;46(1):110-117. doi: 10.19540/j.cnki.cjcmm.20201022.304.
Near-infrared spectroscopy(NIRS) combined with band screening method and modeling algorithm can be used to achieve the rapid and non-destructive detection of the traditional Chinese medicine(TCM) production process. This paper focused on the ginkgo leaf macroporous resin purification process, which is the key technology of Yinshen Tongluo Capsules, in order to achieve the rapid determination of quercetin, kaempferol and isorhamnetin in effluent. The abnormal spectrum was eliminated by Mahalanobis distance algorithm, and the data set was divided by the sample set partitioning method based on joint X-Y distances(SPXY). The key information bands were selected by synergy interval partial least squares(siPLS); based on that, competitive adaptive reweighted sampling(CARS), successive projections algorithm(SPA) and Monte Carlo uninformative variable(MC-UVE) were used to select wavelengths to obtain less but more critical variable data. With selected key variables as input, the quantitative analysis model was established by genetic algorithm joint extreme learning machine(GA-ELM) algorithm. The performance of the model was compared with that of partial least squares regression(PLSR). The results showed that the combination with siPLS-CARS-GA-ELM could achieve the optimal model performance with the minimum number of variables. The calibration set correlation coefficient R_c and the validation set correlation coefficient R_p of quercetin, kaempferol and isorhamnetin were all above 0.98. The root mean square error of calibration(RMSEC), the root mean square error of prediction(RMSEP) and the relative standard errors of prediction(RSEP) were 0.030 0, 0.029 2 and 8.88%, 0.041 4, 0.034 8 and 8.46%, 0.029 3, 0.027 1 and 10.10%, respectively. Compared with the PLSR me-thod, the performance of the GA-ELM model was greatly improved, which proved that NIRS combined with GA-ELM method has a great potential for rapid determination of effective components of TCM.
近红外光谱法(NIRS)结合波段筛选方法和建模算法可用于实现对中药生产过程的快速无损检测。本文聚焦于银参通络胶囊的关键技术——银杏叶大孔树脂纯化工艺,以实现对流出液中槲皮素、山奈酚和异鼠李素的快速测定。采用马氏距离算法消除异常光谱,基于联合X-Y距离的样本集划分方法(SPXY)对数据集进行划分。通过协同区间偏最小二乘法(siPLS)选择关键信息波段;在此基础上,采用竞争性自适应重加权采样(CARS)、连续投影算法(SPA)和蒙特卡罗无信息变量消除法(MC-UVE)选择波长,以获得较少但更关键的变量数据。以选定的关键变量作为输入,采用遗传算法联合极限学习机(GA-ELM)算法建立定量分析模型。将该模型的性能与偏最小二乘回归(PLSR)模型进行比较。结果表明,siPLS-CARS-GA-ELM组合能够以最少的变量数实现最优的模型性能。槲皮素、山奈酚和异鼠李素的校正集相关系数R_c和验证集相关系数R_p均在0.98以上。校正均方根误差(RMSEC)、预测均方根误差(RMSEP)和预测相对标准误差(RSEP)分别为0.030 0、0.029 2和8.88%,0.041 4、0.034 8和8.46%,0.029 3、0.027 1和10.10%。与PLSR方法相比,GA-ELM模型的性能有了很大提高,证明了NIRS结合GA-ELM方法在快速测定中药有效成分方面具有巨大潜力。