• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于BiPLS结合GA的可见/近红外光谱中多元线性回归变量选择

[Selection of variables for MLR in Vis/NIR spectroscopy based on BiPLS combined with GA].

作者信息

Li Peng-Fei, Wang Jia-Hua, Cao Nan-Ning, Han Dong-Hai

机构信息

College of Food Science and Nutritional Engineering, China Agricultural University, Beijing 100083, China.

出版信息

Guang Pu Xue Yu Guang Pu Fen Xi. 2009 Oct;29(10):2637-41.

PMID:20038026
Abstract

The feasibility of using efficient selection of variables in Vis/NIR for a rapid and conclusive determination of fruit inner qualities such as soluble solids content (SSC) of plums was investigated. A new strategy was proposed in the present paper, i. e. two-stage variable selection using the backward interval partial least squares (BiPLS) combined with genetic algorithm (GA). Firstly, it splits the whole spectral region into equidistant sub-regions and then develops all BiPLS regression models, and the informative regions which are used to constructed PLS models with the lowest error can be located. Secondly, GA method is used to select variable in these informative regions, which are used for regression variables of MLR model. The Vis/NIR spectra containing 225 individual data points were processed by Savizky-Golay filter smoothing and second-order derivative, and 9 sub-regions were selected by BiPLS procedure when the spectra were divided into 25 sub-regions. The optimal 12 variables, which were the output of the GA procedure, were selected by the higher occurrence frequency while the GA procedure ran 100 times. In order to simplify the multiple linear regression (MLR) modeling, the wavelength variables with the maximum occurrence frequency were chosen when the adjacent wavelengths were selected by GA. Finally, 638, 734, 752, 868, 910, 916 and 938 nm were used to build a MLR model. The results show that MLR model produced by BiPLS-GA performs well with correlation coefficients (R) of 0.984, root mean standard error of calibration (RMSEC) of 0.364 and root mean standard error of prediction (RMSEP) of 0.471 for SSC, which outperforms models using stepwise regression analysis (SRA). This work proved that the BiPLS-GA could determine optimal variables in Vis/NIR spectra and improve the accuracy of model.

摘要

研究了在可见/近红外光谱中利用有效变量选择快速准确测定李子果实内部品质(如可溶性固形物含量,SSC)的可行性。本文提出了一种新策略,即采用向后间隔偏最小二乘法(BiPLS)结合遗传算法(GA)的两阶段变量选择方法。首先,将整个光谱区域划分为等距子区域,然后建立所有BiPLS回归模型,从而确定用于构建误差最小的PLS模型的信息区域。其次,利用GA方法在这些信息区域中选择变量,作为多元线性回归(MLR)模型的回归变量。对包含225个独立数据点的可见/近红外光谱进行Savizky-Golay滤波平滑和二阶导数处理,当光谱划分为25个子区域时,通过BiPLS程序选择出9个子区域。GA程序运行100次时,通过较高的出现频率选择出作为GA程序输出的最优12个变量。为简化多元线性回归(MLR)建模,当GA选择相邻波长时,选择出现频率最高的波长变量。最后,利用638、734、752、868、910、916和938nm建立MLR模型。结果表明,BiPLS-GA产生的MLR模型性能良好,SSC的相关系数(R)为0.984,校正均方根误差(RMSEC)为0.364,预测均方根误差(RMSEP)为0.471,优于逐步回归分析(SRA)模型。这项工作证明,BiPLS-GA可以在可见/近红外光谱中确定最优变量,并提高模型的准确性。

相似文献

1
[Selection of variables for MLR in Vis/NIR spectroscopy based on BiPLS combined with GA].基于BiPLS结合GA的可见/近红外光谱中多元线性回归变量选择
Guang Pu Xue Yu Guang Pu Fen Xi. 2009 Oct;29(10):2637-41.
2
Hybrid variable selection in visible and near-infrared spectral analysis for non-invasive quality determination of grape juice.可见近红外光谱分析中用于非侵入式葡萄汁品质测定的混合变量选择
Anal Chim Acta. 2010 Feb 5;659(1-2):229-37. doi: 10.1016/j.aca.2009.11.045. Epub 2009 Nov 26.
3
[Study on variable selection of NIR spectral information based on GA and SCMWPLS].
Guang Pu Xue Yu Guang Pu Fen Xi. 2010 Apr;30(4):915-9.
4
[Analysis of near infrared spectra of apple SSC by genetic algorithm optimization].基于遗传算法优化的苹果可溶性固形物近红外光谱分析
Guang Pu Xue Yu Guang Pu Fen Xi. 2008 Oct;28(10):2308-11.
5
Genetic algorithm interval partial least squares regression combined successive projections algorithm for variable selection in near-infrared quantitative analysis of pigment in cucumber leaves.基于遗传算法区间偏最小二乘回归与连续投影算法的黄瓜叶片色素近红外定量分析变量选择
Appl Spectrosc. 2010 Jul;64(7):786-94. doi: 10.1366/000370210791666246.
6
[Analysis of NIR characteristic wavelengths for apple flesh firmness based on GA and iPLS].基于遗传算法和间隔偏最小二乘法的苹果果肉硬度近红外特征波长分析
Guang Pu Xue Yu Guang Pu Fen Xi. 2009 Oct;29(10):2760-4.
7
Variable selection in visible/near infrared spectra for linear and nonlinear calibrations: a case study to determine soluble solids content of beer.用于线性和非线性校准的可见/近红外光谱中的变量选择:测定啤酒可溶性固形物含量的案例研究
Anal Chim Acta. 2009 Mar 2;635(1):45-52. doi: 10.1016/j.aca.2009.01.017. Epub 2009 Jan 17.
8
Measurement of soluble solids content in watermelon by Vis/NIR diffuse transmittance technique.采用可见/近红外漫透射技术测定西瓜中的可溶性固形物含量。
J Zhejiang Univ Sci B. 2007 Feb;8(2):105-10. doi: 10.1631/jzus.2007.B0105.
9
Fast determination of oxides content in cement raw meal using NIR-spectroscopy and backward interval PLS with genetic algorithm.使用近红外光谱法和基于遗传算法的反向间隔偏最小二乘法快速测定水泥生料中的氧化物含量。
Spectrochim Acta A Mol Biomol Spectrosc. 2019 Dec 5;223:117327. doi: 10.1016/j.saa.2019.117327. Epub 2019 Jun 29.
10
[Determination of soluble solids content in navel oranges by Vis/NIR diffuse transmission spectra combined with CARS method].基于可见/近红外漫透射光谱结合CARS方法测定脐橙可溶性固形物含量
Guang Pu Xue Yu Guang Pu Fen Xi. 2012 Dec;32(12):3229-33.

引用本文的文献

1
Non-Destructive Detection of Meat Quality Based on Multiple Spectral Dimension Reduction Methods by Near-Infrared Spectroscopy.基于近红外光谱多光谱降维方法的肉质无损检测
Foods. 2023 Jan 8;12(2):300. doi: 10.3390/foods12020300.