Suppr超能文献

连续小波回归模型中基于改进的逆概率加权(IPW)-偏最小二乘法(PLS)的变量选择

Variable selection by modified IPW (iterative predictor weighting)-PLS (partial least squares) in continuous wavelet regression models.

作者信息

Chen Da, Hu Bin, Shao Xueguang, Su Qingde

机构信息

Department of Chemistry, University of Science and Technology of China, Hefei, Anhui 230026, People's Republic of China.

出版信息

Analyst. 2004 Jul;129(7):664-9. doi: 10.1039/b400410h. Epub 2004 Jun 7.

Abstract

Variable selection is often used to produce more robust and parsimonious regression models. But when they are applied directly to the raw near-infrared spectra, it is not easy to select appropriate variables because background and noise will often overshadow or overlap the absorption bands of analyte. In this work, a new hybrid algorithm based on the selection of the most informative variables in the continuous wavelet transform (CWT) domain is described. The strategy is a combination of CWT and a procedure of modified iterative predictor weighting-partial least square (mIPW-PLS). After elimination of the background and noise in NIR spectra by CWT, the mIPW-PLS approach is used to select the most informative CWT coefficients. With the selected CWT coefficients, a PLS model is built finally for prediction. It is indicated that the extraction of most important variables in the CWT domain can effectively avoid the interference of background and noise, and result in a high quality of regression model with a very small number of variables and fewer PLS components.

摘要

变量选择常用于构建更稳健、更简洁的回归模型。但当直接将其应用于原始近红外光谱时,由于背景和噪声常常会掩盖或重叠分析物的吸收带,所以选择合适的变量并非易事。在这项工作中,描述了一种基于在连续小波变换(CWT)域中选择最具信息变量的新型混合算法。该策略是CWT与改进的迭代预测器加权-偏最小二乘法(mIPW-PLS)过程的结合。通过CWT消除近红外光谱中的背景和噪声后,使用mIPW-PLS方法选择最具信息的CWT系数。利用所选的CWT系数,最终构建一个PLS模型用于预测。结果表明,在CWT域中提取最重要的变量能够有效避免背景和噪声的干扰,并能以非常少的变量和更少的PLS分量得到高质量的回归模型。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验