Improved variable reduction in partial least squares modelling by Global-Minimum Error Uninformative-Variable Elimination.

Affiliations

Research Group Analysis Techniques in the Life Sciences, Avans Hogeschool, University of Professional Education, P.O. Box 90116, 4800 RA Breda, The Netherlands.

Department of Analytical Chemistry and Pharmaceutical Technology, Center for Pharmaceutical Research, Vrije Universiteit Brussel-VUB, Laarbeeklaan 103, B-1090 Brussels, Belgium.

Publication information

Anal Chim Acta. 2017 Aug 22;982:37-47. doi: 10.1016/j.aca.2017.06.001. Epub 2017 Jun 16.


DOI:10.1016/j.aca.2017.06.001
PMID:28734364
Abstract

The calibration performance of Partial Least Squares regression (PLS) can be improved by eliminating uninformative variables. For PLS, many variable elimination methods have been developed. One is the Uninformative-Variable Elimination for PLS (UVE-PLS). However, the number of variables retained by UVE-PLS is usually still large. In UVE-PLS, variable elimination is repeated as long as the root mean squared error of cross validation (RMSECV) is decreasing. The set of variables at this first local minimum is retained. In this paper, a modification of UVE-PLS is proposed and investigated, in which UVE is repeated until no further reduction in variables is possible, followed by a search for the global RMSECV minimum. The method is called Global-Minimum Error Uninformative-Variable Elimination for PLS, denoted as GME-UVE-PLS or simply GME-UVE. After each iteration, the predictive ability of the PLS model built with the remaining variable set is assessed by RMSECV. The variable set with the global RMSECV minimum is then selected. The goal is to obtain smaller sets of variables with predictability similar to or better than that of the sets from the classical UVE-PLS method. The performance of the GME-UVE-PLS method is investigated using four data sets, i.e. a simulated set, NIR and NMR spectra, and a theoretical molecular descriptors set, resulting in twelve profile-response (X-y) calibrations. The selective and predictive performances of the models resulting from GME-UVE-PLS are statistically compared to those from UVE-PLS and 1-step UVE using one-sided paired t-tests. The results demonstrate that variable reduction with the proposed GME-UVE-PLS method usually eliminates significantly more variables than the classical UVE-PLS, while the predictive abilities of the resulting models are better. With GME-UVE-PLS, fewer uninformative variables, without a chemical meaning for the response, may be retained than with UVE-PLS. The selectivity of the classical UVE method can thus be improved by applying the proposed GME-UVE method, resulting in more parsimonious models.
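
The difference between the two stopping rules described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the `eliminate` and `rmsecv` callbacks, and the toy error values are all hypothetical stand-ins, and no real PLS model or UVE criterion is fitted. The sketch also assumes the full variable set counts as the first point on the error path. It only shows how keeping the first local RMSECV minimum can differ from running elimination to exhaustion and then taking the global minimum.

```python
from typing import Callable, List, Tuple

VariableSet = Tuple[int, ...]
Step = Tuple[VariableSet, float]  # (retained variables, RMSECV of the model built on them)


def elimination_path(
    start: VariableSet,
    eliminate: Callable[[VariableSet], VariableSet],
    rmsecv: Callable[[VariableSet], float],
) -> List[Step]:
    """Repeat elimination until the variable set stops shrinking, scoring each set."""
    current = tuple(start)
    path = [(current, rmsecv(current))]
    while True:
        reduced = tuple(eliminate(current))
        # Stop when nothing more can be removed (or nothing would be left).
        if not reduced or len(reduced) >= len(current):
            return path
        current = reduced
        path.append((current, rmsecv(current)))


def first_local_minimum(path: List[Step]) -> Step:
    """Classical-style rule: keep going only while the error keeps decreasing."""
    best = path[0]
    for step in path[1:]:
        if step[1] >= best[1]:
            break
        best = step
    return best


def global_minimum(path: List[Step]) -> Step:
    """GME-style rule: lowest error on the whole path; ties go to the smaller set."""
    return min(path, key=lambda step: (step[1], len(step[0])))


# Toy stand-ins (hypothetical): each pass drops the last two variables, and the
# error depends only on how many variables remain. Values are made up.
toy_errors = {8: 1.00, 6: 0.90, 4: 0.95, 2: 0.80}


def toy_eliminate(variables: VariableSet) -> VariableSet:
    return variables[:-2]


def toy_rmsecv(variables: VariableSet) -> float:
    return toy_errors[len(variables)]


path = elimination_path(tuple(range(8)), toy_eliminate, toy_rmsecv)
classical = first_local_minimum(path)
proposed = global_minimum(path)
print(len(classical[0]), classical[1])
print(len(proposed[0]), proposed[1])
```

In this toy path the error rises at four variables before falling again at two, so the first-local-minimum rule stops at six variables while the global search selects the two-variable set. In a real application the two callbacks would wrap a UVE step and a cross-validated PLS fit.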
