• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于变量空间连续收缩的多元校正混合变量选择策略。

A hybrid variable selection strategy based on continuous shrinkage of variable space in multivariate calibration.

机构信息

College of Food Science and Technology, Hainan University, Haikou, 570228, China; Institute of Environment and Plant Protection, Chinese Academy of Tropical Agricultural Sciences, Haikou, 571101, PR China.

College of Tobacco Science, Guizhou University, Guiyang, 550025, China.

出版信息

Anal Chim Acta. 2019 Jun 13;1058:58-69. doi: 10.1016/j.aca.2019.01.022. Epub 2019 Jan 21.

DOI:10.1016/j.aca.2019.01.022
PMID:30851854
Abstract

When analyzing high-dimensional near-infrared (NIR) spectral datasets, variable selection is critical to improving models' predictive abilities. However, some methods have many limitations, such as a high risk of overfitting, time-intensiveness, or large computation demands, when dealing with a high number of variables. In this study, we propose a hybrid variable selection strategy based on the continuous shrinkage of variable space which is the core idea of variable combination population analysis (VCPA). The VCPA-based hybrid strategy continuously shrinks the variable space from big to small and optimizes it based on modified VCPA in the first step. It then employs iteratively retaining informative variables (IRIV) and a genetic algorithm (GA) to carry out further optimization in the second step. It takes full advantage of VCPA, GA, and IRIV, and makes up for their drawbacks in the face of high numbers of variables. Three NIR datasets and three variable selection methods including two widely-used methods (competitive adaptive reweighted sampling, CARS and genetic algorithm-interval partial least squares, GA-iPLS) and one hybrid method (variable importance in projection coupled with genetic algorithm, VIP-GA) were used to investigate the improvement of VCPA-based hybrid strategy. The results show that VCPA-GA and VCPA-IRIV significantly improve model's prediction performance when compared with other methods, indicating that the modified VCPA step is a very efficient way to filter the uninformative variables and VCPA-based hybrid strategy is a good and promising strategy for variable selection in NIR. The MATLAB source codes of VCPA-GA and VCPA-IRIV can be freely downloaded in the website: https://cn.mathworks.com/matlabcentral/profile/authors/5526470-yonghuan-yun.

摘要

在分析高维近红外(NIR)光谱数据集时,变量选择对于提高模型的预测能力至关重要。然而,当处理大量变量时,一些方法存在许多限制,例如过度拟合的风险高、时间密集或计算需求大。在本研究中,我们提出了一种基于连续收缩变量空间的混合变量选择策略,这是变量组合群体分析(VCPA)的核心思想。基于 VCPA 的混合策略从大到小连续收缩变量空间,并在第一步中基于改进的 VCPA 对其进行优化。然后,它在第二步中采用迭代保留信息变量(IRIV)和遗传算法(GA)进行进一步优化。它充分利用了 VCPA、GA 和 IRIV,并弥补了它们在面对大量变量时的缺点。我们使用三个 NIR 数据集和三种变量选择方法(包括两种广泛使用的方法[竞争自适应重加权采样,CARS 和遗传算法-区间偏最小二乘,GA-iPLS]和一种混合方法[变量重要性投影与遗传算法,VIP-GA])来研究 VCPA 基混合策略的改进。结果表明,与其他方法相比,VCPA-GA 和 VCPA-IRIV 显著提高了模型的预测性能,这表明改进的 VCPA 步骤是过滤无信息变量的非常有效的方法,VCPA 基混合策略是 NIR 中变量选择的一种很好且有前途的策略。VCPA-GA 和 VCPA-IRIV 的 MATLAB 源代码可在以下网站免费下载:https://cn.mathworks.com/matlabcentral/profile/authors/5526470-yonghuan-yun。

相似文献

1
A hybrid variable selection strategy based on continuous shrinkage of variable space in multivariate calibration.一种基于变量空间连续收缩的多元校正混合变量选择策略。
Anal Chim Acta. 2019 Jun 13;1058:58-69. doi: 10.1016/j.aca.2019.01.022. Epub 2019 Jan 21.
2
Three-step hybrid strategy towards efficiently selecting variables in multivariate calibration of near-infrared spectra.三步混合策略在近红外光谱多元校正中高效选择变量。
Spectrochim Acta A Mol Biomol Spectrosc. 2020 Jan 5;224:117376. doi: 10.1016/j.saa.2019.117376. Epub 2019 Jul 8.
3
Using variable combination population analysis for variable selection in multivariate calibration.在多元校准中使用可变组合总体分析进行变量选择。
Anal Chim Acta. 2015 Mar 3;862:14-23. doi: 10.1016/j.aca.2014.12.048. Epub 2014 Dec 30.
4
A strategy that iteratively retains informative variables for selecting optimal variable subset in multivariate calibration.一种在多元校正中迭代保留信息变量以选择最优变量子集的策略。
Anal Chim Acta. 2014 Jan 7;807:36-43. doi: 10.1016/j.aca.2013.11.032. Epub 2013 Nov 21.
5
Feasibility of an NIR spectral calibration transfer algorithm based on optimized feature variables to predict tobacco samples in different states.基于优化特征变量的近红外光谱定标传递算法在预测不同状态下烟草样品中的可行性研究。
Anal Methods. 2023 Feb 9;15(6):719-728. doi: 10.1039/d2ay01805e.
6
A novel variable selection approach that iteratively optimizes variable space using weighted binary matrix sampling.一种新颖的变量选择方法,该方法使用加权二元矩阵采样迭代优化变量空间。
Analyst. 2014 Oct 7;139(19):4836-45. doi: 10.1039/c4an00730a.
7
Quantitative analysis of three ingredients in Salvia miltiorrhiza by near infrared spectroscopy combined with hybrid variable selection strategy.近红外光谱结合混合变量选择策略定量分析丹参中的三种成分。
Spectrochim Acta A Mol Biomol Spectrosc. 2024 Jul 5;315:124273. doi: 10.1016/j.saa.2024.124273. Epub 2024 Apr 9.
8
A bootstrapping soft shrinkage approach for variable selection in chemical modeling.一种用于化学建模中变量选择的自举软收缩方法。
Anal Chim Acta. 2016 Feb 18;908:63-74. doi: 10.1016/j.aca.2016.01.001. Epub 2016 Jan 7.
9
Rapid Determination of Geniposide and Baicalin in Lanqin Oral Solution by Near-Infrared Spectroscopy with Chemometric Algorithms during Alcohol Precipitation.近红外光谱结合化学计量学算法快速测定蓝芩口服液醇沉过程中栀子苷和黄芩苷的含量
Molecules. 2022 Dec 20;28(1):4. doi: 10.3390/molecules28010004.
10
A novel variable selection method based on combined moving window and intelligent optimization algorithm for variable selection in chemical modeling.一种基于组合移动窗口和智能优化算法的新型变量选择方法,用于化学建模中的变量选择。
Spectrochim Acta A Mol Biomol Spectrosc. 2021 Feb 5;246:118986. doi: 10.1016/j.saa.2020.118986. Epub 2020 Sep 25.

引用本文的文献

1
Simultaneous Quantification and Visualization of Photosynthetic Pigments in Mill. under Different Levels of Nitrogen Application with Visible-Near Infrared Hyperspectral Imaging Technology.利用可见-近红外高光谱成像技术对不同施氮水平下的磨盘草光合色素进行同步定量与可视化分析
Plants (Basel). 2023 Aug 16;12(16):2956. doi: 10.3390/plants12162956.
2
Potential of Near-Infrared Spectroscopy (NIRS) for Efficient Classification Based on Postharvest Storage Time, Cultivar and Maturity in Coconut Water.基于收获后天数、品种和成熟度的近红外光谱(NIRS)对椰子水进行高效分类的潜力
Foods. 2023 Jun 20;12(12):2415. doi: 10.3390/foods12122415.
3
Comparison of various chemometric methods on visible and near-infrared spectral analysis for wood density prediction among different tree species and geographical origins.
不同树种和地理来源木材密度预测的可见和近红外光谱分析中各种化学计量学方法的比较
Front Plant Sci. 2023 Mar 10;14:1121287. doi: 10.3389/fpls.2023.1121287. eCollection 2023.
4
Retrieval of Leaf Chlorophyll Contents (LCCs) in Litchi Based on Fractional Order Derivatives and VCPA-GA-ML Algorithms.基于分数阶导数和VCPA-GA-ML算法的荔枝叶片叶绿素含量反演
Plants (Basel). 2023 Jan 21;12(3):501. doi: 10.3390/plants12030501.
5
Rapid Determination of Geniposide and Baicalin in Lanqin Oral Solution by Near-Infrared Spectroscopy with Chemometric Algorithms during Alcohol Precipitation.近红外光谱结合化学计量学算法快速测定蓝芩口服液醇沉过程中栀子苷和黄芩苷的含量
Molecules. 2022 Dec 20;28(1):4. doi: 10.3390/molecules28010004.
6
Maturity Stage Discrimination of Fruit Using Visible and Near-Infrared Hyperspectral Imaging.基于可见及近红外高光谱成像的水果成熟度阶段判别。
Molecules. 2022 Sep 25;27(19):6318. doi: 10.3390/molecules27196318.
7
Rapid and Low-Cost Detection of Millet Quality by Miniature Near-Infrared Spectroscopy and Iteratively Retaining Informative Variables.基于微型近红外光谱和迭代保留信息变量法的谷子品质快速低成本检测
Foods. 2022 Jun 22;11(13):1841. doi: 10.3390/foods11131841.
8
Intelligent Identification and Features Attribution of Saline-Alkali-Tolerant Rice Varieties Based on Raman Spectroscopy.基于拉曼光谱的耐盐碱水稻品种智能识别与特征归因
Plants (Basel). 2022 Apr 29;11(9):1210. doi: 10.3390/plants11091210.
9
Imaging Sub-Cellular Methionine and Insulin Interplay in Triple Negative Breast Cancer Lipid Droplet Metabolism.成像三阴性乳腺癌脂滴代谢中的亚细胞蛋氨酸与胰岛素相互作用。
Front Oncol. 2022 Mar 10;12:858017. doi: 10.3389/fonc.2022.858017. eCollection 2022.
10
Nondestructive Testing and Visualization of Catechin Content in Black Tea Fermentation Using Hyperspectral Imaging.利用高光谱成像技术对红茶发酵过程中儿茶素含量进行无损检测和可视化。
Sensors (Basel). 2021 Dec 2;21(23):8051. doi: 10.3390/s21238051.