• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于近红外光谱的偏最小二乘回归中变量选择算法用于预测土壤中特定金属的比较

Comparison between Variable-Selection Algorithms in PLS Regression with Near-Infrared Spectroscopy to Predict Selected Metals in Soil.

作者信息

Abrantes Giovanna, Almeida Valber, Maia Angelo Jamil, Nascimento Rennan, Nascimento Clistenes, Silva Ygor, Silva Yuri, Veras Germano

机构信息

Departamento de Química, Centro de Ciência e Tecnologia, Universidade Estadual da Paraíba, Campina Grande 58429-500, Brazil.

Agronomy Department, Federal Rural University of Pernambuco, Recife 52171-900, Brazil.

出版信息

Molecules. 2023 Oct 6;28(19):6959. doi: 10.3390/molecules28196959.

DOI:10.3390/molecules28196959
PMID:37836802
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10574190/
Abstract

Soil is one of the Earth's most important natural resources. The presence of metals can decrease environmental quality if present in excessive amounts. Analyzing soil metal contents can be costly and time consuming, but near-infrared (NIR) spectroscopy coupled with chemometric tools can offer an alternative. The most important multivariate calibration method to predict concentrations or physical, chemical or physicochemical properties as a chemometric tool is partial least-squares (PLS) regression. However, a large number of irrelevant variables may cause problems of accuracy in the predictive chemometric models. Thus, stochastic variable-selection techniques, such as the Firefly algorithm by intervals in PLS (FFiPLS), can provide better solutions for specific problems. This study aimed to evaluate the performance of FFiPLS against deterministic PLS algorithms for the prediction of metals in river basin soils. The samples had their spectra collected from the region of 1000-2500 nm. Predictive models were then built from the spectral data, including PLS, interval-PLS (iPLS), successive projections algorithm for interval selection in PLS (iSPA-PLS), and FFiPLS. The chemometric models were built with raw data and preprocessed data by using different methods such as multiplicative scatter correction (MSC), standard normal variate (SNV), mean centering, adjustment of baseline and smoothing by the Savitzky-Golay method. The elliptical joint confidence region (EJCR) used in each chemometric model presented adequate fit. FFiPLS models of iron and titanium obtained a relative prediction deviation (RPD) of more than 2. The chemometric models for determination of aluminum obtained an RPD of more than 2 in the preprocessed data with SNV, MSC and baseline (offset + linear) and with raw data. The metals Be, Gd and Y failed to obtain adequate models in terms of residual prediction deviation (RPD). These results are associated with the low values of metals in the samples. Considering the complexity of the samples, the relative error of prediction (REP) obtained between 10 and 25% of the values adequate for this type of sample. Root mean square error of calibration and prediction (RMSEC and RMSEP, respectively) presented the same profile as the other quality parameters. The FFiPLS algorithm outperformed deterministic algorithms in the construction of models estimating the content of Al, Be, Gd and Y. This study produced chemometric models with variable selection able to determine metals in the Ipojuca River watershed soils using reflectance-mode NIR spectrometry.

摘要

土壤是地球上最重要的自然资源之一。如果金属含量过高,会降低环境质量。分析土壤金属含量成本高且耗时,但近红外(NIR)光谱结合化学计量学工具可提供一种替代方法。作为一种化学计量学工具,预测浓度或物理、化学或物理化学性质的最重要的多元校准方法是偏最小二乘(PLS)回归。然而,大量不相关变量可能会导致预测化学计量模型的准确性问题。因此,随机变量选择技术,如PLS区间萤火虫算法(FFiPLS),可以为特定问题提供更好的解决方案。本研究旨在评估FFiPLS相对于确定性PLS算法在预测流域土壤中金属含量方面的性能。从1000 - 2500nm区域收集样品的光谱。然后根据光谱数据建立预测模型,包括PLS、区间PLS(iPLS)、PLS区间选择的连续投影算法(iSPA - PLS)和FFiPLS。通过使用不同方法,如多元散射校正(MSC)、标准正态变量变换(SNV)、均值中心化、基线校正和Savitzky - Golay方法平滑,用原始数据和预处理数据建立化学计量模型。每个化学计量模型中使用的椭圆联合置信区域(EJCR)拟合良好。铁和钛的FFiPLS模型获得了大于2的相对预测偏差(RPD)。在使用SNV、MSC和基线(偏移 + 线性)的预处理数据以及原始数据中,测定铝的化学计量模型获得了大于2的RPD。金属铍、钆和钇在残差预测偏差(RPD)方面未能获得合适的模型。这些结果与样品中金属的低值有关。考虑到样品的复杂性,预测相对误差(REP)在适合此类样品的值的10%至25%之间。校准和预测的均方根误差(分别为RMSEC和RMSEP)与其他质量参数呈现相同的分布。在构建估计铝、铍、钆和钇含量的模型时,FFiPLS算法优于确定性算法。本研究利用反射模式近红外光谱法建立了具有变量选择功能的化学计量模型,能够测定伊波茹卡河流域土壤中的金属。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/1b05c30b1d8f/molecules-28-06959-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/5fded5f526e4/molecules-28-06959-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/1895d76411e7/molecules-28-06959-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/665c27318fec/molecules-28-06959-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/e6f81170bc7f/molecules-28-06959-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/7475be49d056/molecules-28-06959-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/1b05c30b1d8f/molecules-28-06959-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/5fded5f526e4/molecules-28-06959-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/1895d76411e7/molecules-28-06959-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/665c27318fec/molecules-28-06959-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/e6f81170bc7f/molecules-28-06959-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/7475be49d056/molecules-28-06959-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f937/10574190/1b05c30b1d8f/molecules-28-06959-g006.jpg

相似文献

1
Comparison between Variable-Selection Algorithms in PLS Regression with Near-Infrared Spectroscopy to Predict Selected Metals in Soil.基于近红外光谱的偏最小二乘回归中变量选择算法用于预测土壤中特定金属的比较
Molecules. 2023 Oct 6;28(19):6959. doi: 10.3390/molecules28196959.
2
Near-infrared spectroscopy for the prediction of rare earth elements in soils from the largest uranium-phosphate deposit in Brazil using PLS, iPLS, and iSPA-PLS models.基于偏最小二乘法(PLS)、交互部分最小二乘法(iPLS)和交互样本偏最小二乘法(iSPA-PLS)模型,利用近红外光谱法预测巴西最大磷铀矿床土壤中的稀土元素。
Environ Monit Assess. 2020 Oct 6;192(11):675. doi: 10.1007/s10661-020-08642-2.
3
Determination of fat content in chicken hamburgers using NIR spectroscopy and the Successive Projections Algorithm for interval selection in PLS regression (iSPA-PLS).应用近红外光谱法和偏最小二乘法中连续投影算法(iSPA-PLS)对间隔选择,测定鸡肉汉堡中的脂肪含量。
Spectrochim Acta A Mol Biomol Spectrosc. 2018 Jan 15;189:300-306. doi: 10.1016/j.saa.2017.08.046. Epub 2017 Aug 17.
4
A green method for the authentication of sugarcane spirit and prediction of density and alcohol content based on near infrared spectroscopy and chemometric tools.基于近红外光谱和化学计量学工具的一种鉴别甘蔗酒和预测其密度及酒精度的绿色方法。
Food Res Int. 2023 Aug;170:112830. doi: 10.1016/j.foodres.2023.112830. Epub 2023 Apr 27.
5
Evaluation of matcha tea quality index using portable NIR spectroscopy coupled with chemometric algorithms.利用便携式近红外光谱结合化学计量算法评价抹茶茶品质指标。
J Sci Food Agric. 2019 Aug 30;99(11):5019-5027. doi: 10.1002/jsfa.9743. Epub 2019 May 17.
6
Nondestructive Detection of Moisture Content in Palm Oil by Using Portable Vibrational Spectroscopy and Optimal Prediction Algorithms.利用便携式振动光谱和最优预测算法对棕榈油水分含量进行无损检测
J Anal Methods Chem. 2023 Jan 31;2023:3364720. doi: 10.1155/2023/3364720. eCollection 2023.
7
[Measurement of Soil Total Nitrogen Using Near Infrared Spectroscopy Combined with RCA and SPA].[基于随机蛙跳算法和连续投影算法的近红外光谱法测定土壤全氮含量]
Guang Pu Xue Yu Guang Pu Fen Xi. 2015 May;35(5):1248-52.
8
Near infrared system coupled chemometric algorithms for enumeration of total fungi count in cocoa beans neat solution.近红外系统结合化学计量学算法用于可可豆纯溶液中总真菌数的计数。
Food Chem. 2018 Feb 1;240:231-238. doi: 10.1016/j.foodchem.2017.07.117. Epub 2017 Jul 25.
9
Chemometrics-assisted simultaneous voltammetric determination of ascorbic acid, uric acid, dopamine and nitrite: application of non-bilinear voltammetric data for exploiting first-order advantage.化学计量学辅助同时伏安法测定抗坏血酸、尿酸、多巴胺和亚硝酸盐:利用非双线性伏安数据发挥一阶优势的应用
Talanta. 2014 Feb;119:553-63. doi: 10.1016/j.talanta.2013.11.028. Epub 2013 Nov 27.
10
Chemometric pre-processing can negatively affect the performance of near-infrared spectroscopy models for fruit quality prediction.化学计量学预处理会对近红外光谱模型预测水果品质的性能产生负面影响。
Talanta. 2021 Jul 1;229:122303. doi: 10.1016/j.talanta.2021.122303. Epub 2021 Mar 11.

引用本文的文献

1
Application of Fourier Transform Near-Infrared Spectroscopy and Chemometrics for Quantitative Analysis of Milk of Lime (MOL) Used in the Sugar Industry.傅里叶变换近红外光谱法和化学计量学在制糖工业中使用的石灰乳定量分析中的应用
Molecules. 2025 May 24;30(11):2308. doi: 10.3390/molecules30112308.
2
Biomarker discovery in progressive supranuclear palsy from human cerebrospinal fluid.从人脑脊液中发现进行性核上性麻痹的生物标志物。
Clin Proteomics. 2024 Sep 28;21(1):56. doi: 10.1186/s12014-024-09507-3.

本文引用的文献

1
Exploring relationship of soil PTE geochemical and "VIS-NIR spectroscopy" patterns near Cu-Mo mine (Armenia).探讨铜矿钼矿附近土壤 PTE 地球化学与“可见-近红外光谱”模式的关系(亚美尼亚)。
Environ Pollut. 2023 Apr 15;323:121180. doi: 10.1016/j.envpol.2023.121180. Epub 2023 Jan 31.
2
Beryllium contamination and its risk management in terrestrial and aquatic environmental settings.铍的污染及其在陆地和水生环境中的风险管理。
Environ Pollut. 2023 Mar 1;320:121077. doi: 10.1016/j.envpol.2023.121077. Epub 2023 Jan 13.
3
Monitoring the concentrations of Cd, Cu, Pb, Ni, Cr, Zn, Mn and Fe in cultivated Haplic Luvisol soils using near-infrared reflectance spectroscopy and chemometrics.
利用近红外反射光谱和化学计量学监测耕作层淡黑钙土中 Cd、Cu、Pb、Ni、Cr、Zn、Mn 和 Fe 的浓度。
Talanta. 2023 Jan 1;251:123749. doi: 10.1016/j.talanta.2022.123749. Epub 2022 Jul 21.
4
Mechanisms and influencing factors of yttrium sorption on paddy soil: Experiments and modeling.钇在稻田土壤中吸附的机理及影响因素:实验与模拟。
Chemosphere. 2022 Nov;307(Pt 1):135688. doi: 10.1016/j.chemosphere.2022.135688. Epub 2022 Jul 14.
5
Phytomining of rare earth elements - A review.稀土元素的植物采矿——综述
Chemosphere. 2022 Jun;297:134259. doi: 10.1016/j.chemosphere.2022.134259. Epub 2022 Mar 8.
6
Redistribution and chemical speciation of rare earth elements in an ion-adsorption rare earth tailing, Southern China.中国南方离子吸附型稀土尾矿中稀土元素的赋存状态和分配规律
Sci Total Environ. 2022 May 15;821:153369. doi: 10.1016/j.scitotenv.2022.153369. Epub 2022 Jan 22.
7
Variable selection in the chemometric treatment of food data: A tutorial review.化学计量学处理食品数据中的变量选择:教程综述。
Food Chem. 2022 Feb 15;370:131072. doi: 10.1016/j.foodchem.2021.131072. Epub 2021 Sep 7.
8
Benefits and Detriments of Gadolinium from Medical Advances to Health and Ecological Risks.从医学进步到健康和生态风险看镧系元素的利弊。
Molecules. 2020 Dec 7;25(23):5762. doi: 10.3390/molecules25235762.
9
Modelling potentially toxic elements in forest soils with vis-NIR spectra and learning algorithms.应用可见-近红外光谱和学习算法对森林土壤中潜在有毒元素进行建模。
Environ Pollut. 2020 Dec;267:115574. doi: 10.1016/j.envpol.2020.115574. Epub 2020 Sep 6.
10
Variable selection - A review and recommendations for the practicing statistician.变量选择——给执业统计学家的一篇综述与建议
Biom J. 2018 May;60(3):431-449. doi: 10.1002/bimj.201700067. Epub 2018 Jan 2.