Multi-way partial least squares modeling of water quality data.

Authors

Singh Kunwar P, Malik Amrita, Basant Nikita, Saxena Puneet

Affiliation

Environmental Chemistry Division, Industrial Toxicology Research Centre, Post Box 80, MG Marg, Lucknow 226001, India.

Publication

Anal Chim Acta. 2007 Feb 19;584(2):385-96. doi: 10.1016/j.aca.2006.11.038. Epub 2006 Nov 19.

DOI:10.1016/j.aca.2006.11.038
PMID:17386629
Abstract

A 10-year surface water quality data set pertaining to a polluted river was analyzed using partial least squares (PLS) regression models. Both the unfold-PLS and N-PLS (tri-PLS and quadri-PLS) models were calibrated using leave-one-out cross-validation. They were applied to the multivariate, multi-way data array to assess and compare their ability to predict the biochemical oxygen demand (BOD) of river water, in terms of their relative mean square errors of cross-validation and prediction and the variance captured. The sums of squares of residuals and the leverages were computed and analyzed to identify the sites, variables, years, and months that may influence the constructed model. Both the tri- and quadri-PLS models yielded relatively low validation errors compared with unfold-PLS and captured a high share of the variance in the model. Moreover, both methods produced acceptable model precision and accuracy. For tri-PLS, the root mean square errors were 1.65 and 2.17 for calibration and prediction, respectively, whereas they were 2.58 and 1.09 for quadri-PLS. At a preliminary level, it seems that BOD can be predicted, but a different data arrangement is needed. Moreover, analysis of the scores and loadings plots of the N-PLS models could provide information on the time evolution of the river water quality.
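To make the unfold-PLS and leave-one-out steps mentioned in the abstract more concrete, the following is a minimal sketch, not the authors' implementation. It assumes a tiny synthetic three-way array (samples x months x variables), a single latent variable, and hypothetical helper names (`unfold`, `pls1_fit`, `pls1_predict`, `loo_rmsecv`). The study itself used real river data and also N-PLS models, which are not reproduced here.

```python
import math

def unfold(cube):
    """Flatten a samples x J x K nested list into a samples x (J*K) matrix."""
    return [[v for row in sample for v in row] for sample in cube]

def pls1_fit(X, y):
    """Fit a one-component PLS1 model on mean-centred data (illustrative only)."""
    n, p = len(X), len(X[0])
    x_mean = [sum(r[j] for r in X) / n for j in range(p)]
    y_mean = sum(y) / n
    Xc = [[r[j] - x_mean[j] for j in range(p)] for r in X]
    yc = [v - y_mean for v in y]
    # Weight vector: direction in X with maximal covariance with y.
    w = [sum(Xc[i][j] * yc[i] for i in range(n)) for j in range(p)]
    norm = math.sqrt(sum(v * v for v in w)) or 1.0
    w = [v / norm for v in w]
    # Scores and the regression coefficient of y on the scores.
    t = [sum(a * b for a, b in zip(row, w)) for row in Xc]
    tt = sum(v * v for v in t)
    q = sum(a * b for a, b in zip(t, yc)) / tt if tt else 0.0
    return x_mean, y_mean, w, q

def pls1_predict(model, x):
    """Predict the response for one unfolded sample."""
    x_mean, y_mean, w, q = model
    score = sum((a - m) * b for a, m, b in zip(x, x_mean, w))
    return y_mean + q * score

def rmse(y_true, y_pred):
    """Root mean square error between observed and predicted values."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true))

def loo_rmsecv(X, y):
    """Leave-one-out cross-validation: refit without sample i, then predict it."""
    preds = []
    for i in range(len(X)):
        model = pls1_fit(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        preds.append(pls1_predict(model, X[i]))
    return rmse(y, preds)

# Synthetic data (an assumption, not from the paper): 6 samples, 2 "months",
# 3 "variables"; every entry is linear in the sample index, and so is y.
coef = [[1.0, 0.5, 2.0], [0.3, 1.5, 0.7]]
cube = [[[i * c + 1.0 for c in row] for row in coef] for i in range(6)]
y = [2.0 * i + 1.0 for i in range(6)]

X = unfold(cube)           # 6 x 6 matrix after unfolding
rmsecv = loo_rmsecv(X, y)  # cross-validated error of the one-component model
print(len(X), len(X[0]), rmsecv)
```

Because the synthetic response is exactly linear in a single underlying direction, one latent variable suffices and the cross-validated error is essentially zero. Real water quality data would need more components, chosen by comparing cross-validation errors as the abstract describes.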

