Suppr超能文献

运用多元统计技术评估河流水质变化:萨瓦河(克罗地亚):案例研究

Evaluation of river water quality variations using multivariate statistical techniques: Sava River (Croatia): a case study.

作者信息

Marinović Ruždjak Andrea, Ruždjak Domagoj

机构信息

Croatian Waters, Central Water Management Laboratory, Ulica grada Vukovara 220, 10000, Zagreb, Croatia,

出版信息

Environ Monit Assess. 2015 Apr;187(4):215. doi: 10.1007/s10661-015-4393-x. Epub 2015 Mar 29.

Abstract

For the evaluation of seasonal and spatial variations and the interpretation of a large and complex water quality dataset obtained during a 7-year monitoring program of the Sava River in Croatia, different multivariate statistical techniques were applied in this study. Basic statistical properties and correlations of 18 water quality parameters (variables) measured at 18 sampling sites (a total of 56,952 values) were examined. Correlations between air temperature and some water quality parameters were found in agreement with the previous studies of relationship between climatic and hydrological parameters. Principal component analysis (PCA) was used to explore the most important factors determining the spatiotemporal dynamics of the Sava River. PCA has determined a reduced number of seven principal components that explain over 75 % of the data set variance. The results revealed that parameters related to temperature and organic pollutants (CODMn and TSS) were the most important parameters contributing to water quality variation. PCA analysis of seasonal subsets confirmed this result and showed that the importance of parameters is changing from season to season. PCA of the four seasonal data subsets yielded six PCs with eigenvalues greater than one explaining 73.6 % (spring), 71.4 % (summer), 70.3 % (autumn), and 71.3 % (winter) of the total variance. To check the influence of the outliers in the data set whose distribution strongly deviates from the normal one, in addition to standard principal component analysis algorithm, two robust estimates of covariance matrix were calculated and subjected to PCA. PCA in both cases yielded seven principal components explaining 75 % of the total variance, and the results do not differ significantly from the results obtained by the standard PCA algorithm. With the implementation of robust PCA algorithm, it is demonstrated that the usage of standard algorithm is justified for data sets with small numbers of missing data, nondetects, and outliers (less than 4 %). The clustering procedure highlighted four different groups in which the sampling sites have similar characteristics and pollution levels. The first and the second group correspond to relatively low and moderately polluted sites while stations which are located in the middle of the river belong to the third and fourth group and correspond to highly and moderately polluted sites.

摘要

为评估克罗地亚萨瓦河7年监测计划期间获得的大规模复杂水质数据集的季节和空间变化并进行解读,本研究应用了不同的多元统计技术。研究了在18个采样点测量的18个水质参数(变量)的基本统计特性及相关性(总共56952个值)。发现气温与一些水质参数之间的相关性与先前关于气候和水文参数关系的研究结果一致。主成分分析(PCA)用于探究决定萨瓦河时空动态的最重要因素。PCA确定了七个主成分,这些主成分解释了超过75%的数据集方差。结果表明,与温度和有机污染物(化学需氧量和总悬浮固体)相关的参数是导致水质变化的最重要参数。对季节子集进行的PCA分析证实了这一结果,并表明参数的重要性随季节而变化。对四个季节数据子集进行的PCA产生了六个特征值大于1的主成分,分别解释了总方差的73.6%(春季)、71.4%(夏季)、70.3%(秋季)和71.3%(冬季)。为检验数据集中分布严重偏离正态分布的异常值的影响,除了标准主成分分析算法外,还计算了协方差矩阵的两种稳健估计值并进行PCA。两种情况下的PCA均产生了七个主成分,解释了总方差的75%,结果与标准PCA算法得到的结果无显著差异。通过实施稳健PCA算法,证明对于缺失数据、未检出值和异常值数量较少(小于4%)的数据集,使用标准算法是合理的。聚类程序突出显示了四个不同的组,其中采样点具有相似的特征和污染水平。第一组和第二组对应污染程度相对较低和中等的站点,而位于河流中部的站点属于第三组和第四组,对应污染程度高和中等的站点。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验