Mohamed Ibrahim, Othman Faridah, Ibrahim Adriana I N, Alaa-Eldin M E, Yunus Rossita M
Institute of Mathematical Sciences, Faculty of Science, University of Malaya, 50603, Kuala Lumpur, Malaysia.
Environ Monit Assess. 2015 Jan;187(1):4182. doi: 10.1007/s10661-014-4182-y. Epub 2014 Nov 30.
This case study uses several univariate and multivariate statistical techniques to evaluate and interpret a water quality data set obtained from the Klang River basin, located within the state of Selangor and the Federal Territory of Kuala Lumpur, Malaysia. The river drains an area of 1,288 km², running from the steep mountain rainforests of the main Central Range of Peninsular Malaysia to the river mouth at Port Klang, where it enters the Straits of Malacca. Water quality was monitored at 20 stations, nine of which are situated along the main river and 11 along six tributaries. Data were collected from 1997 to 2007 for seven parameters used to evaluate the status of the water quality, namely dissolved oxygen, biochemical oxygen demand, chemical oxygen demand, suspended solids, ammoniacal nitrogen, pH, and temperature. The data were first investigated using descriptive statistical tools, followed by two practical multivariate analyses that reduced the data dimensions for better interpretation. The analyses employed were factor analysis and principal component analysis, which explain 60% and 81.6% of the total variation in the data, respectively. We found that the latent variables resulting from the factor analysis are interpretable and useful for describing the water quality in the Klang River. This study demonstrates the usefulness of several statistical methods in evaluating and interpreting water quality data for the purpose of monitoring the effectiveness of water resource management. The results should make the data more straightforward to interpret and give managers valuable insight for devising optimal action plans to control pollution in river water.
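As a rough illustration of what "percentage of total variation explained" means for a principal component analysis of seven water quality parameters, the following minimal sketch may help. It is not the authors' procedure: the data are synthetic random numbers, the column labels in `PARAMETERS`, the sample size, the two hidden factors, and the helper `pca_explained_variance` are all assumptions made for this example, and the abstract does not state how the authors chose the number of components to retain.

```python
import numpy as np

# Hypothetical short labels for the seven parameters named in the abstract.
PARAMETERS = ["DO", "BOD", "COD", "SS", "NH3-N", "pH", "Temp"]


def pca_explained_variance(data):
    """PCA on the correlation matrix of `data` (rows = samples, columns = parameters).

    Returns the eigenvalues in descending order and the cumulative share of
    total variance explained by the leading components.
    """
    # The correlation matrix is the covariance matrix of standardized columns,
    # so parameters measured in different units contribute equally.
    corr = np.corrcoef(data, rowvar=False)
    # eigvalsh returns eigenvalues of a symmetric matrix in ascending order.
    eigenvalues = np.linalg.eigvalsh(corr)[::-1]
    cumulative = np.cumsum(eigenvalues / eigenvalues.sum())
    return eigenvalues, cumulative


# Synthetic stand-in data (NOT the Klang River measurements): two hidden
# factors drive seven observed columns, plus independent noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 2))
loadings = rng.normal(size=(2, len(PARAMETERS)))
synthetic = latent @ loadings + 0.5 * rng.normal(size=(200, len(PARAMETERS)))

eigenvalues, cumulative = pca_explained_variance(synthetic)
for k, share in enumerate(cumulative, start=1):
    print(f"first {k} component(s): {share:.1%} of total variance")
```

Read this way, a figure such as 81.6% would correspond to the cumulative share reached by the retained components; which components the study retained is not given in the abstract.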