University of Cambridge, Department of Zoology, Cambridge, CB2 3EJ, UK.
Spatial-Ecology, Meaderville House, Wheal Buller, Redruth, TR16 6ST, UK.
Sci Data. 2020 May 28;7(1):161. doi: 10.1038/s41597-020-0478-7.
Nitrogen (N) and Phosphorus (P) are essential nutritional elements for life processes in water bodies. However, in excessive quantities, they may represent a significant source of aquatic pollution. Eutrophication has become a widespread issue rising from a chemical nutrient imbalance and is largely attributed to anthropogenic activities. In view of this phenomenon, we present a new geo-dataset to estimate and map the concentrations of N and P in their various chemical forms at a spatial resolution of 30 arc-second (∼1 km) for the conterminous US. The models were built using Random Forest (RF), a machine learning algorithm that regressed the seasonally measured N and P concentrations collected at 62,495 stations across the US streams for the period of 1994-2018 onto a set of 47 in-house built environmental variables that are available at a near-global extent. The seasonal models were validated through internal and external validation procedures and the predictive powers measured by Pearson Coefficients reached approximately 0.66 on average.
氮(N)和磷(P)是水体生命过程中必需的营养元素。然而,过量的氮和磷可能是造成水环境污染的一个重要来源。富营养化是一种化学养分失衡的普遍现象,主要归因于人类活动。有鉴于此,我们提出了一个新的地理数据集,用以估计和绘制美国本土各种化学形态的 N 和 P 浓度,空间分辨率为 30 弧秒(约 1 公里)。模型是使用随机森林(RF)算法构建的,该算法将 1994 年至 2018 年期间在美国溪流中 62495 个站点季节性测量的 N 和 P 浓度与一套 47 个内部构建的可在近全球范围内获取的环境变量进行回归。通过内部和外部验证程序对季节性模型进行了验证,平均而言,皮尔逊系数的预测能力约为 0.66。