Estonian Marine Institute, Academy of Sciences of Estonia, Paldiski St. 1, EE-0031, Tallinn, Estonia.
Environ Monit Assess. 1996 Dec;43(3):283-308. doi: 10.1007/BF00394455.
In spite of the large number of monitoring data on hydrography and nutrients collected from the Baltic Sea, it is still difficult to describe large-scale distribution patterns of these variables. We therefore suggest a stochastic approach that allows the spatial reconstruction of the fields for the entire sea. The Baltic Sea monitoring data on temperature, salinity and nutrient concentrations from the years 1972-1991 are each divided into twenty data sets: five regions, times four seasons. The spatial regions are the Southern Baltic Proper, the Northern and Central Baltic Proper, both above and below halocline, and the region of the Gulf of Finland and the Gulf of Riga. The four seasons consist of three-month periods: January-March (winter), April-June (spring), July-September (summer) and October-December (fall). Each monthly subset of a regional and seasonal data set is modeled as a sample out of a monthly realization of a random field. The data sets are decomposed into mean and fluctuational components. The mean is determined as an average over the space cells with dimensions of standard sampling depth intervals vertically, 10' in meridional (south-north), 20' in zonal (west-east) directions and over five-year periods in time. The fluctuation fields are considered second-order stationary, homogeneous and horizontally isotropic. Estimated horizontal (surface) and vertical (depth) components of the spatial correlations are approximated by Gaussian functions. The correlation scales for the fields of the Baltic Proper are mostly larger than 100 nautical miles horizontally and 40 m vertically and their dependence on the sea region or season is relatively weak. The most probable noise-to-signal ratio values of the data lie in the interval 0.6 to 1.2. The estimated correlation functions and noise-to-signal ratios allow the optimum analysis technique to assess the correctness of each datum of a sample on the background of the field statistics. The outliers of each monthly sample are excluded from the analysis. The observed fluctuations are interpolated into locations with missing data by an optimum interpolation procedure. The discrete cell-and-five-year mean values are interpolated by a different, piece-wise linear technique. Since the data number for the mean interpolation considerably exceeds the data number for the fluctuation interpolation, the interpolation errors for the mean are assumed negligible compared to the interpolation errors for the fluctuation. The sums of the mean and fluctuation, interpolated into the withheld observation points, are compared to the actually observed values and to some other linear interpolation estimates. In all test cases the optimum interpolation procedure performs the best.
尽管波罗的海已经积累了大量的水文和营养监测数据,但仍难以描述这些变量的大范围分布模式。因此,我们建议采用一种随机方法,可以对整个海域的场进行空间重建。我们将 1972 年至 1991 年间波罗的海的温度、盐度和营养浓度监测数据分别分为 20 个数据集:5 个区域,4 个季节。空间区域是波罗的海南部、北部和中部、上下盐层以及芬兰湾和里加湾地区。四个季节包括三个月的时间段:1 月至 3 月(冬季)、4 月至 6 月(春季)、7 月至 9 月(夏季)和 10 月至 12 月(秋季)。每个区域和季节数据集中的每月子集都被建模为随机场每月实现的一个样本。数据集被分解为均值和波动分量。均值是通过对标准采样深度间隔的空间单元格进行平均来确定的,垂直方向为 10',经向(南北)为 20',时间为 5 年。波动场被认为是二阶平稳的、均匀的和水平各向同性的。估计的水平(表面)和垂直(深度)空间相关分量由高斯函数近似。波罗的海各海域的相关尺度大多大于 100 海里水平和 40 米垂直,其对海域或季节的依赖性相对较弱。数据的最可能噪声与信号比值大多在 0.6 到 1.2 之间。估计的相关函数和噪声与信号比允许最佳分析技术来评估样本中每个数据点在字段统计数据背景下的正确性。从分析中排除每个月样本的异常值。通过最优插值程序将观测到的波动插值到缺失数据的位置。通过不同的分段线性技术对离散的单元格和五年均值进行插值。由于均值插值的数据数量大大超过波动插值的数据数量,因此与波动插值相比,均值插值的插值误差可以忽略不计。插入保留观测点的均值和波动之和与实际观测值和其他一些线性插值估计值进行比较。在所有测试案例中,最优插值程序的表现都最好。