Biologie des Ecosystèmes Aquatiques, CEMAGREF, Lyon, France.
Environ Manage. 2010 May;45(5):939-52. doi: 10.1007/s00267-010-9465-7. Epub 2010 Mar 19.
Ecological regionalizations define geographic regions exhibiting relative homogeneity in ecological (i.e., environmental and biotic) characteristics. Multivariate clustering methods have been used to define ecological regions based on subjectively chosen environmental variables. We developed and tested three procedures for defining ecological regions based on spatial modeling of a multivariate target pattern that is represented by compositional dissimilarities between locations (e.g., taxonomic dissimilarities). The procedures use a "training dataset" representing the target pattern and models this as a function of environmental variables. The model is then extrapolated to the entire domain of interest. Environmental data for our analysis were drawn from a 400 m grid covering all of Switzerland and consisted of 12 variables describing climate, topography and lithology. Our target patterns comprised land cover composition of each grid cell that was derived from interpretation of aerial photographs. For Regionalization 1 we used conventional cluster analysis of the environmental variables to define 60 hierarchically organized levels comprising from 5 to 300 regions. Regionalization 1 provided a base-case for comparison with the model-based regionalizations. Regionalization 2, 3 and 4 also comprised 60 hierarchically organized levels and were derived by modeling land cover composition for 4000 randomly selected "training" cells. Regionalization 2 was based on cluster analysis of environmental variables that were transformed based on a Generalized Dissimilarity Model (GDM). Regionalization 3 and 4 were defined by clustering the training cells based on their land cover composition followed by predictive modeling of the distribution of the land cover clusters using Classification and Regression Tree (CART) and Random Forest (RF) models. Independent test data (i.e. not used to train the models) were used to test the discrimination of land cover composition at all hierarchical levels of the regionalizations using the classification strength (CS) statistic. CS for all the model-based regionalizations was significantly higher than for Regionalization 1. Regionalization 3 and 4 performed significantly better than Regionalization 2 at finer hierarchical levels (many regions) and Regionalization 4 performed significantly better than Regionalization 3 for coarse levels of detail (few regions). Compositional modeling can significantly increase the performance of numerically defined ecological regionalizations. CART and RF-based models appear to produce stronger regionalizations because discriminating variables are able to change at each hierarchic level.
生态区域化定义了在生态(即环境和生物)特征方面具有相对同质性的地理区域。多元聚类方法已被用于根据主观选择的环境变量来定义生态区域。我们开发并测试了三种基于空间建模的方法来定义生态区域,该方法基于位置之间的多元目标模式(例如分类差异)的组成差异表示。这些程序使用代表目标模式的“训练数据集”并将其建模为环境变量的函数。然后将模型外推到整个感兴趣的区域。我们的分析使用了来自瑞士 400 米网格的环境数据,其中包含 12 个描述气候、地形和岩性的变量。我们的目标模式由每个网格单元的土地覆盖组成组成,这些组成是通过对航空照片的解释得出的。对于区域化 1,我们使用环境变量的常规聚类分析来定义 60 个层次组织的级别,这些级别由 5 到 300 个区域组成。区域化 1 提供了与基于模型的区域化进行比较的基础案例。区域化 2、3 和 4 也包含 60 个层次组织的级别,并且是通过对 4000 个随机选择的“训练”细胞的土地覆盖组成进行建模而得出的。区域化 2 是基于基于广义差异模型(GDM)转换的环境变量聚类分析得出的。区域化 3 和 4 是通过基于其土地覆盖组成对训练单元进行聚类,然后使用分类和回归树(CART)和随机森林(RF)模型对土地覆盖聚类的分布进行预测建模来定义的。使用分类强度(CS)统计量,使用独立的测试数据(即未用于训练模型的数据)测试了区域化所有层次结构中土地覆盖组成的区分能力。所有基于模型的区域化的 CS 明显高于区域化 1。在更精细的层次结构(许多区域)中,区域化 3 和 4 的性能明显优于区域化 2,而在较粗的细节级别(少数区域)中,区域化 4 的性能明显优于区域化 3。组成建模可以显著提高数字定义的生态区域化的性能。基于 CART 和 RF 的模型似乎产生了更强的区域化,因为区分变量可以在每个层次级别上发生变化。