Ecol Appl. 2014 Jun;24(4):699-715. doi: 10.1890/13-0600.1.
Efforts to test and improve terrestrial biosphere models (TBMs) using a variety of data sources have become increasingly common. Yet, geographically extensive forest inventories have been under-exploited in previous model-data fusion efforts. Inventory observations of forest growth, mortality, and biomass integrate processes across a range of timescales, including slow timescale processes such as species turnover, that are likely to have important effects on ecosystem responses to environmental variation. However, the large number (thousands) of inventory plots precludes detailed measurements at each location, so that uncertainty in climate, soil properties, and other environmental drivers may be large. Errors in driver variables, if ignored, introduce bias into model-data fusion. We estimated errors in climate and soil drivers at U.S. Forest Inventory and Analysis (FIA) plots, and we explored the effects of these errors on model-data fusion with the Geophysical Fluid Dynamics Laboratory LM3V dynamic global vegetation model. When driver errors were ignored or assumed small at FIA plots, responses of biomass production in LM3V to precipitation and soil available water capacity appeared steeper than the corresponding responses estimated from FIA data. These differences became nonsignificant if driver errors at FIA plots were assumed to be large. Ignoring driver errors when optimizing LM3V parameter values yielded estimates for fine-root allocation that were larger than biometric estimates, which is consistent with the expected direction of bias. To explore whether complications posed by driver errors could be circumvented by relying on intensive study sites where driver errors are small, we performed a power analysis. To accurately quantify the response of biomass production to spatial variation in mean annual precipitation within the eastern United States would require at least 40 intensive study sites, which is larger than the number of sites typically available for individual biomes in existing plot networks. Driver errors may be accommodated by several existing model-data fusion approaches, including hierarchical Bayesian methods and ensemble filtering methods; however, these methods are computationally expensive. We propose a new approach, in which the TBM functional response is fit directly to the driver-error-corrected functional response estimated from data, rather than to the raw observations.
利用各种数据源来测试和改进陆地生物圈模型(TBM)的努力已变得越来越普遍。然而,在以前的模型-数据融合工作中,对具有广泛地理范围的森林清查的利用不足。森林生长、死亡率和生物量的清查观测综合了各种时间尺度的过程,包括物种更替等缓慢时间尺度的过程,这些过程可能对生态系统对环境变化的响应产生重要影响。然而,清查的地点数量众多(数千个),使得无法在每个地点进行详细测量,因此气候、土壤特性和其他环境驱动因素的不确定性可能很大。如果忽略驱动变量中的误差,则会在模型-数据融合中引入偏差。我们估计了美国森林清查和分析(FIA)清查点的气候和土壤驱动因素误差,并使用地球物理流体动力学实验室 LM3V 动态全球植被模型来探讨这些误差对模型-数据融合的影响。当在 FIA 清查点忽略或假设驱动因素误差较小时,LM3V 对降水和土壤有效水分容量的生物量生产响应似乎比从 FIA 数据估计的响应更为陡峭。如果假设 FIA 清查点的驱动因素误差较大,则这些差异变得不显著。在优化 LM3V 参数值时忽略驱动因素误差会导致细根分配的估计值大于生物计量学的估计值,这与预期的偏差方向一致。为了探讨是否可以通过依赖驱动因素误差较小的密集研究点来避免驱动因素误差带来的复杂性,我们进行了功率分析。要准确量化生物量生产对美国东部地区年均降水量空间变化的响应,至少需要 40 个密集研究点,这比现有清查网络中单个生物群落通常可用的点的数量要大。几种现有的模型-数据融合方法,包括层次贝叶斯方法和集合滤波方法,可以适应驱动因素误差;然而,这些方法计算成本很高。我们提出了一种新方法,其中 TBM 功能响应直接拟合到从数据中估计的驱动因素误差校正的功能响应,而不是原始观测值。