Institute of Biological and Environmental Sciences, School of Biological Science, University of Aberdeen, 23 Street Machar Drive, Aberdeen AB24 3UU, U.K.
Tasmanian Institute of Agriculture, University of Tasmania, Newnham Drive, Launceston, Tasmania 7248, Australia.
Environ Sci Technol. 2022 Sep 20;56(18):13485-13498. doi: 10.1021/acs.est.2c02023. Epub 2022 Sep 2.
There is a growing realization that the complexity of model ensemble studies depends not only on the models used but also on the experience and approach used by modelers to calibrate and validate results, which remain a source of uncertainty. Here, we applied a multi-criteria decision-making method to investigate the rationale applied by modelers in a model ensemble study where 12 process-based different biogeochemical model types were compared across five successive calibration stages. The modelers shared a common level of agreement about the importance of the variables used to initialize their models for calibration. However, we found inconsistency among modelers when judging the importance of input variables across different calibration stages. The level of subjective weighting attributed by modelers to calibration data decreased sequentially as the extent and number of variables provided increased. In this context, the perceived importance attributed to variables such as the fertilization rate, irrigation regime, soil texture, pH, and initial levels of soil organic carbon and nitrogen stocks was statistically different when classified according to model types. The importance attributed to input variables such as experimental duration, gross primary production, and net ecosystem exchange varied significantly according to the length of the modeler's experience. We argue that the gradual access to input data across the five calibration stages negatively influenced the consistency of the interpretations made by the modelers, with cognitive bias in "trial-and-error" calibration routines. Our study highlights that overlooking human and social attributes is critical in the outcomes of modeling and model intercomparison studies. While complexity of the processes captured in the model algorithms and parameterization is important, we contend that (1) the modeler's assumptions on the extent to which parameters should be altered and (2) modeler perceptions of the importance of model parameters are just as critical in obtaining a quality model calibration as numerical or analytical details.
人们越来越认识到,模型集成研究的复杂性不仅取决于所使用的模型,还取决于建模者用于校准和验证结果的经验和方法,而这些方法仍然是不确定性的来源。在这里,我们应用了一种多准则决策方法来研究模型集成研究中建模者应用的基本原理,在该研究中,比较了 12 种基于过程的不同生物地球化学模型类型,跨越了五个连续的校准阶段。建模者对用于初始化模型以进行校准的变量的重要性有共同的认识。然而,我们发现建模者在判断不同校准阶段输入变量的重要性时存在不一致性。建模者对校准数据赋予的主观权重随着提供的变量的范围和数量的增加而依次降低。在这种情况下,根据模型类型对变量进行分类时,施肥率、灌溉制度、土壤质地、pH 值以及土壤有机碳和氮储量的初始水平等变量的感知重要性存在统计学差异。输入变量的重要性,如实验持续时间、总初级生产力和净生态系统交换,根据建模者经验的长短而有显著差异。我们认为,在五个校准阶段逐步获得输入数据会对建模者做出的解释的一致性产生负面影响,在“反复试验”的校准例程中存在认知偏差。我们的研究强调,在建模和模型比较研究的结果中忽略人类和社会属性是至关重要的。虽然模型算法和参数化中捕获的过程的复杂性很重要,但我们认为,(1)建模者对应该改变参数的程度的假设,以及(2)建模者对模型参数重要性的看法,对于获得高质量的模型校准与数值或分析细节同样重要。