Nicholas School of the Environment, Duke University, Durham, North Carolina 27708, USA.
Ecol Appl. 2011 Jul;21(5):1523-36. doi: 10.1890/09-1212.1.
Recent developments suggest that predictive modeling could begin to play a larger role not only for data analysis, but also for data collection. We address the example of efficient wireless sensor networks, where inferential ecosystem models can be used to weigh the value of an observation against the cost of data collection. Transmission costs make observations "expensive"; networks will typically be deployed in remote locations without access to infrastructure (e.g., power). The capacity to sample intensively makes sensor networks valuable, but high-frequency data are informative only at specific times and locations. Sampling intervals will range from meters and seconds to landscapes and years, depending on the process, the current states of the system, the uncertainty about those states, and the perceived potential for rapid change. Given that intensive sampling is sometimes critical, but more often wasteful, how do we develop tools to control the measurement and transmission processes? We address the potential of data collection controlled and/or supplemented by inferential ecosystem models. In a given model, the value of an observation can be evaluated in terms of its contribution to estimates of state variables and important parameters. There will be more than one model applied to network data that will include as state variables water, carbon, energy balance, biogeochemistry, tree ecophysiology, and forest demographic processes. The value of an observation will depend on the application. Inference is needed to weigh the contributions against transmission cost. Network control must be dynamic and driven by models capable of learning about both the environment and the network. We discuss application of Bayesian inference to model data from a developing sensor network as a basis for controlling the measurement and transmission processes. Our examples involve soil moisture and sap flux, but we discuss broader application of the approach, including its implications for network design.
最近的发展表明,预测模型不仅可以在数据分析中发挥更大的作用,而且可以在数据收集方面发挥更大的作用。我们以高效的无线传感器网络为例,在这种网络中,可以使用推理生态系统模型来衡量观测值的价值与数据收集成本之间的关系。传输成本使观测变得“昂贵”;网络通常将部署在没有基础设施(例如,电力)的偏远地区。密集采样的能力使传感器网络具有价值,但高频数据只有在特定时间和地点才具有信息性。采样间隔将根据过程、系统当前状态、对这些状态的不确定性以及对快速变化的感知潜力,从米和秒到景观和年不等。鉴于密集采样有时是关键的,但更多的时候是浪费的,我们如何开发工具来控制测量和传输过程?我们探讨了通过推理生态系统模型控制和/或补充数据收集的潜力。在给定的模型中,可以根据其对状态变量和重要参数估计的贡献来评估观测值的价值。将有多个模型应用于网络数据,这些模型将包括水、碳、能量平衡、生物地球化学、树木生理生态学和森林人口统计过程等状态变量。观测值的价值将取决于应用。需要进行推断,以权衡对传输成本的贡献。网络控制必须是动态的,并由能够了解环境和网络的模型驱动。我们讨论了贝叶斯推断在控制测量和传输过程的基础上对发展中的传感器网络数据进行推断的应用。我们的例子涉及土壤湿度和树液流,但我们讨论了更广泛的应用方法,包括其对网络设计的影响。