Houston Tracy Durrant, Hiederer Roland
European Commission Joint Research Centre, Institute for Environment and Sustainability, Land Management and Natural Hazards Unit, Via Fermi, Ispra, VA I-21027, Italy.
J Environ Monit. 2009 Apr;11(4):774-81. doi: 10.1039/b818274b. Epub 2009 Feb 13.
Managing data in the context of environmental monitoring is associated with a number of particular difficulties. These can be broadly split into issues originating from the inherent heterogeneity of the parameters sampled, problems related to the long time scale of most monitoring programmes and situations that arise when attempting to maximise cost-effectiveness. The complexity of environmental systems is reflected in the considerable effort and cost required to collect good quality data describing the influencing factors that can improve our understanding of the interrelationships and allow us to draw conclusions about how changes will affect the systems. The resulting information is also frequently elaborate, costly and irreplaceable. Since the quality of the results obtained from analysing the data can only be as good as the data, proper management practices should be considered at all stages of the monitoring activity, if the value of the information is to be properly exploited. Using a Quality Assurance system can aid considerably in improving the overall quality of a database, and good metadata will help in the interpretation of the results. The benefits of implementing Quality Assurance principles to project management and data validation are demonstrated for the information collected for the long-term monitoring of the effects of air pollution on the forest environment under Forest Focus. However, there are limits in the ability of any computer system to detect erroneous or poor quality data, and the best approach is to minimise errors at the collection phase of the project as far as possible.
在环境监测背景下管理数据会遇到一些特殊困难。这些困难大致可分为以下几类:源于所采样参数固有异质性的问题、与大多数监测计划的长时间尺度相关的问题,以及在试图实现成本效益最大化时出现的情况。环境系统的复杂性体现在收集高质量数据所需的大量精力和成本上,这些数据用于描述那些能够增进我们对相互关系的理解并使我们能够得出变化将如何影响系统的结论的影响因素。由此产生的信息通常也很详尽、成本高昂且不可替代。由于从数据分析中获得的结果质量仅取决于数据本身,因此,如果要充分利用信息的价值,在监测活动的各个阶段都应考虑适当的管理做法。使用质量保证体系能够极大地有助于提高数据库的整体质量,而良好的元数据将有助于对结果进行解释。在《森林聚焦》项目下收集的关于空气污染对森林环境长期影响的信息中,展示了将质量保证原则应用于项目管理和数据验证的益处。然而,任何计算机系统检测错误或质量不佳数据的能力都有限,最佳方法是在项目收集阶段尽可能减少错误。