Science and Decisions Center, U.S. Geological Survey, 12201 Sunrise Valley Drive, Reston, VA, 20192, U.S.A.
The Wildlife Society, 5410 Grosvenor Lane, Suite 200, Bethesda, MD, 20814, U.S.A.
Conserv Biol. 2019 Jun;33(3):561-569. doi: 10.1111/cobi.13223. Epub 2018 Nov 27.
We examined features of citizen science that influence data quality, inferential power, and usefulness in ecology. As background context for our examination, we considered topics such as ecological sampling (probability based, purposive, opportunistic), linkage between sampling technique and statistical inference (design based, model based), and scientific paradigms (confirmatory, exploratory). We distinguished several types of citizen science investigations, from intensive research with rigorous protocols targeting clearly articulated questions to mass-participation internet-based projects with opportunistic data collection lacking sampling design, and examined overarching objectives, design, analysis, volunteer training, and performance. We identified key features that influence data quality: project objectives, design and analysis, and volunteer training and performance. Projects with good designs, trained volunteers, and professional oversight can meet statistical criteria to produce high-quality data with strong inferential power and therefore are well suited for ecological research objectives. Projects with opportunistic data collection, little or no sampling design, and minimal volunteer training are better suited for general objectives related to public education or data exploration because reliable statistical estimation can be difficult or impossible. In some cases, statistically robust analytical methods, external data, or both may increase the inferential power of certain opportunistically collected data. Ecological management, especially by government agencies, frequently requires data suitable for reliable inference. With standardized protocols, state-of-the-art analytical methods, and well-supervised programs, citizen science can make valuable contributions to conservation by increasing the scope of species monitoring efforts. Data quality can be improved by adhering to basic principles of data collection and analysis, designing studies to provide the data quality required, and including suitable statistical expertise, thereby strengthening the science aspect of citizen science and enhancing acceptance by the scientific community and decision makers.
我们研究了影响公民科学数据质量、推断力和在生态学中有用性的特征。作为我们研究的背景,我们考虑了生态采样的主题(基于概率、有目的、机会主义)、采样技术与统计推断之间的联系(基于设计、基于模型)和科学范式(验证性、探索性)。我们区分了几种类型的公民科学调查,从针对明确表述问题的严格协议的密集研究到缺乏采样设计的大规模参与的基于互联网的项目,以及考察了总体目标、设计、分析、志愿者培训和绩效。我们确定了影响数据质量的关键特征:项目目标、设计和分析以及志愿者培训和表现。具有良好设计、受过培训的志愿者和专业监督的项目可以满足统计标准,生成具有强大推断力的高质量数据,因此非常适合生态研究目标。具有机会主义数据收集、很少或没有采样设计以及最小化志愿者培训的项目更适合与公众教育或数据探索相关的一般目标,因为可靠的统计估计可能很困难或不可能。在某些情况下,统计上稳健的分析方法、外部数据或两者都可以提高某些机会性收集数据的推断力。生态管理,特别是政府机构,经常需要适合可靠推断的数据。通过标准化协议、最先进的分析方法和监督良好的计划,公民科学可以通过增加物种监测工作的范围,为保护做出有价值的贡献。通过遵守数据收集和分析的基本原则、设计研究以提供所需的数据质量以及包括适当的统计专业知识,可以提高数据质量,从而加强公民科学的科学方面,并增强科学界和决策者的接受度。