Reitan Trond, Nielsen Anders
Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, P.O. Box 1066 Blindern, NO-0316 Oslo, Norway.
PLoS One. 2016 Feb 12;11(2):e0149129. doi: 10.1371/journal.pone.0149129. eCollection 2016.
Studies in ecology are often describing observed variations in a certain ecological phenomenon by use of environmental explanatory variables. A common problem is that the numerical nature of the ecological phenomenon does not always fit the assumptions underlying traditional statistical tests. A text book example comes from pollination ecology where flower visits are normally reported as frequencies; number of visits per flower per unit time. Using visitation frequencies in statistical analyses comes with two major caveats: the lack of knowledge on its error distribution and that it does not include all information found in the data; 10 flower visits in 20 flowers is treated the same as recording 100 visits in 200 flowers. We simulated datasets with various "flower visitation distributions" over various numbers of flowers observed (exposure) and with different types of effects inducing variation in the data. The different datasets were then analyzed first with the traditional approach using number of visits per flower and then by using count data models. The analysis of count data gave a much better chance of detecting effects than the traditionally used frequency approach. We conclude that if the data structure, statistical analyses and interpretations of results are mixed up, valuable information can be lost.
生态学研究常常通过使用环境解释变量来描述某种生态现象中观察到的变化。一个常见的问题是,生态现象的数值性质并不总是符合传统统计检验所依据的假设。一个教科书式的例子来自传粉生态学,在那里花朵访花情况通常以频率来报告;即单位时间内每朵花的访花次数。在统计分析中使用访花频率存在两个主要问题:一是对其误差分布缺乏了解,二是它没有包含数据中的所有信息;20朵花中有10次访花与在200朵花中有100次访花被同等对待。我们模拟了在不同数量的观察花朵(暴露量)上具有各种“花朵访花分布”以及具有不同类型效应从而导致数据变化的数据集。然后,首先用传统方法对不同数据集进行分析,即使用每朵花的访花次数,接着使用计数数据模型进行分析。与传统使用的频率方法相比,计数数据分析更有可能检测到效应。我们得出结论,如果数据结构、统计分析和结果解释相互混淆,有价值的信息可能会丢失。