Tomal Jabed H, Ciborowski Jan J H
Department of Mathematics and Statistics Thompson Rivers University Kamloops BC Canada.
Department of Biological Sciences University of Windsor Windsor ON Canada.
Ecol Evol. 2020 Nov 16;10(23):13500-13517. doi: 10.1002/ece3.6955. eCollection 2020 Dec.
The relationships between an environmental variable and an ecological response are usually estimated by models fitted through the conditional mean of the response given environmental stress. For example, nonparametric loess and parametric piecewise linear regression model (PLRM) are often used to represent simple to complex nonlinear relationships. In contrast, piecewise linear quantile regression models (PQRM) fitted across various quantiles of the response can reveal nonlinearities in its range of variation across the explanatory variable.We assess the number and positions of candidate breakpoints using loess and compare the relative efficiencies of PLRM and PQRM to quantitatively determine the breakpoints' location and precision. We propose a nonparametric method to generate bootstrap confidence intervals for breakpoints using PQRM and prediction bands for loess and PQRM. We illustrated the applications using data from two aquatic studies suspected to exhibit multiple environmental breakpoints: relating a fish multimetric index of community health (MMI) to agricultural activity in wetlands' adjacent drainage basins; and relating cyanobacterial biomass to total phosphorus concentration in Canadian lakes.Two statistically significant breakpoints were detected in each dataset, demarcating boundaries of three linear segments, each with markedly different slopes. PQRM generated less biased, more accurate, and narrower confidence intervals for the breakpoints and narrower prediction bands than PLRM, especially for small samples and large error variability. In both applications, the relationship between the response and environmental variables was weak/nonsignificant below the lower threshold, strong through the midrange of the environmental gradient, and weak/nonsignificant beyond the upper threshold.We describe several advantages of PQRM over PLRM in characterizing environmental relationships where the scatter of points represents natural environmental variation rather than measurement error. The proposed methodology will be useful for detecting multiple breakpoints in ecological applications where the limits of variation are as important as the conditional mean of a function.
环境变量与生态响应之间的关系通常通过根据给定环境压力下响应的条件均值拟合的模型来估计。例如,非参数局部加权回归和参数分段线性回归模型(PLRM)常被用于表示从简单到复杂的非线性关系。相比之下,在响应的不同分位数上拟合的分段线性分位数回归模型(PQRM)可以揭示响应在解释变量范围内变化的非线性特征。我们使用局部加权回归评估候选断点的数量和位置,并比较PLRM和PQRM的相对效率,以定量确定断点的位置和精度。我们提出了一种非参数方法,使用PQRM生成断点的自助置信区间,以及局部加权回归和PQRM的预测带。我们使用来自两项水生研究的数据说明了这些应用,这两项研究疑似存在多个环境断点:一项是将鱼类群落健康多指标指数(MMI)与湿地相邻流域的农业活动相关联;另一项是将蓝藻生物量与加拿大湖泊中的总磷浓度相关联。在每个数据集中都检测到了两个具有统计学意义的断点,划分出三个线性段的边界,每个线性段的斜率明显不同。与PLRM相比,PQRM为断点生成的偏差更小、更准确且更窄的置信区间,以及更窄的预测带,特别是对于小样本和大误差变异性的情况。在这两个应用中,响应与环境变量之间的关系在较低阈值以下较弱/不显著,在环境梯度的中间范围内较强,而在较高阈值以上较弱/不显著。我们描述了PQRM相对于PLRM在表征环境关系方面的几个优点,其中点的散布代表自然环境变化而非测量误差。所提出的方法将有助于在生态应用中检测多个断点,在这些应用中,变化的极限与函数的条件均值同样重要。