Ebegil Meral, Özdemir Yaprak Arzu, Gökpinar Fikri
Faculty of Science Department of Statistics, Gazi University, Ankara, Turkey.
J Appl Stat. 2021 Mar 16;48(13-15):2473-2498. doi: 10.1080/02664763.2021.1895088. eCollection 2021.
In this study, some shrinkage estimators using a median ranked set sample in the presence of multicollinearity were studied. Initially, we constructed the multiple regression model using median ranked set sampling. We also adapted the Ridge and Liu-type estimators to these multiple regression model. To investigate the efficiency of these estimators, a simulation study was performed for a different number of explanatory variables, sample sizes, correlation coefficients, and error variances in perfect and imperfect ranking cases. In addition, these estimators were compared with other estimators that are based on ranked set sample using simulation study. It is shown that when the collinearity is moderate, Ridge estimator using median ranked set sample performs better than other estimators and when the collinearity increases, Liu-type estimator using median ranked set sample gets better than all other estimators do. When the collinearity is smaller than 0.95, ridge estimator based on median ranked set sample is more efficient than Liu-type estimator based on same sample. However, this threshold increases as the sample size increases and the number of explanatory variables decreases. In addition, real data example is presented to illustrate how collinearity affects the estimators under median ranked set sampling and ranked set sampling.
在本研究中,对存在多重共线性时使用中位数排序集样本的一些收缩估计量进行了研究。首先,我们使用中位数排序集抽样构建了多元回归模型。我们还将岭估计量和刘型估计量应用于这些多元回归模型。为了研究这些估计量的效率,针对不同数量的解释变量、样本量、相关系数以及完美和不完美排序情况下的误差方差进行了模拟研究。此外,通过模拟研究将这些估计量与其他基于排序集样本的估计量进行了比较。结果表明,当中共线性适中时,使用中位数排序集样本的岭估计量比其他估计量表现更好;当中共线性增加时,使用中位数排序集样本的刘型估计量比所有其他估计量表现更好。当共线性小于0.95时,基于中位数排序集样本的岭估计量比基于相同样本的刘型估计量更有效。然而,随着样本量增加和解释变量数量减少,这个阈值会增大。此外,给出了实际数据示例,以说明共线性如何影响中位数排序集抽样和排序集抽样下的估计量。