Alvandi Amirhossein, Hatefi Armin
Department of Mathematics and Statistics, University of Massachusetts, Amherst, MA, USA.
Department of Mathematics and Statistics, 7512Memorial University of Newfoundland, Memorial University of Newfoundland, St John's, NL, Canada.
Stat Methods Med Res. 2021 Aug;30(8):1960-1975. doi: 10.1177/09622802211025989. Epub 2021 Jul 4.
In many surveys, we often deal with situations where measuring the study variable is expensive; however, there are easy-to-measure characteristics which can be used as ranking information to obtain more representative samples from the population. Ranked set sampling is successfully employed in these cases as an alternative to commonly used simple random sampling. When the data is ordinal categorical, it is common to apply the ordinal logistic regression approach to ranked set sampling data for the estimation of parameters. This technique first depends on the information of training data. Besides, one is not capable of using the ranking information in the estimation process. In this paper, we propose a ranked set sampling scheme in which ranking information from multiple sources can be combined and incorporated efficiently into both data collection and estimation. The ranked set sampling data is used for non-parametric and maximum likelihood estimation of ordinal categorical population. Through extensive simulation studies, the performance of estimators is evaluated. The methods are finally applied to analyze bone disorder data and obesity data.
在许多调查中,我们常常会遇到测量研究变量成本高昂的情况;然而,存在一些易于测量的特征,可将其用作排序信息,以便从总体中获取更具代表性的样本。在这些情况下,排序集抽样作为常用简单随机抽样的替代方法被成功采用。当数据为有序分类数据时,通常会应用有序逻辑回归方法对排序集抽样数据进行参数估计。该技术首先依赖于训练数据的信息。此外,在估计过程中无法使用排序信息。在本文中,我们提出了一种排序集抽样方案,其中来自多个来源的排序信息可以被有效组合并纳入数据收集和估计过程。排序集抽样数据用于有序分类总体的非参数估计和最大似然估计。通过广泛的模拟研究,对估计量的性能进行了评估。这些方法最终被应用于分析骨骼疾病数据和肥胖数据。