van den Hout Ardo, Böckenholt Ulf, van der Heijden Peter G M
J R Stat Soc Ser C Appl Stat. 2010 Aug;59(4):723-736. doi: 10.1111/j.1467-9876.2010.00720.x.
Randomized response is a misclassification design to estimate the prevalence of sensitive behaviour. Respondents who do not follow the instructions of the design are considered to be cheating. A mixture model is proposed to estimate the prevalence of sensitive behaviour and cheating in the case of a dual sampling scheme with direct questioning and randomized response. The mixing weight is the probability of cheating, where cheating is modelled separately for direct questioning and randomized response. For Bayesian inference, Markov chain Monte Carlo sampling is applied to sample parameter values from the posterior. The model makes it possible to analyse dual sample scheme data in a unified way and to assess cheating for direct questions as well as for randomized response questions. The research is illustrated with randomized response data concerning violations of regulations for social benefit.
随机化回答是一种用于估计敏感行为发生率的错误分类设计。不遵循该设计指示的受访者被视为作弊。本文提出了一种混合模型,用于在直接询问和随机化回答的双重抽样方案下估计敏感行为和作弊的发生率。混合权重是作弊的概率,其中分别针对直接询问和随机化回答对作弊进行建模。对于贝叶斯推断,应用马尔可夫链蒙特卡罗抽样从后验分布中抽取参数值。该模型使得能够以统一的方式分析双重抽样方案数据,并评估直接问题以及随机化回答问题中的作弊情况。通过关于违反社会福利规定的随机化回答数据对该研究进行了说明。