Härkänen Tommi, Kaikkonen Risto, Virtala Esa, Koskinen Seppo
Department of Health, Functional Capacity and Welfare National Institute for Health and Welfare (THL), P,O, Box 30, FI-00271 Helsinki, Finland.
BMC Public Health. 2014 Nov 6;14:1150. doi: 10.1186/1471-2458-14-1150.
To assess the nonresponse rates in a questionnaire survey with respect to administrative register data, and to correct the bias statistically.
The Finnish Regional Health and Well-being Study (ATH) in 2010 was based on a national sample and several regional samples. Missing data analysis was based on socio-demographic register data covering the whole sample. Inverse probability weighting (IPW) and doubly robust (DR) methods were estimated using the logistic regression model, which was selected using the Bayesian information criteria. The crude, weighted and true self-reported turnout in the 2008 municipal election and prevalences of entitlements to specially reimbursed medication, and the crude and weighted body mass index (BMI) means were compared.
The IPW method appeared to remove a relatively large proportion of the bias compared to the crude prevalence estimates of the turnout and the entitlements to specially reimbursed medication. Several demographic factors were shown to be associated with missing data, but few interactions were found.
Our results suggest that the IPW method can improve the accuracy of results of a population survey, and the model selection provides insight into the structure of missing data. However, health-related missing data mechanisms are beyond the scope of statistical methods, which mainly rely on socio-demographic information to correct the results.
评估问卷调查中相对于行政登记数据的无应答率,并进行统计学上的偏差校正。
2010年芬兰地区健康与幸福研究(ATH)基于全国样本和几个地区样本。缺失数据分析基于覆盖整个样本的社会人口登记数据。使用逻辑回归模型估计逆概率加权(IPW)和双重稳健(DR)方法,该模型通过贝叶斯信息准则进行选择。比较了2008年市政选举中的原始、加权和真实自我报告投票率,以及特殊报销药物的享有率,还比较了原始和加权体重指数(BMI)均值。
与投票率和特殊报销药物享有率的原始患病率估计相比,IPW方法似乎消除了相对较大比例的偏差。几个人口统计学因素显示与缺失数据有关,但发现的相互作用很少。
我们的结果表明,IPW方法可以提高人口调查结果的准确性,并且模型选择为缺失数据的结构提供了见解。然而,与健康相关的缺失数据机制超出了主要依赖社会人口信息校正结果的统计方法的范围。