Santin Gaëlle, Geoffroy Béatrice, Bénézet Laetitia, Delézire Pauline, Chatelot Juliette, Sitta Rémi, Bouyer Jean, Gueguen Alice
Department of Occupational Health, InVS French Institute for Public Health Surveillance, 12 rue du Val d'Osne, F-94415 Saint-Maurice, France.
Department of Occupational Health, InVS French Institute for Public Health Surveillance, 12 rue du Val d'Osne, F-94415 Saint-Maurice, France.
J Clin Epidemiol. 2014 Jun;67(6):722-30. doi: 10.1016/j.jclinepi.2013.10.017. Epub 2014 Jan 31.
To show how reweighting can correct for unit nonresponse bias in an occupational health surveillance survey by using data from administrative databases in addition to classic sociodemographic data.
In 2010, about 10,000 workers covered by a French health insurance fund were randomly selected and were sent a postal questionnaire. Simultaneously, auxiliary data from routine health insurance and occupational databases were collected for all these workers. To model the probability of response to the questionnaire, logistic regressions were performed with these auxiliary data to compute weights for correcting unit nonresponse. Corrected prevalences of questionnaire variables were estimated under several assumptions regarding the missing data process. The impact of reweighting was evaluated by a sensitivity analysis.
Respondents had more reimbursement claims for medical services than nonrespondents but fewer reimbursements for medical prescriptions or hospitalizations. Salaried workers, workers in service companies, or who had held their job longer than 6 months were more likely to respond. Corrected prevalences after reweighting were slightly different from crude prevalences for some variables but meaningfully different for others.
Linking health insurance and occupational data effectively corrects for nonresponse bias using reweighting techniques. Sociodemographic variables may be not sufficient to correct for nonresponse.
通过使用行政数据库中的数据以及经典的社会人口学数据,展示重新加权如何校正职业健康监测调查中的单位无应答偏倚。
2010年,从一家法国健康保险基金覆盖的约10,000名工人中随机抽取,并向他们发送了邮政问卷。同时,收集了所有这些工人的常规健康保险和职业数据库的辅助数据。为了对问卷应答概率进行建模,使用这些辅助数据进行逻辑回归以计算用于校正单位无应答的权重。在关于缺失数据过程的几个假设下估计问卷变量的校正患病率。通过敏感性分析评估重新加权的影响。
应答者的医疗服务报销申请比无应答者多,但医疗处方或住院报销较少。受薪工人、服务公司的工人或工作超过6个月的工人更有可能应答。重新加权后的校正患病率对于某些变量与粗患病率略有不同,但对于其他变量则有显著差异。
使用重新加权技术将健康保险和职业数据相联系可有效校正无应答偏倚。社会人口学变量可能不足以校正无应答情况。