• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用潜在类别识别项目缺失调查数据的模式:一项观察性研究。

Identifying patterns of item missing survey data using latent groups: an observational study.

作者信息

Barnett Adrian G, McElwee Paul, Nathan Andrea, Burton Nicola W, Turrell Gavin

机构信息

School of Public Health and Social Work and Institute of Health and Biomedical Innovation, Queensland University of Technology, Kelvin Grove, Queensland, Australia.

Institute for Health and Ageing, Australian Catholic University, Melbourne, Victoria, Australia.

出版信息

BMJ Open. 2017 Oct 30;7(10):e017284. doi: 10.1136/bmjopen-2017-017284.

DOI:10.1136/bmjopen-2017-017284
PMID:29084795
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5665304/
Abstract

OBJECTIVES

To examine whether respondents to a survey of health and physical activity and potential determinants could be grouped according to the questions they missed, known as 'item missing'.

DESIGN

Observational study of longitudinal data.

SETTING

Residents of Brisbane, Australia.

PARTICIPANTS

6901 people aged 40-65 years in 2007.

MATERIALS AND METHODS

We used a latent class model with a mixture of multinomial distributions and chose the number of classes using the Bayesian information criterion. We used logistic regression to examine if participants' characteristics were associated with their modal latent class. We used logistic regression to examine whether the amount of item missing in a survey predicted wave missing in the following survey.

RESULTS

Four per cent of participants missed almost one-fifth of the questions, and this group missed more questions in the middle of the survey. Eighty-three per cent of participants completed almost every question, but had a relatively high missing probability for a question on sleep time, a question which had an inconsistent presentation compared with the rest of the survey. Participants who completed almost every question were generally younger and more educated. Participants who completed more questions were less likely to miss the next longitudinal wave.

CONCLUSIONS

Examining patterns in item missing data has improved our understanding of how missing data were generated and has informed future survey design to help reduce missing data.

摘要

目的

调查健康与身体活动及潜在决定因素调查的受访者是否可根据他们未回答的问题(即“项目缺失”)进行分组。

设计

对纵向数据的观察性研究。

研究地点

澳大利亚布里斯班的居民。

参与者

2007年6901名年龄在40至65岁之间的人。

材料与方法

我们使用了一个具有多项分布混合的潜在类别模型,并使用贝叶斯信息准则选择类别数量。我们使用逻辑回归来检验参与者的特征是否与其模态潜在类别相关。我们使用逻辑回归来检验一项调查中的项目缺失量是否能预测下一次调查中的波次缺失。

结果

4%的参与者几乎未回答五分之一的问题,且该组在调查中期未回答的问题更多。83%的参与者几乎回答了每一个问题,但关于睡眠时间的问题缺失概率相对较高,该问题与调查的其他部分呈现方式不一致。几乎回答了每一个问题的参与者通常更年轻且受教育程度更高。回答问题更多的参与者错过下一次纵向波次的可能性更小。

结论

检查项目缺失数据中的模式增进了我们对缺失数据产生方式的理解,并为未来的调查设计提供了信息,以帮助减少缺失数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a5b/5665304/57b5729b0991/bmjopen-2017-017284f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a5b/5665304/cf5030692e3f/bmjopen-2017-017284f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a5b/5665304/e29e63a2f62a/bmjopen-2017-017284f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a5b/5665304/c91140f792ff/bmjopen-2017-017284f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a5b/5665304/6b70dd95c09b/bmjopen-2017-017284f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a5b/5665304/57b5729b0991/bmjopen-2017-017284f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a5b/5665304/cf5030692e3f/bmjopen-2017-017284f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a5b/5665304/e29e63a2f62a/bmjopen-2017-017284f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a5b/5665304/c91140f792ff/bmjopen-2017-017284f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a5b/5665304/6b70dd95c09b/bmjopen-2017-017284f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a5b/5665304/57b5729b0991/bmjopen-2017-017284f05.jpg

相似文献

1
Identifying patterns of item missing survey data using latent groups: an observational study.使用潜在类别识别项目缺失调查数据的模式:一项观察性研究。
BMJ Open. 2017 Oct 30;7(10):e017284. doi: 10.1136/bmjopen-2017-017284.
2
Non-ignorable missingness in logistic regression.逻辑回归中的不可忽略缺失值
Stat Med. 2017 Aug 30;36(19):3005-3021. doi: 10.1002/sim.7349. Epub 2017 Jun 2.
3
Potential implications of missing income data in population-based surveys: an example from a postpartum survey in California.基于人群的调查中缺失收入数据的潜在影响:以加利福尼亚州的一项产后调查为例。
Public Health Rep. 2007 Nov-Dec;122(6):753-63. doi: 10.1177/003335490712200607.
4
On the impact of nonresponse in logistic regression: application to the 45 and Up study.关于逻辑回归中无应答的影响:应用于“45岁及以上”研究
BMC Med Res Methodol. 2017 May 8;17(1):80. doi: 10.1186/s12874-017-0355-z.
5
A Comparison of Web and Telephone Responses From a National HIV and AIDS Survey.一项全国艾滋病毒和艾滋病调查中网络和电话应答的比较。
JMIR Public Health Surveill. 2016 Jul 29;2(2):e37. doi: 10.2196/publichealth.5184.
6
The Funen Neck and Chest Pain study: analysing non-response bias by using national vital statistic data.菲英岛颈部和胸痛研究:利用国家生命统计数据分析无应答偏倚。
Eur J Epidemiol. 2006;21(3):171-80. doi: 10.1007/s10654-006-0006-x.
7
A simple imputation algorithm reduced missing data in SF-12 health surveys.一种简单的插补算法减少了SF-12健康调查中的缺失数据。
J Clin Epidemiol. 2005 Feb;58(2):142-9. doi: 10.1016/j.jclinepi.2004.06.005.
8
Do Participants With Different Patterns of Loss to Follow-Up Have Different Characteristics? A Multi-Wave Longitudinal Study.不同失访模式的参与者是否具有不同特征?一项多波纵向研究。
J Epidemiol. 2016;26(1):45-9. doi: 10.2188/jea.JE20150015. Epub 2015 Aug 29.
9
A Bayesian analysis of mixture structural equation models with non-ignorable missing responses and covariates.带有不可忽略缺失响应和协变量的混合结构方程模型的贝叶斯分析。
Stat Med. 2010 Aug 15;29(18):1861-74. doi: 10.1002/sim.3915.
10
Non-response in a survey of cardiovascular risk factors in the Dutch population: determinants and resulting biases.荷兰人群心血管危险因素调查中的无应答情况:决定因素及由此产生的偏倚
Public Health. 2006 Apr;120(4):297-308. doi: 10.1016/j.puhe.2005.09.008. Epub 2005 Dec 20.

引用本文的文献

1
Randomized controlled trial investigating the effect of a childbirth and parenting booklet intervention on paternal postpartum depression risk.一项随机对照试验,探究分娩与育儿手册干预措施对父亲产后抑郁风险的影响。
BMC Pregnancy Childbirth. 2025 Oct 15;25(1):1095. doi: 10.1186/s12884-025-08200-z.
2
Lower HAGOS subscale scores associated with a longer duration of groin problems in football players in the subsequent season.在接下来的赛季中,较低的HAGOS子量表得分与足球运动员腹股沟问题持续时间较长有关。
BMJ Open Sport Exerc Med. 2024 Apr 27;10(2):e001812. doi: 10.1136/bmjsem-2023-001812. eCollection 2024.

本文引用的文献

1
Using decision trees to understand structure in missing data.使用决策树来理解缺失数据中的结构。
BMJ Open. 2015 Jun 29;5(6):e007450. doi: 10.1136/bmjopen-2014-007450.
2
Effect of questionnaire length, personalisation and reminder type on response rate to a complex postal survey: randomised controlled trial.问卷长度、个性化和提醒类型对复杂邮寄调查响应率的影响:随机对照试验。
BMC Med Res Methodol. 2011 May 6;11:62. doi: 10.1186/1471-2288-11-62.
3
Bias and efficiency of multiple imputation compared with complete-case analysis for missing covariate values.
缺失协变量值的多重插补与完全案例分析相比的偏差和效率。
Stat Med. 2010 Dec 10;29(28):2920-31. doi: 10.1002/sim.3944.
4
HABITAT: A longitudinal multilevel study of physical activity change in mid-aged adults.栖息地:一项关于中年成年人身体活动变化的纵向多层次研究。
BMC Public Health. 2009 Mar 5;9:76. doi: 10.1186/1471-2458-9-76.
5
Multivariate modelling of responses to conditional items: New possibilities for latent class analysis.
Stat Med. 2009 Jun 30;28(14):1927-39. doi: 10.1002/sim.3550.
6
Approaches for estimating prevalence ratios.估计患病率比的方法。
Occup Environ Med. 2008 Jul;65(7):481, 501-6. doi: 10.1136/oem.2007.034777.
7
Multiple imputation: review of theory, implementation and software.多重填补:理论、实施与软件综述
Stat Med. 2007 Jul 20;26(16):3057-77. doi: 10.1002/sim.2787.
8
Latent pattern mixture models for informative intermittent missing data in longitudinal studies.纵向研究中用于信息性间歇性缺失数据的潜在模式混合模型。
Biometrics. 2004 Jun;60(2):295-305. doi: 10.1111/j.0006-341X.2004.00173.x.
9
Handling drop-out in longitudinal studies.纵向研究中的失访处理。
Stat Med. 2004 May 15;23(9):1455-97. doi: 10.1002/sim.1728.
10
The use of fractional polynomials to model continuous risk variables in epidemiology.在流行病学中使用分数多项式对连续风险变量进行建模。
Int J Epidemiol. 1999 Oct;28(5):964-74. doi: 10.1093/ije/28.5.964.