Missing in space: an evaluation of imputation methods for missing data in spatial analysis of risk factors for type II diabetes.

Authors

Baker Jannah, White Nicole, Mengersen Kerrie

Affiliation

Queensland University of Technology School of Mathematical Sciences, Brisbane, Australia.

Publication

Int J Health Geogr. 2014 Nov 20;13:47. doi: 10.1186/1476-072X-13-47.

DOI:10.1186/1476-072X-13-47
PMID:25410053
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4287494/
Abstract

BACKGROUND

Spatial analysis is increasingly important for identifying modifiable geographic risk factors for disease. However, spatial health data from surveys are often incomplete, ranging from missing data for only a few variables, to missing data for many variables. For spatial analyses of health outcomes, selection of an appropriate imputation method is critical in order to produce the most accurate inferences.

METHODS

We present a cross-validation approach to select between three imputation methods for health survey data with correlated lifestyle covariates, using as a case study, type II diabetes mellitus (DM II) risk across 71 Queensland Local Government Areas (LGAs). We compare the accuracy of mean imputation to imputation using multivariate normal and conditional autoregressive prior distributions.
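The idea of selecting an imputation method by cross-validation can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give their model code, and the neighbour-average imputer below is only a crude, non-Bayesian stand-in for a spatially informed method, not a conditional autoregressive prior. The toy data (71 areas arranged in a chain, one standard-normal covariate, 10 missing values) and all function names are hypothetical. Observed values are hidden fold by fold, each method imputes them, and the method with the lowest root-mean-square error on the hidden values is selected.

```python
import numpy as np

def mean_impute(values, missing):
    """Fill missing entries with the mean of the observed entries."""
    filled = values.copy()
    filled[missing] = values[~missing].mean()
    return filled

def neighbour_impute(values, missing, adjacency):
    """Fill each missing entry with the mean of its observed neighbours.

    Falls back to the overall observed mean when no neighbour is observed.
    """
    filled = values.copy()
    overall = values[~missing].mean()
    for i in np.flatnonzero(missing):
        nbrs = np.flatnonzero(adjacency[i] & ~missing)
        filled[i] = values[nbrs].mean() if nbrs.size else overall
    return filled

def cv_rmse(values, observed, impute, n_folds=5, seed=0):
    """K-fold cross-validation over the observed entries.

    Each fold is hidden in turn, imputed, and scored against the true values.
    """
    rng = np.random.default_rng(seed)
    idx = rng.permutation(np.flatnonzero(observed))
    sq_err = []
    for fold in np.array_split(idx, n_folds):
        missing = ~observed          # genuinely missing entries
        missing[fold] = True         # plus the held-out fold
        filled = impute(values, missing)
        sq_err.extend((filled[fold] - values[fold]) ** 2)
    return float(np.sqrt(np.mean(sq_err)))

# Hypothetical toy data: 71 areas in a chain, each adjacent to the next.
n = 71
rng = np.random.default_rng(1)
adjacency = np.zeros((n, n), dtype=bool)
for i in range(n - 1):
    adjacency[i, i + 1] = adjacency[i + 1, i] = True

values = rng.normal(size=n)
observed = np.ones(n, dtype=bool)
observed[rng.choice(n, size=10, replace=False)] = False
values[~observed] = np.nan           # mark the genuinely missing values

scores = {
    "mean": cv_rmse(values, observed, mean_impute),
    "neighbour": cv_rmse(
        values, observed, lambda v, m: neighbour_impute(v, m, adjacency)
    ),
}
best = min(scores, key=scores.get)
print(scores, "selected:", best)
```

Because the toy covariate has no spatial structure, a neighbour-based method has nothing to exploit here; with real survey data the ranking depends on how strongly the covariate is spatially correlated, which is why the comparison is made empirically.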

RESULTS

Choice of imputation method depends upon the application and is not necessarily the most complex method. Mean imputation was selected as the most accurate method in this application.

CONCLUSIONS

Selecting an appropriate imputation method for health survey data, after accounting for spatial correlation and correlation between covariates, allows more complete analysis of geographic risk factors for disease with more confidence in the results to inform public policy decision-making.
