• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估稀疏医学数据模型中的拟合优度:一种模拟方法。

Evaluating the goodness of fit in models of sparse medical data: a simulation approach.

作者信息

Boyle P, Flowerdew R, Williams A

机构信息

School of Geography, University of Leeds, UK.

出版信息

Int J Epidemiol. 1997 Jun;26(3):651-6. doi: 10.1093/ije/26.3.651.

DOI:10.1093/ije/26.3.651
PMID:9222792
Abstract

BACKGROUND

Epidemiological studies of rare events, which are common in the medical literature, often involve modeling sparse data sets. Assessing the fit of these models may be complicated by the large numbers of observed zeros in the data set.

METHODS

Poisson models, fitted as generalized linear models, were used to investigate the referral patterns of patients suffering from end-stage renal failure in south west Wales. The usual method for assessing the goodness of fit is to compare the deviance with a chi 2 distribution with appropriate degrees of freedom. However, this test may be invalid when the data set is sparse, as the deviance values may be unusually low compared to the degrees of freedom. This would suggest that there is a problem with underdispersion when, in fact, the large numbers of zeros in the data set make the comparison with the chi 2 distribution unreliable. A simulation approach is advocated as an alternative method of assessing model fit in these situations.

RESULTS

Three models are considered in detail here. The first modelled the total referrals in each of the 245 wards in the study area and included two explanatory variables. These observations were not unusually sparse and both the chi 2 goodness of fit test and the simulation methodology outlined here suggested that the model did not fit. The second model included the population 'at risk' as an offset and the model improved considerably. Both the chi 2 test and the simulation approach suggested that this model did fit. Finally, the data were disaggregated into five age groups providing 1225 observations and a very sparse data set. According to the chi 2 goodness of fit test, the deviance was very low suggesting that the model was underdispersed. Using simulated data, it was shown that the deviance was not unusually low and that the model fitted the data reasonably well.

CONCLUSION

In cases where the data set being modelled is sparse, it is useful to test the goodness of fit of a Poisson model using a simulation approach, rather than relying on the chi 2 test.

摘要

背景

罕见事件的流行病学研究在医学文献中很常见,通常涉及对稀疏数据集进行建模。评估这些模型的拟合度可能会因数据集中大量观察到的零值而变得复杂。

方法

将泊松模型作为广义线性模型进行拟合,用于研究威尔士西南部终末期肾衰竭患者的转诊模式。评估拟合优度的常用方法是将偏差与具有适当自由度的卡方分布进行比较。然而,当数据集稀疏时,此检验可能无效,因为与自由度相比,偏差值可能异常低。这表明存在过度离散的问题,而实际上数据集中的大量零值使得与卡方分布的比较不可靠。提倡使用模拟方法作为在这些情况下评估模型拟合度的替代方法。

结果

这里详细考虑了三个模型。第一个模型对研究区域内245个病房中的每一个的总转诊量进行建模,并包括两个解释变量。这些观察结果并非异常稀疏,卡方拟合优度检验和此处概述的模拟方法均表明该模型不拟合。第二个模型将“处于风险中的”人群作为偏移量,模型有了显著改进。卡方检验和模拟方法均表明该模型拟合良好。最后,数据被分解为五个年龄组,提供了1225个观察值,形成了一个非常稀疏的数据集。根据卡方拟合优度检验,偏差非常低,表明模型存在过度离散。使用模拟数据表明,偏差并非异常低,并且模型对数据的拟合相当好。

结论

在对稀疏数据集进行建模的情况下,使用模拟方法而不是依赖卡方检验来检验泊松模型的拟合优度是有用的。

相似文献

1
Evaluating the goodness of fit in models of sparse medical data: a simulation approach.评估稀疏医学数据模型中的拟合优度:一种模拟方法。
Int J Epidemiol. 1997 Jun;26(3):651-6. doi: 10.1093/ije/26.3.651.
2
Two goodness-of-fit tests for logistic regression models with continuous covariates.针对具有连续协变量的逻辑回归模型的两种拟合优度检验。
Stat Med. 2002 Jan 15;21(1):79-93. doi: 10.1002/sim.943.
3
Global goodness-of-fit tests in logistic regression with sparse data.稀疏数据逻辑回归中的全局拟合优度检验。
Stat Med. 2002 Dec 30;21(24):3789-801. doi: 10.1002/sim.1421.
4
Goodness-of-fit tests for GEE modeling with binary responses.二元响应的广义估计方程(GEE)建模的拟合优度检验。
Biometrics. 1998 Jun;54(2):720-9.
5
[Goodness of fit in polytomous items: Type I error rates and empirical power for three fit-indexes].[多分类项目的拟合优度:三种拟合指数的I型错误率和实证检验力]
Psicothema. 2009 Nov;21(4):639-45.
6
The sum of standardized residuals: Goodness-of-fit test for binary response models.标准化残差之和:二项反应模型拟合优度检验。
Stat Med. 2018 May 20;37(11):1932-1941. doi: 10.1002/sim.7644. Epub 2018 Mar 26.
7
Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test.评估大样本中逻辑回归模型的拟合优度:Hosmer-Lemeshow 检验的改进。
Biometrics. 2020 Jun;76(2):549-560. doi: 10.1111/biom.13249. Epub 2020 Apr 6.
8
Goodness-of-fit tests for modified Poisson regression possibly producing fitted values exceeding one in binary outcome analysis.拟合优度检验用于修正泊松回归,在二项结果分析中,拟合值可能超过 1。
Stat Methods Med Res. 2024 Jul;33(7):1185-1196. doi: 10.1177/09622802241254220. Epub 2024 May 23.
9
Functional linear models for zero-inflated count data with application to modeling hospitalizations in patients on dialysis.用于零膨胀计数数据的功能线性模型及其在透析患者住院情况建模中的应用。
Stat Med. 2014 Nov 30;33(27):4825-40. doi: 10.1002/sim.6241. Epub 2014 Jun 19.
10
Testing goodness of fit of a uniform truncation model.检验均匀截断模型的拟合优度。
Biometrics. 2007 Jun;63(2):405-12. doi: 10.1111/j.1541-0420.2006.00710.x.

引用本文的文献

1
Inference for Under-Dispersed Data: Assessing the Performance of an Airborne Spacing Algorithm.欠分散数据的推断:评估一种机载间距算法的性能。
Qual Eng. 2018;30(4):546-555. doi: 10.1080/08982112.2018.1482339. Epub 2018 Oct 18.
2
Mild cognitive impairment and structural brain abnormalities in a sexagenarian with a history of childhood traumatic brain injury.一位 60 岁老人有童年创伤性脑损伤病史,表现为轻度认知障碍和结构性脑异常。
J Neurosci Res. 2018 Apr;96(4):652-660. doi: 10.1002/jnr.24084. Epub 2017 May 20.