• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

何时可以忽略群组层面的聚类?多层模型与具有稀疏数据的单层面模型的比较。

When can group level clustering be ignored? Multilevel models versus single-level models with sparse data.

作者信息

Clarke P

机构信息

Institute for Social Research, University of Michigan, 426 Thompson Street, Ann Arbor, MI 48106-1248, USA.

出版信息

J Epidemiol Community Health. 2008 Aug;62(8):752-8. doi: 10.1136/jech.2007.060798.

DOI:10.1136/jech.2007.060798
PMID:18621963
Abstract

OBJECTIVE

The use of multilevel modelling with data from population-based surveys is often limited by the small number of cases per level-2 unit, prompting many researchers to use single-level techniques such as ordinary least squares regression.

DESIGN

Monte Carlo simulations are used to investigate the effects of data sparseness on the validity of parameter estimates in two-level versus single-level models.

SETTING

Both linear and non-linear hierarchical models are simulated in order to examine potential differences in the effects of small group size across continuous and discrete outcomes. Results are then compared with those obtained using disaggregated techniques (ordinary least squares and logistic regression).

MAIN RESULTS

At the extremes of data sparseness (two observations per group), the group level variance components are overestimated in the two-level models. But with an average of only five observations per group, valid and reliable estimates of all parameters can be obtained when using a two-level model with either a continuous or a discrete outcome. In contrast, researchers run the risk of Type I error (standard errors biased downwards) when using single-level models even when there are as few as two observations per group on average. Bias is magnified when modelling discrete outcomes.

CONCLUSIONS

Multilevel models can be reliably estimated with an average of only five observations per group. Disaggregated techniques carry an increased risk of Type I error, even in situations where there is only limited clustering in the data.

摘要

目的

基于人群调查数据使用多水平模型常常受到二级单位中病例数较少的限制,这促使许多研究人员使用诸如普通最小二乘法回归等单水平技术。

设计

采用蒙特卡洛模拟来研究数据稀疏性对二级模型与单水平模型中参数估计有效性的影响。

设置

对线性和非线性分层模型进行模拟,以检验在连续和离散结果中小组规模较小所产生影响的潜在差异。然后将结果与使用分解技术(普通最小二乘法和逻辑回归)获得的结果进行比较。

主要结果

在数据稀疏的极端情况下(每组两个观测值),二级模型中的组水平方差成分被高估。但每组平均仅有五个观测值时,使用具有连续或离散结果的二级模型可以获得所有参数的有效且可靠的估计值。相比之下,即使平均每组仅有两个观测值,研究人员在使用单水平模型时也存在I型错误的风险(标准误差向下偏差)。在对离散结果进行建模时偏差会放大。

结论

每组平均仅有五个观测值时,多水平模型也能得到可靠估计。即使在数据中聚类有限的情况下,分解技术也会增加I型错误的风险。

相似文献

1
When can group level clustering be ignored? Multilevel models versus single-level models with sparse data.何时可以忽略群组层面的聚类?多层模型与具有稀疏数据的单层面模型的比较。
J Epidemiol Community Health. 2008 Aug;62(8):752-8. doi: 10.1136/jech.2007.060798.
2
Comparing methods of analysing datasets with small clusters: case studies using four paediatric datasets.比较具有小聚类的数据集的分析方法:使用四个儿科数据集的案例研究
Paediatr Perinat Epidemiol. 2009 Jul;23(4):380-92. doi: 10.1111/j.1365-3016.2009.01046.x.
3
A simulation study of sample size for multilevel logistic regression models.多水平逻辑回归模型样本量的模拟研究
BMC Med Res Methodol. 2007 Jul 16;7:34. doi: 10.1186/1471-2288-7-34.
4
Making treatment effect inferences from multiple-baseline data: the utility of multilevel modeling approaches.从多基线数据推断治疗效果:多层次建模方法的效用。
Behav Res Methods. 2009 May;41(2):372-84. doi: 10.3758/BRM.41.2.372.
5
Just-identified versus overidentified two-level hierarchical linear models with missing data.具有缺失数据的刚识别与过度识别二级分层线性模型。
Biometrics. 2007 Dec;63(4):1262-8. doi: 10.1111/j.1541-0420.2007.00818.x. Epub 2007 May 14.
6
Synthesizing single-case studies: a Monte Carlo examination of a three-level meta-analytic model.综合单案例研究:三级元分析模型的蒙特卡罗检验。
Behav Res Methods. 2012 Sep;44(3):795-805. doi: 10.3758/s13428-011-0180-y.
7
The application of multilevel, multivariate modelling to orthodontic research data.多级多变量建模在正畸研究数据中的应用。
Community Dent Health. 2000 Dec;17(4):236-42.
8
Analysing multitrait-multimethod data with structural equation models for ordinal variables applying the WLSMV estimator: what sample size is needed for valid results?使用WLSMV估计量对有序变量应用结构方程模型分析多特质-多方法数据:获得有效结果需要多大样本量?
Br J Math Stat Psychol. 2006 May;59(Pt 1):195-213. doi: 10.1348/000711005X67490.
9
Comparison of statistical methods for analysis of clustered binary observations.用于分析聚类二元观测值的统计方法比较。
Stat Med. 2005 Mar 30;24(6):911-23. doi: 10.1002/sim.1958.
10
Strategies for analyzing multilevel cluster-randomized studies with binary outcomes collected at varying intervals of time.用于分析在不同时间间隔收集二元结局的多水平整群随机研究的策略。
Stat Med. 2008 Dec 20;27(29):6055-71. doi: 10.1002/sim.3446.

引用本文的文献

1
Children's Caregiving and Growth in Northwestern Tanzania: Limited Evidence That Support From Specific Caregivers Is Associated With Better Growth.坦桑尼亚西北部儿童的照料与成长:关于特定照料者的支持与更好成长相关的证据有限
Am J Hum Biol. 2025 Mar;37(3):e70029. doi: 10.1002/ajhb.70029.
2
Spatial variation and associated factors of unmet need for family planning among reproductive age women in Ethiopia, insights from 2021 PMA-Ethiopia data: multilevel logistic regression analysis.埃塞俄比亚育龄妇女计划生育需求未满足情况的空间差异及相关因素:基于2021年埃塞俄比亚人口与健康调查数据的见解——多层次逻辑回归分析
BMC Public Health. 2025 Feb 20;25(1):704. doi: 10.1186/s12889-025-21917-y.
3
Evaluating fit indices in a multilevel latent growth model with unbalanced design: a Monte Carlo study.
评估非平衡设计的多层次潜在增长模型中的拟合指数:一项蒙特卡罗研究。
Front Psychol. 2024 May 3;15:1366850. doi: 10.3389/fpsyg.2024.1366850. eCollection 2024.
4
An empirical analysis of the demand for family planning satisfied by modern methods among married or in-union women in Nigeria: Application of multilevel binomial logistic modelling technique.尼日利亚已婚或同居妇女对现代计划生育方法需求的实证分析:多水平二项逻辑回归模型技术的应用。
PLoS One. 2024 Mar 21;19(3):e0300744. doi: 10.1371/journal.pone.0300744. eCollection 2024.
5
Application of Hierarchical/Multilevel Models and Quality of Reporting (2010-2020): A Systematic Review.分层/多级模型的应用和报告质量(2010-2020):系统评价。
ScientificWorldJournal. 2024 Mar 8;2024:4658333. doi: 10.1155/2024/4658333. eCollection 2024.
6
Structural racism in primary schools and changes in epigenetic age acceleration among Black and White youth.小学中的结构性种族主义与黑人和白人青少年表观遗传年龄加速变化。
Soc Sci Med. 2024 Apr;347:116724. doi: 10.1016/j.socscimed.2024.116724. Epub 2024 Mar 2.
7
A Longitudinal Multilevel Analysis of the Effects of Contraceptive Failures on Unintended Pregnancies among Women in Urban Nigeria.尼日利亚城市女性避孕失败对意外怀孕影响的纵向多层分析
J Urban Health. 2024 Feb;101(1):193-204. doi: 10.1007/s11524-023-00819-7. Epub 2024 Jan 29.
8
Association of neighborhood recreational facilities and depressive symptoms among Chinese older adults.社区休闲设施与中国老年人抑郁症状的关联。
BMC Geriatr. 2023 Oct 17;23(1):667. doi: 10.1186/s12877-023-04369-0.
9
Fathers favour sons, mothers don't discriminate: Sex-biased parental care in northwestern Tanzania.父亲偏爱儿子,母亲一视同仁:坦桑尼亚西北部存在性别偏见的亲代抚育行为
Evol Hum Sci. 2019 Dec 4;1:e13. doi: 10.1017/ehs.2019.14. eCollection 2019.
10
Crossover of Engagement Among Academic Staff and Students During COVID-19.新冠疫情期间学术人员与学生互动的交叉情况。
Psychol Res Behav Manag. 2023 Aug 10;16:3121-3137. doi: 10.2147/PRBM.S416739. eCollection 2023.