Structured correlation in models for clustered data.

Author information

Chao Edward C

Affiliation

Insightful Corporation, 1700 Westlake Avenue N. Suite 500, Seattle, WA 98109, USA.

Publication information

Stat Med. 2006 Jul 30;25(14):2450-68. doi: 10.1002/sim.2368.

DOI: 10.1002/sim.2368
PMID: 16220520

Abstract

Correlation is always a concern in the analysis of clustered data. One area of interest is the development of a general correlation modelling approach for high-dimensional data with unbalanced, hierarchical and heterogeneous structures, e.g. multilevel data. Commonly used correlation structures may have limitations in such situations. In this paper, we propose two extensions, multiblock and multilayer correlations. These methods are very flexible in modelling correlation and can be incorporated into many multivariate approaches, although the discussion focuses mainly on applications under generalized estimating equations (GEE) methods. The approaches are especially useful in GEE when each cluster is large and complex but the number of clusters is small. If an incorrect correlation structure is applied to such data, the resulting estimates are less efficient. Multiblock and multilayer correlations extend GEE methods to model complicated multilevel data with an arbitrary number of levels and arbitrary cluster sizes. The extended estimating equation for the correlation parameters has an orthogonality property, and the computation is very efficient. A simulation study compares the conventional methods with the proposed methods, and shows the gain in relative efficiency and the flexibility in modelling various structures.
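
The abstract does not spell out how a multilayer correlation is parameterized, so the following is only a rough illustration of the general idea of a level-dependent working correlation, not the paper's construction. It assumes a nested exchangeable structure in which the correlation between two observations in the same cluster depends on the deepest sublevel they share. The function name `nested_exchangeable_corr`, the label array `sub_ids`, and the parameter vector `rhos` are all hypothetical names introduced for this sketch.

```python
import numpy as np

def nested_exchangeable_corr(sub_ids, rhos):
    """Working correlation matrix for ONE cluster (illustrative assumption).

    sub_ids : (n, k) array of nested sublevel labels, outermost first.
    rhos    : k + 1 correlations; rhos[d] applies to pairs that agree on
              exactly the first d sublevels (d = 0: same cluster only).
    """
    sub_ids = np.asarray(sub_ids)
    if sub_ids.ndim == 1:
        sub_ids = sub_ids[:, None]
    n, k = sub_ids.shape
    rhos = np.asarray(rhos, dtype=float)
    if rhos.shape != (k + 1,):
        raise ValueError("rhos must have one entry per sublevel plus one")

    # same[i, j, l] is True when observations i and j share the label at level l
    same = sub_ids[:, None, :] == sub_ids[None, :, :]
    # Nesting: a shared inner level only counts if all outer levels are shared too
    shared = np.logical_and.accumulate(same, axis=2)
    depth = shared.sum(axis=2)          # integer in 0..k for every pair

    corr = rhos[depth]                  # look up the correlation by shared depth
    np.fill_diagonal(corr, 1.0)
    return corr

# Four observations in one cluster: two sublevels (e.g. clinic, then patient)
labels = [[0, 0], [0, 0], [0, 1], [1, 2]]
R = nested_exchangeable_corr(labels, rhos=[0.1, 0.3, 0.6])
print(R)
```

With non-negative correlations that increase with depth, as in this example, the matrix corresponds to a sum of random-effect variance components and is therefore positive definite. A matrix of this kind could serve as the working correlation in a GEE fit, and the unbalanced case is handled simply by passing whatever labels each cluster has.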

Similar articles

1. Structured correlation in models for clustered data.
   Stat Med. 2006 Jul 30;25(14):2450-68. doi: 10.1002/sim.2368.
2. Efficiency of regression estimates for clustered data.
   Biometrics. 1996 Jun;52(2):500-11.
3. Inference for marginal linear models for clustered longitudinal data with potentially informative cluster sizes.
   Stat Methods Med Res. 2011 Aug;20(4):347-67. doi: 10.1177/0962280209347043. Epub 2010 Mar 11.
4. Properties of analysis methods that account for clustering in volume-outcome studies when the primary predictor is cluster size.
   Stat Med. 2007 Apr 30;26(9):2017-35. doi: 10.1002/sim.2657.
5. The application of multilevel, multivariate modelling to orthodontic research data.
   Community Dent Health. 2000 Dec;17(4):236-42.
6. A caveat concerning independence estimating equations with multivariate binary data.
   Biometrics. 1995 Mar;51(1):309-17.
7. Generalization of the Mantel-Haenszel estimating function for sparse clustered binary data.
   Biometrics. 2005 Dec;61(4):973-81. doi: 10.1111/j.1541-0420.2005.00362.x.
8. Variance estimation for clustered recurrent event data with a small number of clusters.
   Stat Med. 2005 Oct 15;24(19):3037-51. doi: 10.1002/sim.2157.
9. Comparing methods of analysing datasets with small clusters: case studies using four paediatric datasets.
   Paediatr Perinat Epidemiol. 2009 Jul;23(4):380-92. doi: 10.1111/j.1365-3016.2009.01046.x.
10. Modelling and analysing exchangeable binary data with random cluster sizes.
   Stat Med. 2003 Aug 15;22(15):2401-16. doi: 10.1002/sim.1527.

Cited by

1. Modeling Factors Associated with Dialysis Adequacy Using Longitudinal Data Analysis: Generalized Estimating Equation Versus Quadratic Inference Function.
   J Res Health Sci. 2023 Jun;23(2):e00582. doi: 10.34172/jrhs.2023.117.
2. Disseminating, implementing, and evaluating patient-centered outcomes to improve cardiovascular care using a stepped-wedge design: healthy hearts for Oklahoma.
   BMC Health Serv Res. 2018 Jun 4;18(1):404. doi: 10.1186/s12913-018-3189-4.
3. Can the buck always be passed to the highest level of clustering?
   BMC Med Res Methodol. 2016 Mar 8;16:29. doi: 10.1186/s12874-016-0127-1.
4. Optimal combination of estimating equations in the analysis of multilevel nested correlated data.
   Stat Med. 2010 Feb 20;29(4):464-73. doi: 10.1002/sim.3776.
5. Using second-order generalized estimating equations to model heterogeneous intraclass correlation in cluster-randomized trials.
   Stat Med. 2009 Feb 28;28(5):814-27. doi: 10.1002/sim.3518.
6. Payer leverage and hospital compliance with a benchmark: a population-based observational study.
   BMC Health Serv Res. 2007 Jul 18;7:112. doi: 10.1186/1472-6963-7-112.