• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

回顾性抽样对聚类数据二元回归模型的影响。

The effect of retrospective sampling on binary regression models for clustered data.

作者信息

Neuhaus J M, Jewell N P

机构信息

Department of Epidemiology, University of California, San Francisco, 94143-0560.

出版信息

Biometrics. 1990 Dec;46(4):977-90.

PMID:2085642
Abstract

Recently a great deal of attention has been given to binary regression models for clustered or correlated observations. The data of interest are of the form of a binary dependent or response variable, together with independent variables X1,...., Xk, where sets of observations are grouped together into clusters. A number of models and methods of analysis have been suggested to study such data. Many of these are extensions in some way of the familiar logistic regression model for binary data that are not grouped (i.e., each cluster is of size 1). In general, the analyses of these clustered data models proceed by assuming that the observed clusters are a simple random sample of clusters selected from a population of clusters. In this paper, we consider the application of these procedures to the case where the clusters are selected randomly in a manner that depends on the pattern of responses in the cluster. For example, we show that ignoring the retrospective nature of the sample design, by fitting standard logistic regression models for clustered binary data, may result in misleading estimates of the effects of covariates and the precision of estimated regression coefficients.

摘要

最近,对于聚类或相关观测的二元回归模型给予了大量关注。感兴趣的数据形式为二元因变量或响应变量,以及自变量X1,…,Xk,其中观测集被分组为聚类。已经提出了许多模型和分析方法来研究此类数据。其中许多在某种程度上是针对未分组的二元数据(即每个聚类大小为1)的熟悉逻辑回归模型的扩展。一般来说,这些聚类数据模型的分析是通过假设观测到的聚类是从聚类总体中选取的聚类的简单随机样本进行的。在本文中,我们考虑将这些程序应用于聚类以依赖于聚类中响应模式的方式随机选择的情况。例如,我们表明,通过为聚类二元数据拟合标准逻辑回归模型而忽略样本设计的回顾性性质,可能会导致协变量效应的误导性估计以及估计回归系数的精度。

相似文献

1
The effect of retrospective sampling on binary regression models for clustered data.回顾性抽样对聚类数据二元回归模型的影响。
Biometrics. 1990 Dec;46(4):977-90.
2
Multivariate methods for clustered binary data with multiple subclasses, with application to binary longitudinal data.适用于具有多个子类别的聚类二元数据的多变量方法及其在二元纵向数据中的应用。
Biometrics. 1992 Sep;48(3):721-31.
3
Random effects probit and logistic regression models for three-level data.用于三级数据的随机效应概率单位和逻辑回归模型。
Biometrics. 1997 Dec;53(4):1527-37.
4
A caveat concerning independence estimating equations with multivariate binary data.关于具有多元二元数据的独立性估计方程的一项注意事项。
Biometrics. 1995 Mar;51(1):309-17.
5
Regression models for clustered binary responses: implications of ignoring the intracluster correlation in an analysis of perinatal mortality in twin gestations.聚类二元反应的回归模型:在双胎妊娠围产期死亡率分析中忽略集群内相关性的影响。
Ann Epidemiol. 2005 Apr;15(4):293-301. doi: 10.1016/j.annepidem.2004.08.007.
6
Comparison of statistical methods for analysis of clustered binary observations.用于分析聚类二元观测值的统计方法比较。
Stat Med. 2005 Mar 30;24(6):911-23. doi: 10.1002/sim.1958.
7
Improved procedures for estimation of disease prevalence using ranked set sampling.使用排序集抽样估计疾病患病率的改进程序。
Biom J. 2007 Aug;49(4):530-8. doi: 10.1002/bimj.200610302.
8
Logistic regression for clustered binary data in proband studies with application to familial aggregation of sleep disorders.先证者研究中聚类二元数据的逻辑回归及其在睡眠障碍家族聚集性研究中的应用
Biometrics. 1991 Dec;47(4):1257-65.
9
[Meta-analysis of the Italian studies on short-term effects of air pollution].[意大利关于空气污染短期影响研究的荟萃分析]
Epidemiol Prev. 2001 Mar-Apr;25(2 Suppl):1-71.
10
Performance of a mixed effects logistic regression model for binary outcomes with unequal cluster size.用于不等聚类大小的二元结局的混合效应逻辑回归模型的性能
J Biopharm Stat. 2005;15(3):513-26. doi: 10.1081/BIP-200056554.

引用本文的文献

1
Efficient designs and analysis of two-phase studies with longitudinal binary data.具有纵向二分类数据的两阶段研究的高效设计和分析。
Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujad010.
2
Outcome-dependent sampling in cluster-correlated data settings with application to hospital profiling.聚类相关数据设置中基于结果的抽样及其在医院概况分析中的应用
J R Stat Soc Ser A Stat Soc. 2020 Jan;183(1):379-402. doi: 10.1111/rssa.12503. Epub 2019 Aug 29.
3
The Role of Conditional Likelihoods in Latent Variable Modeling.条件似然在潜变量建模中的作用。
Psychometrika. 2022 Sep;87(3):799-834. doi: 10.1007/s11336-021-09816-8. Epub 2022 Jan 10.
4
A note on bias due to fitting prospective multivariate generalized linear models to categorical outcomes ignoring retrospective sampling schemes.关于在忽略回顾性抽样方案的情况下,将前瞻性多变量广义线性模型应用于分类结果时产生偏差的说明。
J Multivar Anal. 2009 Mar;100(3):459-472. doi: 10.1016/j.jmva.2008.05.011. Epub 2008 Jun 7.
5
On the analysis of two-phase designs in cluster-correlated data settings.在群组相关数据环境下的两阶段设计分析。
Stat Med. 2019 Oct 15;38(23):4611-4624. doi: 10.1002/sim.8321. Epub 2019 Jul 29.
6
On the Analysis of Case-Control Studies in Cluster-correlated Data Settings.在群组相关数据环境下的病例对照研究分析。
Epidemiology. 2018 Jan;29(1):50-57. doi: 10.1097/EDE.0000000000000763.
7
Approaches to retrospective sampling for longitudinal transition regression models.纵向转换回归模型的回顾性抽样方法。
Stat Interface. 2014;7(1):75-85. doi: 10.4310/SII.2014.v7.n1.a9.
8
Using Retrospective Sampling to Estimate Models of Relationship Status in Large Longitudinal Social Networks.利用回顾性抽样来估计大型纵向社交网络中的关系状态模型。
Comput Stat Data Anal. 2015 Feb 1;82:35-46. doi: 10.1016/j.csda.2014.08.001.
9
On the validity of within-nuclear-family genetic association analysis in samples of extended families.关于大家庭样本中核家庭内基因关联分析的有效性
Stat Appl Genet Mol Biol. 2015 Dec;14(6):533-49. doi: 10.1515/sagmb-2015-0056.
10
Likelihood-based analysis of longitudinal data from outcome-related sampling designs.基于似然法对来自与结果相关抽样设计的纵向数据进行分析。
Biometrics. 2014 Mar;70(1):44-52. doi: 10.1111/biom.12108. Epub 2013 Nov 21.