• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

纵向二元数据的惩罚联合广义估计方程

Penalized joint generalized estimating equations for longitudinal binary data.

作者信息

Huang Youjun, Pan Jianxin

机构信息

Mathematical College, Sichuan University, Chengdu, P. R. China.

Department of Mathematics, The University of Manchester, Manchester, UK.

出版信息

Biom J. 2022 Jan;64(1):57-73. doi: 10.1002/bimj.202000336. Epub 2021 Sep 29.

DOI:10.1002/bimj.202000336
PMID:34587284
Abstract

In statistical research, variable selection and feature extraction are a typical issue. Variable selection in linear models has been fully developed, while it has received relatively little attention for longitudinal data. Since a longitudinal study involves within-subject correlations, the likelihood function of discrete longitudinal responses generally cannot be expressed in analytically closed form, and standard variable selection methods cannot be directly applied. As an alternative, the penalized generalized estimating equation (PGEE) is helpful but very likely results in incorrect variable selection if the working correlation matrix is misspecified. In many circumstances, the within-subject correlations are of interest and need to be modeled together with the mean. For longitudinal binary data, it becomes more challenging because the within-subject correlation coefficients have the so-called Fréchet-Hoeffding upper bound. In this paper, we proposed smoothly clipped absolute deviation (SCAD)-based and least absolute shrinkage and selection operator (LASSO)-based penalized joint generalized estimating equation (PJGEE) methods to simultaneously model the mean and correlations for longitudinal binary data, together with variable selection in the mean model. The estimated correlation coefficients satisfy the upper bound constraints. Simulation studies under different scenarios are made to assess the performance of the proposed method. Compared to existing PGEE methods that specify a working correlation matrix for longitudinal binary data, the proposed PJGEE method works much better in terms of variable selection consistency and parameter estimation accuracy. A real data set on Clinical Global Impression is analyzed for illustration.

摘要

在统计研究中,变量选择和特征提取是一个典型问题。线性模型中的变量选择已经得到充分发展,而对于纵向数据的变量选择却相对较少受到关注。由于纵向研究涉及个体内部的相关性,离散纵向响应的似然函数通常无法以解析封闭形式表示,标准的变量选择方法也不能直接应用。作为一种替代方法,惩罚广义估计方程(PGEE)是有帮助的,但如果工作相关矩阵指定错误,很可能导致错误的变量选择。在许多情况下,个体内部的相关性是令人感兴趣的,需要与均值一起进行建模。对于纵向二元数据,这变得更具挑战性,因为个体内部的相关系数具有所谓的弗雷歇 - 霍夫丁上界。在本文中,我们提出了基于平滑截断绝对偏差(SCAD)和基于最小绝对收缩与选择算子(LASSO)的惩罚联合广义估计方程(PJGEE)方法,用于同时对纵向二元数据的均值和相关性进行建模,以及均值模型中的变量选择。估计的相关系数满足上界约束。我们进行了不同场景下的模拟研究,以评估所提出方法的性能。与为纵向二元数据指定工作相关矩阵的现有PGEE方法相比,所提出的PJGEE方法在变量选择一致性和参数估计准确性方面表现得更好。我们分析了一个关于临床总体印象的真实数据集作为例证。

相似文献

1
Penalized joint generalized estimating equations for longitudinal binary data.纵向二元数据的惩罚联合广义估计方程
Biom J. 2022 Jan;64(1):57-73. doi: 10.1002/bimj.202000336. Epub 2021 Sep 29.
2
Variable selection for binary spatial regression: Penalized quasi-likelihood approach.二元空间回归的变量选择:惩罚拟似然方法。
Biometrics. 2016 Dec;72(4):1164-1172. doi: 10.1111/biom.12525. Epub 2016 Apr 8.
3
Penalized generalized estimating equations for high-dimensional longitudinal data analysis.用于高维纵向数据分析的惩罚广义估计方程
Biometrics. 2012 Jun;68(2):353-60. doi: 10.1111/j.1541-0420.2011.01678.x. Epub 2011 Sep 28.
4
Variable selection via penalized generalized estimating equations for a marginal survival model.基于边际生存模型的惩罚广义估计方程进行变量选择
Stat Methods Med Res. 2020 Sep;29(9):2493-2506. doi: 10.1177/0962280220901728. Epub 2020 Jan 29.
5
A comparison of bias-adjusted generalized estimating equations for sparse binary data in small-sample longitudinal studies.在小样本纵向研究中,稀疏二项数据的偏差调整广义估计方程比较。
Stat Med. 2023 Jul 10;42(15):2711-2727. doi: 10.1002/sim.9744. Epub 2023 Apr 16.
6
Penalized variable selection for accelerated failure time models with random effects.具有随机效应的加速失效时间模型的惩罚变量选择。
Stat Med. 2019 Feb 28;38(5):878-892. doi: 10.1002/sim.8023. Epub 2018 Nov 8.
7
A readily available improvement over method of moments for intra-cluster correlation estimation in the context of cluster randomized trials and fitting a GEE-type marginal model for binary outcomes.在群组随机试验和拟合二项结局的 GEE 型边缘模型的背景下,一种现成的改进方法,可以用于估计群组内相关性。
Clin Trials. 2019 Feb;16(1):41-51. doi: 10.1177/1740774518803635. Epub 2018 Oct 8.
8
Variable selection for longitudinal zero-inflated power series transition model.纵向零膨胀幂级数转移模型的变量选择。
J Biopharm Stat. 2021 Sep 3;31(5):668-685. doi: 10.1080/10543406.2021.1944177. Epub 2021 Jul 30.
9
Fixed and random effects selection in mixed effects models.混合效应模型中的固定效应和随机效应选择
Biometrics. 2011 Jun;67(2):495-503. doi: 10.1111/j.1541-0420.2010.01463.x. Epub 2010 Jul 21.
10
Correlation structure and variable selection in generalized estimating equations via composite likelihood information criteria.基于复合似然信息准则的广义估计方程中的相关结构与变量选择
Stat Med. 2016 Jun 30;35(14):2377-90. doi: 10.1002/sim.6871. Epub 2016 Jan 28.

引用本文的文献

1
Evaluate effects of the National Essential Public Health Service Program on hypertension control of Chinese community-dwelling people during the COVID-19 epidemic: a population-based multi-centre retrospective longitudinal study.评估国家基本公共卫生服务项目在新型冠状病毒肺炎疫情期间对中国社区居民高血压控制的影响:一项基于人群的多中心回顾性纵向研究。
BMC Prim Care. 2025 Jul 16;26(1):227. doi: 10.1186/s12875-025-02927-6.
2
Impact of the National Essential Public Health Service Package on Blood Pressure Control in Chinese People With Hypertension: Retrospective Population-Based Longitudinal Study.国家基本公共卫生服务包对中国高血压患者血压控制的影响:基于人群的回顾性纵向研究
JMIR Public Health Surveill. 2025 Feb 6;11:e65783. doi: 10.2196/65783.