Analyzing clustered continuous response variables with ordinal regression models.

Author affiliations

Department of Biostatistics, Vanderbilt University, Nashville, Tennessee, USA.

Department of Population and Public Health Sciences, University of Southern California, Los Angeles, California, USA.

Publication information

Biometrics. 2023 Dec;79(4):3764-3777. doi: 10.1111/biom.13904. Epub 2023 Jul 17.

DOI:10.1111/biom.13904

PMID:37459181

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10792095/

Abstract

Continuous response data are regularly transformed to meet regression modeling assumptions. However, approaches taken to identify the appropriate transformation can be ad hoc and can increase model uncertainty. Further, the resulting transformations often vary across studies, leading to difficulties with synthesizing and interpreting results. When a continuous response variable is measured repeatedly within individuals or when continuous responses arise from clusters, analyses have the additional challenge caused by within-individual or within-cluster correlations. We extend a widely used ordinal regression model, the cumulative probability model (CPM), to fit clustered, continuous response data using generalized estimating equations for ordinal responses. With the proposed approach, estimates of marginal model parameters, cumulative distribution functions, expectations, and quantiles conditional on covariates can be obtained without pretransformation of the response data. While computational challenges arise with large numbers of distinct values of the continuous response variable, we propose feasible and computationally efficient approaches to fit CPMs under commonly used working correlation structures. We study finite sample operating characteristics of the estimators via simulation and illustrate their implementation with two data examples. One studies predictors of CD4:CD8 ratios in a cohort living with HIV, and the other investigates the association of a single nucleotide polymorphism and lung function decline in a cohort with early chronic obstructive pulmonary disease.
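
To make concrete how a CPM yields conditional distribution functions, expectations, and quantiles without transforming the response, here is a minimal Python sketch. It assumes a logit link and the sign convention logit P(Y <= y_j | x) = alpha_j - beta'x, which may differ from the paper's notation. The intercepts, coefficient, and response values are made-up illustrative numbers, and the GEE estimation step itself is not shown.

```python
import math


def expit(z):
    """Inverse logit."""
    return 1.0 / (1.0 + math.exp(-z))


def cpm_cdf(alphas, beta, x):
    """Conditional CDF at each distinct response value.

    Assumed model: logit P(Y <= y_j | x) = alpha_j - beta . x for all but
    the largest distinct value, where the CDF equals 1 by definition.
    """
    lp = sum(b * xi for b, xi in zip(beta, x))
    return [expit(a - lp) for a in alphas] + [1.0]


def cpm_mean(ys, cdf):
    """Conditional expectation: sum of y_j times the CDF jump at y_j."""
    probs = [cdf[0]] + [cdf[j] - cdf[j - 1] for j in range(1, len(cdf))]
    return sum(y * p for y, p in zip(ys, probs))


def cpm_quantile(ys, cdf, q):
    """Conditional q-th quantile: smallest y_j with CDF >= q."""
    for y, prob in zip(ys, cdf):
        if prob >= q:
            return y
    return ys[-1]


# Hypothetical fitted quantities (not taken from the paper).
ys = [0.2, 0.5, 0.9, 1.4]    # sorted distinct observed response values
alphas = [-1.0, 0.0, 1.5]    # one increasing intercept per value except the last
beta = [0.8]                 # single covariate effect

cdf = cpm_cdf(alphas, beta, [1.0])
print("CDF:", cdf)
print("mean:", cpm_mean(ys, cdf))
print("median:", cpm_quantile(ys, cdf, 0.5))
```

Because there is one intercept per distinct response value, the number of parameters grows with the number of distinct values, which is the source of the computational burden the abstract mentions.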
