在群组相关数据环境下的病例对照研究分析。

On the Analysis of Case-Control Studies in Cluster-correlated Data Settings.

机构信息

From the Harvard T.H. Chan School of Public Health, Boston, MA.

出版信息

Epidemiology. 2018 Jan;29(1):50-57. doi: 10.1097/EDE.0000000000000763.

DOI:10.1097/EDE.0000000000000763

PMID:29068840

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5718962/

Abstract

In resource-limited settings, long-term evaluation of national antiretroviral treatment (ART) programs often relies on aggregated data, the analysis of which may be subject to ecological bias. As researchers and policy makers consider evaluating individual-level outcomes such as treatment adherence or mortality, the well-known case-control design is appealing in that it provides efficiency gains over random sampling. In the context that motivates this article, valid estimation and inference requires acknowledging any clustering, although, to our knowledge, no statistical methods have been published for the analysis of case-control data for which the underlying population exhibits clustering. Furthermore, in the specific context of an ongoing collaboration in Malawi, rather than performing case-control sampling across all clinics, case-control sampling within clinics has been suggested as a more practical strategy. To our knowledge, although similar outcome-dependent sampling schemes have been described in the literature, a case-control design specific to correlated data settings is new. In this article, we describe this design, discuss balanced versus unbalanced sampling techniques, and provide a general approach to analyzing case-control studies in cluster-correlated settings based on inverse probability-weighted generalized estimating equations. Inference is based on a robust sandwich estimator with correlation parameters estimated to ensure appropriate accounting of the outcome-dependent sampling scheme. We conduct comprehensive simulations, based in part on real data on a sample of N = 78,155 program registrants in Malawi between 2005 and 2007, to evaluate small-sample operating characteristics and potential trade-offs associated with standard case-control sampling or when case-control sampling is performed within clusters.

摘要

在资源有限的情况下，对国家抗逆转录病毒治疗 (ART) 项目的长期评估通常依赖于汇总数据，而对这些数据的分析可能存在生态偏差。随着研究人员和政策制定者考虑评估治疗依从性或死亡率等个体水平的结果，众所周知的病例对照设计在效率上优于随机抽样，因此具有吸引力。在本文所依据的背景下，有效估计和推断需要承认任何聚类现象，尽管据我们所知，对于基础人群存在聚类的病例对照数据，还没有发表过用于分析的统计方法。此外，在马拉维正在进行的合作的具体背景下，与在所有诊所进行病例对照抽样相比，建议在诊所内进行病例对照抽样，因为这是一种更实用的策略。据我们所知，尽管文献中已经描述了类似的基于结果的抽样方案，但针对相关数据设置的病例对照设计是新的。在本文中，我们描述了这种设计，讨论了平衡与非平衡抽样技术，并提供了一种基于逆概率加权广义估计方程分析聚类相关环境中病例对照研究的一般方法。推断基于稳健的三明治估计量，并估计相关参数，以确保适当考虑基于结果的抽样方案。我们进行了全面的模拟，部分基于 2005 年至 2007 年在马拉维的一个 N = 78155 名项目登记者样本的真实数据，以评估小样本的操作特征和与标准病例对照抽样相关的潜在权衡，或当病例对照抽样在聚类中进行时。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9d8/5718962/2e94369a70f9/nihms909428f1.jpg

相似文献

On the Analysis of Case-Control Studies in Cluster-correlated Data Settings.在群组相关数据环境下的病例对照研究分析。

Epidemiology. 2018 Jan;29(1):50-57. doi: 10.1097/EDE.0000000000000763.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

On the analysis of two-phase designs in cluster-correlated data settings.在群组相关数据环境下的两阶段设计分析。

Stat Med. 2019 Oct 15;38(23):4611-4624. doi: 10.1002/sim.8321. Epub 2019 Jul 29.

Strategies for monitoring and evaluation of resource-limited national antiretroviral therapy programs: the two-phase design.资源有限国家抗逆转录病毒治疗项目的监测与评估策略：两阶段设计

BMC Med Res Methodol. 2015 Apr 7;15:31. doi: 10.1186/s12874-015-0027-9.

Human resources requirements for highly active antiretroviral therapy scale-up in Malawi.马拉维扩大高效抗逆转录病毒治疗的人力资源需求。

BMC Health Serv Res. 2007 Dec 19;7:208. doi: 10.1186/1472-6963-7-208.

Optimal sampling allocation for outcome-dependent designs in cluster-correlated data settings.在聚类相关数据环境下，基于结局相关设计的最优抽样分配。

Stat Methods Med Res. 2022 Dec;31(12):2400-2414. doi: 10.1177/09622802221122423. Epub 2022 Aug 30.

Estimation of mortality among HIV-infected people on antiretroviral treatment in East Africa: a sampling based approach in an observational, multisite, cohort study.在东非接受抗逆转录病毒治疗的艾滋病毒感染者的死亡率估计：一项基于抽样的观察性多地点队列研究。

Lancet HIV. 2015 Mar;2(3):e107-16. doi: 10.1016/S2352-3018(15)00002-8. Epub 2015 Jan 28.

A readily available improvement over method of moments for intra-cluster correlation estimation in the context of cluster randomized trials and fitting a GEE-type marginal model for binary outcomes.在群组随机试验和拟合二项结局的 GEE 型边缘模型的背景下，一种现成的改进方法，可以用于估计群组内相关性。

Clin Trials. 2019 Feb;16(1):41-51. doi: 10.1177/1740774518803635. Epub 2018 Oct 8.

A nation-wide malaria knowledge, attitudes and practices survey in Malawi: objectives and methodology.马拉维全国疟疾知识、态度和实践调查：目标与方法

Trop Med Parasitol. 1994 Mar;45(1):54-6.

Are they really lost? "true" status and reasons for treatment discontinuation among HIV infected patients on antiretroviral therapy considered lost to follow up in Urban Malawi.他们真的失联了吗？马拉维城市中，抗逆转录病毒治疗中被认为失联的艾滋病毒感染者的“真实”状态和治疗中断原因。

PLoS One. 2013 Sep 26;8(9):e75761. doi: 10.1371/journal.pone.0075761. eCollection 2013.

引用本文的文献

Outcome-dependent sampling in cluster-correlated data settings with application to hospital profiling.聚类相关数据设置中基于结果的抽样及其在医院概况分析中的应用

J R Stat Soc Ser A Stat Soc. 2020 Jan;183(1):379-402. doi: 10.1111/rssa.12503. Epub 2019 Aug 29.

Sampling strategies to evaluate the prognostic value of a new biomarker on a time-to-event end-point.采样策略评估时间事件终点新生物标志物的预后价值。

BMC Med Res Methodol. 2021 Apr 30;21(1):93. doi: 10.1186/s12874-021-01283-0.

Small-sample inference for cluster-based outcome-dependent sampling schemes in resource-limited settings: Investigating low birthweight in Rwanda.在资源有限的情况下，基于群组的结果依赖抽样方案的小样本推断：以卢旺达的低出生体重为例。

Biometrics. 2022 Jun;78(2):701-715. doi: 10.1111/biom.13423. Epub 2021 Jan 28.

New Designs for New Epidemiology.新流行病学的新设计。

Epidemiology. 2018 Jan;29(1):76-77. doi: 10.1097/EDE.0000000000000768.

本文引用的文献

BMC Med Res Methodol. 2015 Apr 7;15:31. doi: 10.1186/s12874-015-0027-9.

On the analysis of hybrid designs that combine group- and individual-level data.关于结合组级和个体级数据的混合设计的分析。

Biometrics. 2015 Mar;71(1):227-236. doi: 10.1111/biom.12220. Epub 2014 Sep 22.

Outcome vector dependent sampling with longitudinal continuous response data: stratified sampling based on summary statistics.具有纵向连续响应数据的结果向量依赖抽样：基于汇总统计量的分层抽样。

Biometrics. 2013 Jun;69(2):405-16. doi: 10.1111/biom.12013. Epub 2013 Feb 14.

Outcome-dependent sampling for longitudinal binary response data based on a time-varying auxiliary variable.基于时变辅助变量的纵向二分类反应数据的依结局抽样。

Stat Med. 2012 Sep 28;31(22):2441-56. doi: 10.1002/sim.4359. Epub 2011 Nov 16.

Designs for the combination of group- and individual-level data.群组数据和个体数据联合设计。

Epidemiology. 2011 May;22(3):382-9. doi: 10.1097/EDE.0b013e3182125cff.

Providing universal access to antiretroviral therapy in Thyolo, Malawi through task shifting and decentralization of HIV/AIDS care.在马拉维蒂约洛，通过任务转移和艾滋病毒/艾滋病护理的权力下放，为所有人提供抗逆转录病毒治疗。

Trop Med Int Health. 2010 Dec;15(12):1413-20. doi: 10.1111/j.1365-3156.2010.02649.x. Epub 2010 Oct 19.

Using touchscreen electronic medical record systems to support and monitor national scale-up of antiretroviral therapy in Malawi.利用触屏式电子病历系统支持并监测马拉维国家范围扩大抗逆转录病毒疗法。

PLoS Med. 2010 Aug 10;7(8):e1000319. doi: 10.1371/journal.pmed.1000319.

The Combination of Ecological and Case-Control Data.生态数据与病例对照数据的结合

J R Stat Soc Series B Stat Methodol. 2008 Feb 1;70(1):73-93. doi: 10.1111/j.1467-9868.2007.00628.x.

On outcome-dependent sampling designs for longitudinal binary response data with time-varying covariates.关于具有时变协变量的纵向二元响应数据的基于结果的抽样设计。

Biostatistics. 2008 Oct;9(4):735-49. doi: 10.1093/biostatistics/kxn006. Epub 2008 Mar 27.

Ecologic studies revisited.再谈生态学研究。

Annu Rev Public Health. 2008;29:75-90. doi: 10.1146/annurev.publhealth.29.020907.090821.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验