Department of Surgery, Surgical Outcomes and Quality Improvement Center, Feinberg School of Medicine, Northwestern University, Chicago, IL.
Department of Statistics, Department of Psychology, Department of Medical Social Sciences, School of Education and Social Policy, Institute for Policy Research, Northwestern University, Evanston, IL.
Health Serv Res. 2018 Aug;53(4):2567-2590. doi: 10.1111/1475-6773.12752. Epub 2017 Aug 21.
OBJECTIVE/STUDY QUESTION: To estimate and compare sample average treatment effects (SATE) and population average treatment effects (PATE) of a resident duty hour policy change on patient and resident outcomes using data from the Flexibility in Duty Hour Requirements for Surgical Trainees Trial ("FIRST Trial").
DATA SOURCES/STUDY SETTING: Secondary data from the National Surgical Quality Improvement Program and the FIRST Trial (2014-2015).
The FIRST Trial was a cluster-randomized pragmatic noninferiority trial designed to evaluate the effects of a resident work hour policy change to permit greater flexibility in scheduling on patient and resident outcomes. We estimated hierarchical logistic regression models to estimate the SATE of a policy change on outcomes within an intent-to-treat framework. Propensity score-based poststratification was used to estimate PATE.
DATA COLLECTION/EXTRACTION METHODS: This study was a secondary analysis of previously collected data.
Although SATE estimates suggested noninferiority of outcomes under flexible duty hour policy versus standard policy, the noninferiority of a policy change was inconclusively noninferior based on PATE estimates due to imprecision.
Propensity score-based poststratification can be valuable tools to address trial generalizability but may yield imprecise estimates of PATE when sparse strata exist.
目的/研究问题:利用 Flexibility in Duty Hour Requirements for Surgical Trainees Trial(“FIRST 试验”)的数据,评估和比较居民值班时间政策变化对患者和居民结局的治疗效果。
数据来源/研究环境:国家手术质量改进计划和 FIRST 试验的二次数据(2014-2015 年)。
FIRST 试验是一项集群随机实用非劣效性试验,旨在评估改变居民工作时间政策以允许更灵活的排班对患者和居民结局的影响。我们估计了分层逻辑回归模型,以在意向治疗框架内估计政策变化对结局的治疗效果。基于倾向评分的后分层用于估计 PATE。
数据收集/提取方法:这是对先前收集的数据进行的二次分析。
尽管 SATE 估计表明灵活值班时间政策下的结局不劣于标准政策,但由于不精确,基于 PATE 估计,政策变化的非劣效性不确定。
基于倾向评分的后分层是解决试验可推广性的有价值工具,但当稀疏分层存在时,可能会产生 PATE 的不精确估计。