Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
Hum Reprod. 2019 Mar 1;34(3):549-557. doi: 10.1093/humrep/dey374.
Which methodological approaches are most appropriate for analyzing IVF data with multiple cycles in the context of a binary outcome?
Both mixed effect models and generalized estimating equation (GEE) modeling approaches can account for multiple IVF cycles and may reduce bias over first-cycle only approaches, but CIs were narrowest with cluster-weighted generalized estimating equation models (CWGEE).
There is a lack of consensus among investigators regarding how to best incorporate data from multiple cycles and whether to present odds or risks in the analysis of IVF data. Failure to account for correlated outcomes within individuals and informative cluster size may lead to invalid CIs and biased estimates.
STUDY DESIGN, SIZE, DURATION: The Environment and Reproductive Health (EARTH) Study is an ongoing prospective cohort study of subfertile couples conducted at an academic medical center. This cohort was established in 2004 and follows couples seeking treatment for infertility throughout the course of their treatment and pregnancy.
PARTICIPANTS/MATERIALS, SETTING, METHODS: Women aged 18-46 years enrolled in the EARTH Study from 2004 to 2017 who initiated at least one IVF cycle were eligible. Cycle initiation was defined as beginning ovulation induction with the intent to progress through an IVF or ICSI cycle. This analysis included 442 women undergoing 642 cycles who met the study inclusion criteria. We compared the results and interpretations of log-binomial and logistic models restricting to the first cycle, as well as mixed effects models, unweighted GEE models, and CWGEE models including all cycles. This analysis was conducted for two distinct exposures: maternal age at cycle initiation, and maternal preconception urinary concentrations of di(2-ethylhexyl) phthalate (DEHP) metabolites (previously reported to be associated with a decreased probability of live birth).
In general, the CIs were widest for mixed effects models and narrowest for CWGEE models. Further, in models evaluating the sum of urinary concentrations of DEHP metabolites (∑DEHP, available for 91% of women), the point estimates were surprisingly different between the first-cycle and multiple-cycle models. We observed significant associations between maternal age and live birth in all models. However, we observed no associations between ∑DEHP and live birth.
LIMITATIONS, REASONS FOR CAUTION: This analysis was limited to an example dataset in which the true effect of any exposure is unknown. While this allows us to observe model performance in the context of real data, future analyses should be conducted within simulated datasets under various assumptions to further evaluate the appropriateness of each approach. In addition, we did not address differential loss to follow-up in our statistical approaches.
The use of CWGEE models should be more widely considered in the analysis of IVF data with multiple cycles per woman. The CWGEE approach is computationally simple, addresses non-ignorable (informative) cluster size, and is robust against mis-specification of the underlying covariance structure. Among the methods compared in this analysis, CWGEE models generally yielded the narrowest CIs, possibly indicating the most precise estimates. We also stress the importance of estimating risks rather than odds in the analysis of IVF data.
STUDY FUNDING/COMPETING INTEREST(S): The project was funded by Grants (R01ES022955, R01ES009718, and P30ES000002) from the National Institutes of Health. None of the authors has any conflicts of interest to declare.
对于二元结局的 IVF 数据,哪种方法最适合分析?
混合效应模型和广义估计方程(GEE)模型都可以考虑多个 IVF 周期,并且可以减少仅第一周期方法的偏差,但聚类加权广义估计方程模型(CWGEE)的置信区间最窄。
研究人员在如何最好地整合多个周期的数据以及在分析 IVF 数据时是否呈现 odds 或 risks 方面存在分歧。未能在个体内考虑相关结局和信息丰富的聚类大小可能导致无效的置信区间和有偏差的估计。
研究设计、大小和持续时间:环境与生殖健康(EARTH)研究是一项正在进行的前瞻性队列研究,针对在学术医疗中心就诊的不育夫妇。该队列成立于 2004 年,在夫妇接受不孕症治疗的整个过程中以及怀孕过程中对他们进行随访。
参与者/材料、设置、方法:2004 年至 2017 年期间在 EARTH 研究中登记并至少开始了一次 IVF 周期的年龄在 18-46 岁的女性符合入选标准。周期启动定义为开始排卵诱导,目的是进行 IVF 或 ICSI 周期。这项分析包括 442 名接受 642 个周期的女性,她们符合研究纳入标准。我们比较了将第一周期限制为对数二项式和逻辑模型的结果和解释,以及混合效应模型、未加权 GEE 模型和包括所有周期的 CWGEE 模型。这项分析针对两个不同的暴露因素进行:母亲在周期启动时的年龄和母亲在受孕前尿液中邻苯二甲酸二(2-乙基己基)酯(DEHP)代谢物的浓度(先前报道与活产概率降低有关)。
一般来说,混合效应模型的置信区间最宽,CWGEE 模型的置信区间最窄。此外,在评估 DEHP 代谢物尿液浓度总和(∑DEHP,可用于 91%的女性)的模型中,第一周期和多周期模型之间的点估计值差异惊人。我们在所有模型中都观察到母亲年龄与活产之间存在显著关联。然而,我们没有观察到∑DEHP 与活产之间的关联。
局限性、谨慎的原因:这项分析仅限于真实数据中未知任何暴露因素的真实效果的示例数据集。虽然这使我们能够在真实数据的背景下观察模型性能,但未来的分析应在各种假设下在模拟数据集中进行,以进一步评估每种方法的适当性。此外,我们在统计方法中没有解决随访的差异丢失。
在分析每个女性多个周期的 IVF 数据时,应更广泛地考虑使用 CWGEE 模型。CWGEE 方法计算简单,解决了不可忽略的(信息丰富的)聚类大小问题,并且对底层协方差结构的错误指定具有稳健性。在本分析中比较的方法中,CWGEE 模型通常产生最窄的置信区间,这可能表明最精确的估计。我们还强调了在分析 IVF 数据时估计风险而不是 odds 的重要性。
研究资金/利益冲突:该项目由美国国立卫生研究院的拨款(R01ES022955、R01ES009718 和 P30ES000002)资助。作者均无利益冲突。