Chesang Caroline, Sharples Linda D, Gray Christen M, Nossiter Julie, van der Meulen Jan, Cowling Thomas E, Keogh Ruth H
Department of Medical Statistics, London School of Hygiene and Tropical Medicine, Keppel Street, WC1E 7HT, London, UK.
Department of Medical Statistics, London School of Hygiene and Tropical Medicine, Keppel Street, WC1E 7HT, London, UK.
J Clin Epidemiol. 2025 Mar 25;182:111767. doi: 10.1016/j.jclinepi.2025.111767.
If randomized controlled trials can be successfully emulated using real-world data (RWD), confidence in the validity of RWD for estimating treatment effects for questions that have not been assessed in trials increases. We used routinely collected administrative and clinical national linked datasets from England to emulate the PR07 trial for high-risk prostate cancer patients, which compared the effects of radiotherapy added to hormone therapy (RT+HT) within 8 weeks of randomization (the "grace period") and hormone therapy (HT) only on all-cause mortality. We highlight methodological choices required and challenges encountered in emulating this trial.
Patients diagnosed with prostate cancer from 2014 to 2020 were identified from the routine national linked datasets. Diagnosis was taken as the time zero. As few patients initiated radiotherapy within 8 weeks of diagnosis, we considered target trials with grace periods of 4-6 months. Estimands of interest were hazard ratios (HRs) and survival probabilities over 7 years. The cloning-censoring-and-weighting (CCW) approach was used to control for measured confounding and to allow for the grace period. We also used an extension (the "landmark-CCW" approach), in which we consider several time-origins post-diagnosis, enabling us to use a grace period of 8 weeks as in PR07.
A total of 2,690 patients were eligible for inclusion in the emulated trial. The CCW analysis using a grace period of 6 months gave an estimated HR of 0.48 (95% confidence interval [CI]: 0.34-0.60) and 7-year survival estimates of 80.7% (95% CI: 74.3-87.0) for the RT+HT strategy and 65.6% (95% CI: 62.8-68.1) for HT only strategy, and corresponding risk difference of 15.1% (95% CI: 11.5-18.9). The corresponding HR from the landmark-CCW approach was 0.58 (95% CI: 0.51-0.65) and with survival estimates of 80.7% (95% CI: 77.7-83.8) for RT+HT strategy and 69.8% (95% CI: 68.2-71.4) for the HT only strategy, and a risk difference of 10.9% (95% CI: 6.3-15.9).
Our findings from the emulated trial using RWD are broadly consistent with those from PR07, with RT+HT estimated to result in better survival compared to HT only. However, the findings were not replicated exactly, with PR07 reporting an HR of 0.77 (95% CI: 0.61-0.98) over 7 years of follow-up. Differences may be in part due to challenges in defining time zero and allowing for a treatment grace period of the same duration as in PR07. Our study considered ways in which these challenges can be addressed, and our findings affirm the utility of RWD for estimating treatment effects.
Clinical trials are the best way to test whether treatments work, but they are expensive, take years to complete, and focus on narrow research questions. If we can use real-world data (RWD) such as patient health records to mimic these trials, we may be able to answer additional medical questions relevant to patients who are prescribed these medications. This study aimed to see if we could recreate the results of a past clinical trial (PR07) using national health data from England. The PR07 trial looked at two treatments for high-risk prostate cancer: hormone therapy (HT) alone and radiotherapy added to hormone therapy (RT+HT) within 8 weeks of starting the study. This trial was chosen for several reasons. It included high-risk patients who were studied for up to 7 years, meaning that there was enough information to study survival outcomes. Being a major UK-based trial, it was useful for comparing with the data recorded in UK national health data. The trial also allowed some flexibility in when treatment started, which was an interesting factor to examine. Its main goal was to see if adding radiotherapy to hormone therapy provided extra benefits to patients. We wanted to see whether the trial results were replicated using the UK health data. If results were replicated, it would provide confidence in further exploration of questions not covered by the trial. In a trial, randomization time is the point where doctors allocate patients to one of the treatments being assessed, usually 1 new treatment and 1 control treatment. Patients in the study are then followed up from that point. However, defining a starting point in a non-trial setting, using UK national health data, is not as straightforward. In the PR07 trial, patients started RT+HT within 8 weeks of randomization. In the UK datasets, we use diagnosis as a starting point, because it is available for both treatment groups. However, very few patients started RT+HT within 8 weeks of diagnosis. To address this, we allowed for longer periods of 4-6 months from diagnosis for RT+HT initiation. Recent developments provided statistical methods for estimating the cause-effect relationship between treatment received and survival patterns. In this study, we show how these approaches can be used to estimate survival patterns if all patients had RT+HT within 4-6 months from diagnosis compared with if no patients had any RT within 4-6 months. We account for the different treatment initiation times and the main differences between the RT+HT and HT only patients. Using these methods, we estimate that RT+HT has an 11%-15% higher survival rate at 7 years compared to HT only. We discuss similarities and differences between these findings and those in the original PR07 trial.
如果能够使用真实世界数据(RWD)成功模拟随机对照试验,那么对于RWD在估计试验中未评估问题的治疗效果方面的有效性的信心就会增强。我们使用从英格兰常规收集的行政和临床全国关联数据集,对高危前列腺癌患者的PR07试验进行模拟,该试验比较了在随机分组后8周内(“宽限期”)添加放疗的激素治疗(RT+HT)与单纯激素治疗(HT)对全因死亡率的影响。我们强调了在模拟该试验时所需的方法选择和遇到的挑战。
从常规全国关联数据集中识别出2014年至2020年被诊断为前列腺癌的患者。将诊断时间作为时间零点。由于很少有患者在诊断后8周内开始放疗,我们考虑了宽限期为4至6个月的目标试验。感兴趣的估计量是风险比(HRs)和7年的生存概率。采用克隆-删失-加权(CCW)方法来控制测量到的混杂因素,并考虑宽限期。我们还使用了一种扩展方法(“地标-CCW”方法),其中我们考虑诊断后的几个时间原点,使我们能够像在PR07中一样使用8周的宽限期。
共有2690名患者符合纳入模拟试验的条件。使用6个月宽限期的CCW分析得出,RT+HT策略的估计HR为0.48(95%置信区间[CI]:0.34 - 0.60),7年生存估计为80.7%(95%CI:74.3 - 87.0),单纯HT策略的生存估计为65.6%(95%CI:62.8 - 68.1),相应的风险差异为15.1%(95%CI:11.5 - 18.9)。地标-CCW方法得出的相应HR为0.58(95%CI:0.51 - 0.65),RT+HT策略的生存估计为80.7%(95%CI:77.7 - 83.8),单纯HT策略的生存估计为69.8%(95%CI:68.2 - 71.4),风险差异为10.9%(95%CI:6.3 - 15.9)。
我们使用RWD进行模拟试验的结果与PR07的结果大致一致,估计RT+HT与单纯HT相比能带来更好的生存。然而,结果并未完全重现,PR07报告在7年随访中的HR为0.77(95%CI:0.61 - 0.98)。差异可能部分归因于定义时间零点以及考虑与PR07相同持续时间的治疗宽限期方面的挑战。我们的研究考虑了应对这些挑战的方法,并且我们的发现证实了RWD在估计治疗效果方面的效用。
临床试验是检验治疗方法是否有效的最佳方式,但它们成本高昂,需要数年才能完成,并且专注于狭窄的研究问题。如果我们能够使用诸如患者健康记录之类的真实世界数据(RWD)来模拟这些试验,我们或许能够回答与使用这些药物的患者相关的其他医学问题。本研究旨在探讨是否能够使用来自英格兰的国家健康数据重现过去一项临床试验(PR07)的结果。PR07试验研究了高危前列腺癌的两种治疗方法:单独的激素治疗(HT)以及在研究开始后8周内添加放疗的激素治疗(RT+HT)。选择该试验有几个原因。它纳入了高危患者并进行了长达7年的研究,这意味着有足够的信息来研究生存结果。作为一项主要基于英国的试验,它有助于与英国国家健康数据中记录的数据进行比较。该试验在治疗开始时间方面也具有一定灵活性,这是一个值得研究的有趣因素。其主要目标是查看在激素治疗中添加放疗是否能为患者带来额外益处。我们想看看使用英国健康数据是否能重现试验结果。如果结果能够重现,将为进一步探索试验未涵盖的问题提供信心。在试验中,随机分组时间是医生将患者分配到所评估的一种治疗方法(通常是一种新治疗方法和一种对照治疗方法)的时间点。然后从该点开始对研究中的患者进行随访。然而,在非试验环境中使用英国国家健康数据来定义一个起点并非那么简单。在PR07试验中,患者在随机分组后8周内开始接受RT+HT。在英国的数据集中,我们将诊断作为起点,因为两个治疗组都有该信息。然而,很少有患者在诊断后8周内开始接受RT+HT。为了解决这个问题,我们允许从诊断开始有长达4至6个月的时间用于开始RT+HT治疗。最近的进展提供了用于估计接受的治疗与生存模式之间因果关系的统计方法。在本研究中,我们展示了如何使用这些方法来估计如果所有患者在诊断后4至6个月内都接受RT+HT与如果没有患者在4至6个月内接受任何放疗时的生存模式。我们考虑了不同的治疗开始时间以及RT+HT组和单纯HT组患者之间的主要差异。使用这些方法,我们估计与单纯HT相比,RT+HT在7年时的生存率高11% - 15%。我们讨论了这些发现与原始PR07试验结果之间的异同。