Ricketts K, Williams M, Liu Z-W, Gibson A
Department of Medical Physics and Bioengineering, University College London, UK.
Radiotherapy Department, University College London Hospital, London, UK; Department of Clinical Oncology, Imperial College Healthcare Trust, Charing Cross Hospital, Fulham Palace Road, London W6 8RF, UK.
Comput Methods Programs Biomed. 2014 Dec;117(3):412-24. doi: 10.1016/j.cmpb.2014.08.008. Epub 2014 Sep 10.
Overall survival (OS) and progression free survival (PFS) are key outcome measures for head and neck cancer as they reflect treatment efficacy, and have implications for patients and health services. The UK has recently developed a series of national cancer audits which aim to estimate survival and recurrence by relying on institutions manually submitting interval data on patient status, a labour-intensive method. However, nationally, data are routinely collected on hospital admissions, surgery, radiotherapy and chemotherapy. We have developed a technique to automate the interpretation of these routine datasets, allowing us to derive patterns of treatment in head and neck cancer patients from routinely acquired data.
We identified 122 patients with head and neck cancer and extracted treatment histories from hospital notes to provide a gold standard dataset. We obtained routinely collected local data on inpatient admission and procedures, chemotherapy and radiotherapy for these patients and analysed them with a computer algorithm which identified relevant time points and then calculated OS and PFS. We validated these by comparison with the gold standard dataset. The algorithm was then optimised to maximise correct identification of each timepoint, and minimise false identification of recurrence events.
Of the 122 patients, 82% had locally advanced disease. OS was 88% at 1 year and 77% at 2 years and PFS was 75% and 66% at 1 and 2 years. 40 patients developed recurrent disease. Our automated method provided an estimated OS of 87% and 77% and PFS of 87% and 78% at 1 and 2 years; 98% and 82% of patients showed good agreement between the automated technique and Gold standard dataset of OS and PFS respectively (ratio of Gold standard to routine intervals of between 0.8 and 1.2). The automated technique correctly assigned recurrence in 101 out of 122 (83%) of the patients: 21 of the 40 patients with recurrent disease were correctly identified, 19 were too unwell to receive further treatment and were missed. Of the 82 patients who did not develop a recurrence, 77 were correctly identified and 2 were incorrectly identified as having recurrent disease when they did not.
We have demonstrated that our algorithm can be used to automate the interpretation of routine datasets to extract survival information for this sample of patients. It currently underestimates recurrence rates due to many patients not being well-enough to be treated for recurrent disease. With some further optimisation, this technique could be extended to a national level, providing a new approach to measuring outcomes on a larger scale than is currently possible. This could have implications for healthcare provision and policy for a range of different disease types.
总生存期(OS)和无进展生存期(PFS)是头颈癌的关键疗效指标,因为它们反映了治疗效果,对患者和医疗服务具有重要意义。英国最近开展了一系列全国性癌症审计,旨在通过依靠各机构手动提交患者状态的间隔数据来估计生存率和复发率,这是一种劳动密集型方法。然而,在全国范围内,医院入院、手术、放疗和化疗的数据是常规收集的。我们开发了一种技术来自动解读这些常规数据集,从而能够从常规获取的数据中得出头颈癌患者的治疗模式。
我们确定了122名头颈癌患者,并从医院病历中提取治疗史以提供一个金标准数据集。我们获取了这些患者常规收集的关于住院入院和诊疗程序、化疗和放疗的本地数据,并用一种计算机算法对其进行分析,该算法能识别相关时间点,然后计算总生存期和无进展生存期。我们通过与金标准数据集进行比较来验证这些结果。然后对该算法进行优化,以最大限度地正确识别每个时间点,并尽量减少对复发事件的错误识别。
在这122名患者中,82%患有局部晚期疾病。1年时总生存率为88%,2年时为77%;1年和2年时无进展生存率分别为75%和66%。40名患者出现了疾病复发。我们的自动化方法得出1年和2年时的总生存率估计值分别为87%和77%,无进展生存率估计值分别为87%和78%;分别有98%和82%的患者在总生存期和无进展生存期方面,自动化技术与金标准数据集之间显示出良好的一致性(金标准与常规间隔的比率在0.8至1.2之间)。自动化技术在122名患者中的101名(83%)中正确判定了复发情况:40名复发患者中有21名被正确识别,19名患者身体状况太差无法接受进一步治疗而被遗漏。在82名未复发的患者中,77名被正确识别,2名在未复发时被错误识别为患有复发性疾病。
我们已经证明,我们的算法可用于自动解读常规数据集,以提取该患者样本的生存信息。目前,由于许多患者身体状况不佳无法接受复发性疾病的治疗,该算法低估了复发率。通过进一步优化,这项技术可以扩展到全国范围,提供一种比目前更大规模测量疗效的新方法。这可能会对一系列不同疾病类型的医疗服务提供和政策产生影响。