Hippisley-Cox Julia, Coupland Carol
Division of Primary Care, University Park, Nottingham, UK.
BMJ Open. 2015 Mar 17;5(3):e007825. doi: 10.1136/bmjopen-2015-007825.
To derive and validate a set of clinical risk prediction algorithm to estimate the 10-year risk of 11 common cancers.
Prospective open cohort study using routinely collected data from 753 QResearch general practices in England. We used 565 practices to develop the scores and 188 for validation.
4.96 million patients aged 25-84 years in the derivation cohort; 1.64 million in the validation cohort. Patients were free of the relevant cancer at baseline.
Cox proportional hazards models in the derivation cohort to derive 10-year risk algorithms. Risk factors considered included age, ethnicity, deprivation, body mass index, smoking, alcohol, previous cancer diagnoses, family history of cancer, relevant comorbidities and medication. Measures of calibration and discrimination in the validation cohort.
Incident cases of blood, breast, bowel, gastro-oesophageal, lung, oral, ovarian, pancreas, prostate, renal tract and uterine cancers. Cancers were recorded on any one of four linked data sources (general practitioner (GP), mortality, hospital or cancer records).
We identified 228,241 incident cases during follow-up of the 11 types of cancer. Of these 25,444 were blood; 41,315 breast; 32,626 bowel, 12,808 gastro-oesophageal; 32,187 lung; 4811 oral; 6635 ovarian; 7119 pancreatic; 35,256 prostate; 23,091 renal tract; 6949 uterine cancers. The lung cancer algorithm had the best performance with an R(2) of 64.2%; D statistic of 2.74; receiver operating characteristic curve statistic of 0.91 in women. The sensitivity for the top 10% of women at highest risk of lung cancer was 67%. Performance of the algorithms in men was very similar to that for women.
We have developed and validated a prediction models to quantify absolute risk of 11 common cancers. They can be used to identify patients at high risk of cancers for prevention or further assessment. The algorithms could be integrated into clinical computer systems and used to identify high-risk patients.
There is a simple web calculator to implement the Qcancer 10 year risk algorithm together with the open source software for download (available at http://qcancer.org/10yr/).
推导并验证一组临床风险预测算法,以估计11种常见癌症的10年发病风险。
前瞻性开放队列研究,使用从英格兰753家QResearch全科诊所常规收集的数据。我们用565家诊所的数据来制定评分,188家诊所的数据用于验证。
推导队列中有496万年龄在25 - 84岁的患者;验证队列中有164万患者。患者在基线时无相关癌症。
在推导队列中使用Cox比例风险模型来推导10年风险算法。考虑的风险因素包括年龄、种族、贫困程度、体重指数、吸烟、饮酒、既往癌症诊断、癌症家族史、相关合并症和用药情况。在验证队列中进行校准和区分度测量。
11种癌症的发病情况。癌症记录在四个相互关联的数据源(全科医生(GP)、死亡率、医院或癌症记录)中的任何一个上。
在11种癌症的随访期间,我们共识别出228,241例发病病例。其中血癌25,444例;乳腺癌41,315例;结直肠癌32,626例,胃食管癌12,808例;肺癌32,187例;口腔癌4811例;卵巢癌6635例;胰腺癌7119例;前列腺癌35,256例;肾癌23,091例;子宫癌6949例。肺癌算法表现最佳,女性的R(2)为64.2%;D统计量为2.74;受试者工作特征曲线统计量为0.91。肺癌风险最高的前10%女性的敏感度为67%。算法在男性中的表现与女性非常相似。
我们已经开发并验证了一种预测模型,以量化11种常见癌症的绝对风险。它们可用于识别癌症高危患者,以便进行预防或进一步评估。这些算法可整合到临床计算机系统中,用于识别高危患者。
有一个简单的网络计算器来实施Qcancer 10年风险算法,同时还有开源软件可供下载(可在http://qcancer.org/10yr/获取)。