Ten Haaf Kevin, Jeon Jihyoun, Tammemägi Martin C, Han Summer S, Kong Chung Yin, Plevritis Sylvia K, Feuer Eric J, de Koning Harry J, Steyerberg Ewout W, Meza Rafael
Department of Public Health, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
Department of Epidemiology, University of Michigan, Ann Arbor, Michigan, United States of America.
PLoS Med. 2017 Apr 4;14(4):e1002277. doi: 10.1371/journal.pmed.1002277. eCollection 2017 Apr.
Selection of candidates for lung cancer screening based on individual risk has been proposed as an alternative to criteria based on age and cumulative smoking exposure (pack-years). Nine previously established risk models were assessed for their ability to identify those most likely to develop or die from lung cancer. All models considered age and various aspects of smoking exposure (smoking status, smoking duration, cigarettes per day, pack-years smoked, time since smoking cessation) as risk predictors. In addition, some models considered factors such as gender, race, ethnicity, education, body mass index, chronic obstructive pulmonary disease, emphysema, personal history of cancer, personal history of pneumonia, and family history of lung cancer.
Retrospective analyses were performed on 53,452 National Lung Screening Trial (NLST) participants (1,925 lung cancer cases and 884 lung cancer deaths) and 80,672 Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial (PLCO) ever-smoking participants (1,463 lung cancer cases and 915 lung cancer deaths). Six-year lung cancer incidence and mortality risk predictions were assessed for (1) calibration (graphically) by comparing the agreement between the predicted and the observed risks, (2) discrimination (area under the receiver operating characteristic curve [AUC]) between individuals with and without lung cancer (death), and (3) clinical usefulness (net benefit in decision curve analysis) by identifying risk thresholds at which applying risk-based eligibility would improve lung cancer screening efficacy. To further assess performance, risk model sensitivities and specificities in the PLCO were compared to those based on the NLST eligibility criteria. Calibration was satisfactory, but discrimination ranged widely (AUCs from 0.61 to 0.81). The models outperformed the NLST eligibility criteria over a substantial range of risk thresholds in decision curve analysis, with a higher sensitivity for all models and a slightly higher specificity for some models. The PLCOm2012, Bach, and Two-Stage Clonal Expansion incidence models had the best overall performance, with AUCs >0.68 in the NLST and >0.77 in the PLCO. These three models had the highest sensitivity and specificity for predicting 6-y lung cancer incidence in the PLCO chest radiography arm, with sensitivities >79.8% and specificities >62.3%. In contrast, the NLST eligibility criteria yielded a sensitivity of 71.4% and a specificity of 62.2%. Limitations of this study include the lack of identification of optimal risk thresholds, as this requires additional information on the long-term benefits (e.g., life-years gained and mortality reduction) and harms (e.g., overdiagnosis) of risk-based screening strategies using these models. In addition, information on some predictor variables included in the risk prediction models was not available.
Selection of individuals for lung cancer screening using individual risk is superior to selection criteria based on age and pack-years alone. The benefits, harms, and feasibility of implementing lung cancer screening policies based on risk prediction models should be assessed and compared with those of current recommendations.
基于个体风险选择肺癌筛查候选人已被提议作为基于年龄和累积吸烟暴露量(包年数)标准的替代方案。对九个先前建立的风险模型识别最有可能患肺癌或死于肺癌者的能力进行了评估。所有模型均将年龄和吸烟暴露的各个方面(吸烟状态、吸烟持续时间、每日吸烟量、吸烟包年数、戒烟时间)视为风险预测因素。此外,一些模型还考虑了性别、种族、民族、教育程度、体重指数、慢性阻塞性肺疾病、肺气肿、个人癌症史、个人肺炎史以及肺癌家族史等因素。
对53452名国家肺癌筛查试验(NLST)参与者(1925例肺癌病例和884例肺癌死亡)以及80672名前列腺、肺癌、结直肠癌和卵巢癌筛查试验(PLCO)曾经吸烟的参与者(1463例肺癌病例和915例肺癌死亡)进行了回顾性分析。对以下方面评估了六年肺癌发病率和死亡率风险预测:(1)校准(通过图形方式),即比较预测风险与观察到的风险之间的一致性;(2)区分度(受试者操作特征曲线下面积[AUC]),用于区分患肺癌(死亡)和未患肺癌者;(3)临床实用性(决策曲线分析中的净效益),通过确定基于风险的资格标准可提高肺癌筛查效果的风险阈值。为进一步评估性能,将PLCO中风险模型的敏感性和特异性与基于NLST资格标准的敏感性和特异性进行了比较。校准结果令人满意,但区分度差异很大(AUC范围为0.61至0.81)。在决策曲线分析中,这些模型在相当大的风险阈值范围内优于NLST资格标准,所有模型的敏感性更高,部分模型的特异性略高。PLCOm2012、巴赫和两阶段克隆扩增发病率模型的总体性能最佳,在NLST中AUC>0.68,在PLCO中AUC>0.77。这三个模型在PLCO胸部X线摄影组中预测6年肺癌发病率的敏感性和特异性最高,敏感性>79.8%,特异性>62.3%。相比之下,NLST资格标准的敏感性为71.4%,特异性为62.2%。本研究的局限性包括未确定最佳风险阈值,因为这需要关于使用这些模型的基于风险的筛查策略的长期益处(例如,获得的生命年数和死亡率降低)和危害(例如,过度诊断)的额外信息。此外,风险预测模型中包含的一些预测变量的信息不可用。
使用个体风险选择肺癌筛查对象优于仅基于年龄和包年数的选择标准。应评估基于风险预测模型实施肺癌筛查政策的益处、危害和可行性,并与当前建议进行比较。