Division of Family Medicine and Primary Care, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Solna, Sweden.
Academic Primary Health Care Centre, Region Stockholm, Stockholm, Sweden.
PLoS One. 2022 Oct 21;17(10):e0276703. doi: 10.1371/journal.pone.0276703. eCollection 2022.
The aim of the present study was to investigate the predictive ability for lung cancer of symptoms reported in an adaptive e-questionnaire, separately for never smokers, former smokers, and current smokers.
Consecutive patients referred for suspected lung cancer were recruited between September 2014 and November 2015 from the lung clinic at the Karolinska University Hospital, Stockholm, Sweden. A total of 504 patients were later diagnosed with lung cancer (n = 310) or no cancer (n = 194). All participants answered an adaptive e-questionnaire with a maximum of 342 items, covering background variables and symptoms/sensations suspected to be associated with lung cancer. Stochastic gradient boosting, stratified on smoking status, was used to train and test a model for predicting the presence of lung cancer.
Among never smokers, 17 predictors contributed to predicting lung cancer with 82% of the patients being correctly classified, compared with 26 predictors with an accuracy of 77% among current smokers and 36 predictors with an accuracy of 63% among former smokers. Age, sex, and education level were the most important predictors in all models.
Methods or tools to assess the likelihood of lung cancer based on smoking status and to prioritize investigative and treatment measures among all patients seeking care with diffuse symptoms are much needed. Our study presents risk assessment models for patients with different smoking status that may be developed into clinical risk assessment tools that can help clinicians in assessing a patient's risk of having lung cancer.
本研究旨在针对从未吸烟者、前吸烟者和现吸烟者,分别研究在适应性电子问卷中报告的症状对肺癌的预测能力。
2014 年 9 月至 2015 年 11 月,连续从瑞典斯德哥尔摩卡罗林斯卡大学医院的肺部诊所招募疑似肺癌患者。共有 504 名患者后来被诊断为肺癌(n=310)或无癌症(n=194)。所有参与者均回答了一个自适应电子问卷,最多有 342 个项目,涵盖了背景变量以及疑似与肺癌相关的症状/感觉。基于吸烟状况,使用随机梯度增强对模型进行了训练和测试,以预测肺癌的存在。
在从未吸烟者中,有 17 个预测因子有助于预测肺癌,82%的患者被正确分类,而在现吸烟者中,有 26 个预测因子的准确率为 77%,在曾经吸烟者中,有 36 个预测因子的准确率为 63%。在所有模型中,年龄、性别和教育水平是最重要的预测因子。
需要开发基于吸烟状况评估肺癌可能性的方法或工具,并在所有有弥漫性症状寻求治疗的患者中确定调查和治疗措施的优先级。我们的研究提出了针对不同吸烟状况患者的风险评估模型,这些模型可能会被开发成临床风险评估工具,帮助临床医生评估患者患肺癌的风险。