Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Washington DC, United States of America.
Department of Health Management and Policy, University of Michigan School of Public Health, Ann Arbor, MI, United States of America.
PLoS One. 2023 Jun 8;18(6):e0286883. doi: 10.1371/journal.pone.0286883. eCollection 2023.
Identifying determinants of smoking cessation is critical for developing optimal cessation treatments and interventions. Machine learning (ML) is becoming more prevalent for smoking cessation success prediction in treatment programs. However, only individuals with an intention to quit smoking cigarettes participate in such programs, which limits the generalizability of the results. This study applies data from the Population Assessment of Tobacco and Health (PATH), a United States longitudinal nationally representative survey, to select primary determinants of smoking cessation and to train ML classification models for predicting smoking cessation among the general population. An analytical sample of 9,281 adult current established smokers from the PATH survey wave 1 was used to develop classification models to predict smoking cessation by wave 2. Random forest and gradient boosting machines were applied for variable selection, and the SHapley Additive explanation method was used to show the effect direction of the top-ranked variables. The final model predicted wave 2 smoking cessation for current established smokers in wave 1 with an accuracy of 72% in the test dataset. The validation results showed that a similar model could predict wave 3 smoking cessation of wave 2 smokers with an accuracy of 70%. Our analysis indicated that more past 30 days e-cigarette use at the time of quitting, fewer past 30 days cigarette use before quitting, ages older than 18 at smoking initiation, fewer years of smoking, poly tobacco past 30-days use before quitting, and higher BMI resulted in higher chances of cigarette cessation for adult smokers in the US.
确定戒烟成功的决定因素对于开发最佳的戒烟治疗和干预措施至关重要。机器学习 (ML) 越来越多地用于预测治疗计划中的戒烟成功。然而,只有有戒烟意愿的人才会参与此类项目,这限制了结果的普遍性。本研究应用来自美国纵向全国代表性调查“人口烟草与健康评估 (PATH)”的数据,选择戒烟的主要决定因素,并为一般人群的戒烟预测训练 ML 分类模型。使用来自 PATH 调查波 1 的 9281 名成年当前已确立吸烟者的分析样本,开发分类模型以预测波 2 的戒烟情况。随机森林和梯度提升机用于变量选择,SHapley Additive 解释方法用于显示排名最高的变量的影响方向。最终模型在测试数据集中预测波 1 当前已确立吸烟者的波 2 戒烟,准确率为 72%。验证结果表明,类似的模型可以预测波 2 吸烟者的波 3 戒烟,准确率为 70%。我们的分析表明,在戒烟时更多地使用过去 30 天的电子烟、在戒烟前更少地使用过去 30 天的香烟、在 18 岁以上开始吸烟、吸烟年限较少、在戒烟前更多地使用多烟草制品过去 30 天、以及更高的 BMI 会增加美国成年吸烟者戒烟的机会。