PhD Student in the Postgraduate Program in Health Sciences, State University of Maringa, Maringa, Parana, Brazil.
Master in Computer Sciences, State University of Maringa, Maringa, Parana, Brazil.
PLoS One. 2024 Mar 4;19(3):e0295970. doi: 10.1371/journal.pone.0295970. eCollection 2024.
Smoking cessation is an important public health policy worldwide. However, as far as we know, there is a lack of screening of variables related to the success of therapeutic intervention (STI) in Brazilian smokers by machine learning (ML) algorithms. To address this gap in the literature, we evaluated the ability of eight ML algorithms to correctly predict the STI in Brazilian smokers who were treated at a smoking cessation program in Brazil between 2006 and 2017. The dataset was composed of 12 variables and the efficacies of the algorithms were measured by accuracy, sensitivity, specificity, positive predictive value (PPV) and area under the receiver operating characteristic curve. We plotted a decision tree flowchart and also measured the odds ratio (OR) between each independent variable and the outcome, and the importance of the variable for the best model based on PPV. The mean global values for the metrics described above were, respectively, 0.675±0.028, 0.803±0.078, 0.485±0.146, 0.705±0.035 and 0.680±0.033. Supporting vector machines performed the best algorithm with a PPV of 0.726±0.031. Smoking cessation drug use was the roof of decision tree with OR of 4.42 and importance of variable of 100.00. Increase in the number of relapses also promoted a positive outcome, while higher consumption of cigarettes resulted in the opposite. In summary, the best model predicted 72.6% of positive outcomes correctly. Smoking cessation drug use and higher number of relapses contributed to quit smoking, while higher consumption of cigarettes showed the opposite effect. There are important strategies to reduce the number of smokers and increase STI by increasing services and drug treatment for smokers.
戒烟是全球重要的公共卫生政策。然而,据我们所知,在巴西吸烟者中,通过机器学习(ML)算法对与治疗干预成功相关的变量进行筛查还很缺乏。为了填补这一文献空白,我们评估了 8 种 ML 算法在巴西戒烟计划中接受治疗的巴西吸烟者中正确预测治疗干预成功的能力。数据集由 12 个变量组成,算法的功效通过准确性、敏感性、特异性、阳性预测值(PPV)和受试者工作特征曲线下的面积来衡量。我们绘制了决策树流程图,还测量了每个独立变量与结果之间的比值(OR),以及基于 PPV 的最佳模型中变量的重要性。上述指标的平均全球值分别为 0.675±0.028、0.803±0.078、0.485±0.146、0.705±0.035 和 0.680±0.033。支持向量机的表现最好,PPV 为 0.726±0.031。戒烟药物的使用是决策树的顶点,OR 为 4.42,变量的重要性为 100.00。复发次数的增加也促进了积极的结果,而吸烟量的增加则产生了相反的效果。总的来说,最佳模型正确预测了 72.6%的阳性结果。戒烟药物的使用和更高的复发次数有助于戒烟,而更高的吸烟量则产生相反的效果。通过增加对吸烟者的服务和药物治疗,可以采取重要的策略来减少吸烟者的数量和提高治疗干预的成功率。