• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于人群烟草与健康研究的机器学习分析:吸烟起始的实证研究中是否充分捕捉到了相关风险因素?

Are the Relevant Risk Factors Being Adequately Captured in Empirical Studies of Smoking Initiation? A Machine Learning Analysis Based on the Population Assessment of Tobacco and Health Study.

机构信息

Department of Health Management and Policy, School of Public Health, University of Michigan, Ann Arbor, MI, USA.

Department of Oncology, School of Medicine, Georgetown University, Washington, DC, USA.

出版信息

Nicotine Tob Res. 2023 Jul 14;25(8):1481-1488. doi: 10.1093/ntr/ntad066.

DOI:10.1093/ntr/ntad066
PMID:37099744
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10347975/
Abstract

INTRODUCTION

Cigarette smoking continues to pose a threat to public health. Identifying individual risk factors for smoking initiation is essential to further mitigate this epidemic. To the best of our knowledge, no study today has used machine learning (ML) techniques to automatically uncover informative predictors of smoking onset among adults using the Population Assessment of Tobacco and Health (PATH) study.

AIMS AND METHODS

In this work, we employed random forest paired with Recursive Feature Elimination to identify relevant PATH variables that predict smoking initiation among adults who have never smoked at baseline between two consecutive PATH waves. We included all potentially informative baseline variables in wave 1 (wave 4) to predict past 30-day smoking status in wave 2 (wave 5). Using the first and most recent pairs of PATH waves was found sufficient to identify the key risk factors of smoking initiation and test their robustness over time. The eXtreme Gradient Boosting method was employed to test the quality of these selected variables.

RESULTS

As a result, classification models suggested about 60 informative PATH variables among many candidate variables in each baseline wave. With these selected predictors, the resulting models have a high discriminatory power with the area under the specificity-sensitivity curves of around 80%. We examined the chosen variables and discovered important features. Across the considered waves, two factors, (1) BMI, and (2) dental and oral health status, robustly appeared as important predictors of smoking initiation, besides other well-established predictors.

CONCLUSIONS

Our work demonstrates that ML methods are useful to predict smoking initiation with high accuracy, identifying novel smoking initiation predictors, and to enhance our understanding of tobacco use behaviors.

IMPLICATIONS

Understanding individual risk factors for smoking initiation is essential to prevent smoking initiation. With this methodology, a set of the most informative predictors of smoking onset in the PATH data were identified. Besides reconfirming well-known risk factors, the findings suggested additional predictors of smoking initiation that have been overlooked in previous work. More studies that focus on the newly discovered factors (BMI and dental and oral health status,) are needed to confirm their predictive power against the onset of smoking as well as determine the underlying mechanisms.

摘要

简介

吸烟仍然对公共健康构成威胁。确定吸烟起始的个体风险因素对于进一步减轻这一流行至关重要。据我们所知,目前尚无研究使用机器学习 (ML) 技术自动发现使用人口评估烟草和健康 (PATH) 研究的成年人吸烟起始的信息预测因子。

目的和方法

在这项工作中,我们采用随机森林与递归特征消除相结合的方法,在基线时从未吸烟过的成年人在两个连续的 PATH 波之间,识别出与吸烟起始相关的相关 PATH 变量。我们将波 1 中的所有潜在信息变量(波 4)纳入预测波 2 中过去 30 天吸烟状况(波 5)。发现使用第一和最近的 PATH 波足以识别吸烟起始的关键风险因素,并随着时间的推移测试其稳健性。极端梯度增强方法被用于测试这些选定变量的质量。

结果

结果表明,在每个基线波的众多候选变量中,分类模型建议了大约 60 个信息性 PATH 变量。使用这些选定的预测因子,生成的模型具有较高的判别力,特异性-敏感性曲线下的面积约为 80%。我们检查了选定的变量,并发现了重要的特征。在考虑的波中,两个因素,(1)BMI,和(2)牙齿和口腔健康状况,除了其他已确立的预测因子外,还作为吸烟起始的重要预测因子稳定出现。

结论

我们的工作表明,ML 方法可用于高度准确地预测吸烟起始,识别新的吸烟起始预测因子,并增强我们对烟草使用行为的理解。

意义

了解吸烟起始的个体风险因素对于预防吸烟起始至关重要。使用这种方法,确定了 PATH 数据中吸烟发作的最具信息性预测因子集。除了重新确认众所周知的风险因素外,研究结果还提出了以前工作中被忽视的吸烟起始的其他预测因子。需要更多关注新发现因素(BMI 和牙齿和口腔健康状况)的研究来确认它们对吸烟起始的预测能力,并确定潜在的机制。

相似文献

1
Are the Relevant Risk Factors Being Adequately Captured in Empirical Studies of Smoking Initiation? A Machine Learning Analysis Based on the Population Assessment of Tobacco and Health Study.基于人群烟草与健康研究的机器学习分析:吸烟起始的实证研究中是否充分捕捉到了相关风险因素?
Nicotine Tob Res. 2023 Jul 14;25(8):1481-1488. doi: 10.1093/ntr/ntad066.
2
Association of Noncigarette Tobacco Product Use With Future Cigarette Smoking Among Youth in the Population Assessment of Tobacco and Health (PATH) Study, 2013-2015.非香烟烟草制品使用与人群烟草和健康评估(PATH)研究中青少年未来吸烟的关联,2013-2015 年。
JAMA Pediatr. 2018 Feb 1;172(2):181-187. doi: 10.1001/jamapediatrics.2017.4173.
3
Correlates of tobacco product initiation among youth and young adults between waves 1-4 of the population assessment of tobacco and Health (PATH) study (2013-2018).青少年和青年在人群烟草与健康评估(PATH)研究的第 1-4 波(2013-2018 年)中开始使用烟草产品的相关因素。
Addict Behav. 2022 Nov;134:107396. doi: 10.1016/j.addbeh.2022.107396. Epub 2022 Jun 7.
4
Association of Electronic Nicotine Delivery System Use With Cigarette Smoking Relapse Among Former Smokers in the United States.电子尼古丁传送系统使用与美国曾吸烟者复吸香烟的关联。
JAMA Netw Open. 2020 Jun 1;3(6):e204813. doi: 10.1001/jamanetworkopen.2020.4813.
5
Prospective predictors of flavored e-cigarette use: A one-year longitudinal study of young adults in the U.S.前瞻性预测因素与调味电子烟使用的关系:美国一项针对青年成年人的为期一年的纵向研究
Drug Alcohol Depend. 2018 Oct 1;191:279-285. doi: 10.1016/j.drugalcdep.2018.07.020. Epub 2018 Aug 25.
6
Identifying emerging predictors for adolescent electronic nicotine delivery systems use: A machine learning analysis of the Population Assessment of Tobacco and Health Study.识别青少年使用电子烟的新预测因素:一项对《美国人口烟草与健康调查》的机器学习分析。
Prev Med. 2021 Apr;145:106418. doi: 10.1016/j.ypmed.2021.106418. Epub 2021 Jan 7.
7
Tobacco and nicotine delivery product use in a U.S. national sample of women of reproductive age.美国育龄女性中烟草和尼古丁传递产品的使用情况。
Prev Med. 2018 Dec;117:61-68. doi: 10.1016/j.ypmed.2018.03.001. Epub 2018 Mar 17.
8
Trajectories of ENDS and cigarette use among dual users: analysis of waves 1 to 5 of the PATH Study.双重使用者中ENDS 和香烟使用的轨迹:PATH 研究第 1 至 5 波分析。
Tob Control. 2024 Mar 19;33(e1):e62-e68. doi: 10.1136/tc-2022-057405.
9
Initiation of any tobacco and five tobacco products across 3 years among youth, young adults and adults in the USA: findings from the PATH Study Waves 1-3 (2013-2016).美国青少年、年轻成年人和成年人在三年内开始使用任何烟草和五种烟草制品:来自 PATH 研究波 1-3(2013-2016 年)的发现。
Tob Control. 2020 May;29(Suppl 3):s178-s190. doi: 10.1136/tobaccocontrol-2019-055573.
10
Longitudinal transitions in initiation, cessation, and relapse of cigarette smoking and e-cigarette use among US youth and adults: Validation of a microsimulation model.美国青少年和成年人中香烟和电子烟使用的起始、停止和复吸的纵向转变:微模拟模型的验证。
PLoS One. 2023 Apr 14;18(4):e0284426. doi: 10.1371/journal.pone.0284426. eCollection 2023.

引用本文的文献

1
Identifying Key Predictors of Smoking Cessation Success: Text-Based Feature Selection Using a Large Language Model.识别戒烟成功的关键预测因素:使用大语言模型的基于文本的特征选择
medRxiv. 2025 Jun 20:2025.06.18.25329854. doi: 10.1101/2025.06.18.25329854.
2
Harnessing machine learning in contemporary tobacco research.当代烟草研究中机器学习的应用
Toxicol Rep. 2024 Dec 19;14:101877. doi: 10.1016/j.toxrep.2024.101877. eCollection 2025 Jun.
3
Key Risk Factors Associated With Electronic Nicotine Delivery Systems Use Among Adolescents.与青少年使用电子烟相关的主要危险因素。
JAMA Netw Open. 2023 Oct 2;6(10):e2337101. doi: 10.1001/jamanetworkopen.2023.37101.

本文引用的文献

1
Machine learning models of tobacco susceptibility and current use among adolescents from 97 countries in the Global Youth Tobacco Survey, 2013-2017.2013 - 2017年全球青少年烟草调查中97个国家青少年烟草易感性和当前使用情况的机器学习模型
PLOS Glob Public Health. 2021 Dec 8;1(12):e0000060. doi: 10.1371/journal.pgph.0000060. eCollection 2021.
2
Tobacco Product Use Among Adults - United States, 2020.2020年美国成年人烟草制品使用情况
MMWR Morb Mortal Wkly Rep. 2022 Mar 18;71(11):397-405. doi: 10.15585/mmwr.mm7111a1.
3
A machine learning approach to predict e-cigarette use and dependence among Ontario youth.一种机器学习方法,用于预测安大略省青年中电子烟的使用和依赖情况。
Health Promot Chronic Dis Prev Can. 2022 Jan;42(1):21-28. doi: 10.24095/hpcdp.42.1.04.
4
Is Adolescent E-Cigarette Use Associated With Subsequent Smoking? A New Look.青少年使用电子烟是否与随后的吸烟有关?新的研究视角。
Nicotine Tob Res. 2022 Mar 26;24(5):710-718. doi: 10.1093/ntr/ntab243.
5
A Machine Learning Approach to Identify Predictors of Frequent Vaping and Vulnerable Californian Youth Subgroups.机器学习方法识别频繁吸电子烟的预测因子和易受影响的加州青年亚组。
Nicotine Tob Res. 2022 Jun 15;24(7):1028-1036. doi: 10.1093/ntr/ntab257.
6
Machine learning applications in tobacco research: a scoping review.机器学习在烟草研究中的应用:范围综述。
Tob Control. 2023 Jan;32(1):99-109. doi: 10.1136/tobaccocontrol-2020-056438. Epub 2021 Aug 27.
7
Smoking Susceptibility and Tobacco Media Engagement Among Youth Never Smokers.青少年从不吸烟者的吸烟易感性和烟草媒体参与度。
Pediatrics. 2021 Jun;147(6). doi: 10.1542/peds.2020-017921. Epub 2021 May 3.
8
E-cigarette use and combustible tobacco cigarette smoking uptake among non-smokers, including relapse in former smokers: umbrella review, systematic review and meta-analysis.电子烟使用与非吸烟者中可燃烟草卷烟的使用(包括前吸烟者的复吸):伞式综述、系统评价和荟萃分析。
BMJ Open. 2021 Mar 30;11(3):e045603. doi: 10.1136/bmjopen-2020-045603.
9
U.S. healthcare spending attributable to cigarette smoking in 2014.2014 年美国因吸烟导致的医疗保健支出。
Prev Med. 2021 Sep;150:106529. doi: 10.1016/j.ypmed.2021.106529. Epub 2021 Mar 23.
10
Examining proximity exposure in a social network as a mechanism driving peer influence of adolescent smoking.研究社交网络中的近距离暴露作为推动青少年吸烟同伴影响的一种机制。
Addict Behav. 2021 Jun;117:106853. doi: 10.1016/j.addbeh.2021.106853. Epub 2021 Feb 4.