• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

识别戒烟成功的关键预测因素:使用大语言模型的基于文本的特征选择

Identifying Key Predictors of Smoking Cessation Success: Text-Based Feature Selection Using a Large Language Model.

作者信息

Le Thuy T T, Yang Jiongxuan, Zhao Zimo, Zhang Kaidi, Li Wenjun, Hu Yan

机构信息

University of Michigan School of Public Health, Department of Health Management and Policy, Ann Arbor, MI, USA.

University of Michigan School of Public Health, Department of Biostatistics, Ann Arbor, MI, USA.

出版信息

medRxiv. 2025 Jun 20:2025.06.18.25329854. doi: 10.1101/2025.06.18.25329854.

DOI:10.1101/2025.06.18.25329854
PMID:40585098
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12204296/
Abstract

BACKGROUND

The most effective way to reduce mortality and morbidity among current smokers is to quit smoking. Although about half of smokers attempted to quit, only one-tenth succeeded in 2022.

OBJECTIVE

To identify key predictors of smoking cessation success to inform cessation interventions and increase quitting rates.

METHODS

We analyzed data from waves 5 and 6 of the Population Assessment of Tobacco and Health (PATH) study (December 2018 to November 2021). Using OpenAI's GPT-4.1, we identified the top 45 variables from wave 5 that are highly predictive of 12-month smoking abstinence in wave 6, based on descriptions of survey variables. We then validated the predictive power of the GPT-4.1-selected variables by comparing the performance of eXtreme Gradient Boosting (XGBoost) trained on different sets of variables. Finally, we derived insights into the top 10 variables, ranked according to their SHapley Additive exPlanations values.

RESULTS

The performance of XGBoost trained with all possible wave 5 variables and the 45 selected variables was almost identical (AUC:0.749 vs AUC:0.752). The top 10 variables included past 30-day smoking frequency, minutes from waking up to smoking first cigarette, important people's views on tobacco use, prevalence of tobacco use among close associates, daily electronic nicotine product use, emotional dependence, and health harm concerns.

CONCLUSION

This study demonstrates the ability of OpenAI's GPT-4.1 to identify the top 45 PATH wave 5 variables associated with 12-month smoking abstinence using only their descriptions. This approach could help researchers design more effective survey questionnaires and improve efficiency of data collection.

摘要

背景

降低当前吸烟者死亡率和发病率的最有效方法是戒烟。尽管约一半吸烟者尝试戒烟,但2022年只有十分之一的人成功戒烟。

目的

确定戒烟成功的关键预测因素,为戒烟干预提供依据并提高戒烟率。

方法

我们分析了烟草与健康人口评估(PATH)研究第5波和第6波(2018年12月至2021年11月)的数据。根据调查变量的描述,使用OpenAI的GPT-4.1从第5波中识别出45个对第6波中12个月戒烟具有高度预测性的变量。然后,通过比较在不同变量集上训练的极端梯度提升(XGBoost)的性能,验证GPT-4.1选择的变量的预测能力。最后,我们根据前10个变量的SHapley值对其进行了深入分析。

结果

使用所有可能的第5波变量和45个选定变量训练的XGBoost性能几乎相同(AUC:0.749对AUC:0.752)。前10个变量包括过去30天的吸烟频率、醒来至吸第一支烟的分钟数、重要人物对烟草使用的看法、亲密伙伴中的烟草使用流行率、每日电子尼古丁产品使用情况、情感依赖以及对健康危害的担忧。

结论

本研究证明了OpenAI的GPT-4.1仅根据描述就能识别与12个月戒烟相关的前45个PATH第5波变量的能力。这种方法可以帮助研究人员设计更有效的调查问卷并提高数据收集效率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/042e/12204296/1b2284b9ac5a/nihpp-2025.06.18.25329854v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/042e/12204296/1b2284b9ac5a/nihpp-2025.06.18.25329854v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/042e/12204296/1b2284b9ac5a/nihpp-2025.06.18.25329854v1-f0001.jpg

相似文献

1
Identifying Key Predictors of Smoking Cessation Success: Text-Based Feature Selection Using a Large Language Model.识别戒烟成功的关键预测因素:使用大语言模型的基于文本的特征选择
medRxiv. 2025 Jun 20:2025.06.18.25329854. doi: 10.1101/2025.06.18.25329854.
2
Interventions to reduce harm from continued tobacco use.减少持续吸烟危害的干预措施。
Cochrane Database Syst Rev. 2016 Oct 13;10(10):CD005231. doi: 10.1002/14651858.CD005231.pub3.
3
Smoking cessation medicines and e-cigarettes: a systematic review, network meta-analysis and cost-effectiveness analysis.戒烟药物和电子烟:系统评价、网络荟萃分析和成本效益分析。
Health Technol Assess. 2021 Oct;25(59):1-224. doi: 10.3310/hta25590.
4
Effectiveness and cost-effectiveness of computer and other electronic aids for smoking cessation: a systematic review and network meta-analysis.计算机和其他电子戒烟辅助手段的有效性和成本效益:系统评价和网络荟萃分析。
Health Technol Assess. 2012;16(38):1-205, iii-v. doi: 10.3310/hta16380.
5
Differences in the effectiveness of individual-level smoking cessation interventions by socioeconomic status.个体层面戒烟干预措施的有效性在社会经济地位方面的差异。
Cochrane Database Syst Rev. 2025 Jan 27;1(1):CD015120. doi: 10.1002/14651858.CD015120.pub2.
6
Different doses, durations and modes of delivery of nicotine replacement therapy for smoking cessation.不同剂量、持续时间和尼古丁替代疗法给药方式对戒烟的效果。
Cochrane Database Syst Rev. 2023 Jun 19;6(6):CD013308. doi: 10.1002/14651858.CD013308.pub2.
7
Healthcare financing systems for increasing the use of tobacco dependence treatment.用于增加烟草依赖治疗使用的医疗保健融资系统。
Cochrane Database Syst Rev. 2017 Sep 12;9(9):CD004305. doi: 10.1002/14651858.CD004305.pub5.
8
Tobacco packaging design for reducing tobacco use.用于减少烟草使用的烟草包装设计。
Cochrane Database Syst Rev. 2017 Apr 27;4(4):CD011244. doi: 10.1002/14651858.CD011244.pub2.
9
Tobacco cessation interventions for young people.针对年轻人的戒烟干预措施。
Cochrane Database Syst Rev. 2006 Oct 18(4):CD003289. doi: 10.1002/14651858.CD003289.pub4.
10
Interventions for tobacco use cessation in people living with HIV.HIV 感染者的戒烟干预措施。
Cochrane Database Syst Rev. 2024 Aug 5;8(8):CD011120. doi: 10.1002/14651858.CD011120.pub3.

本文引用的文献

1
Daily or Nondaily Vaping and Smoking Cessation Among Smokers.吸烟者的每日或非每日电子烟使用与戒烟
JAMA Netw Open. 2025 Mar 3;8(3):e250089. doi: 10.1001/jamanetworkopen.2025.0089.
2
Current applications and challenges in large language models for patient care: a systematic review.用于患者护理的大语言模型的当前应用与挑战:一项系统综述
Commun Med (Lond). 2025 Jan 21;5(1):26. doi: 10.1038/s43856-024-00717-2.
3
Genome-wide association study of varenicline-aided smoking cessation.伐尼克兰辅助戒烟的全基因组关联研究。
Nicotine Tob Res. 2025 Jan 10;27(10):1684-94. doi: 10.1093/ntr/ntaf009.
4
Practical guide to SHAP analysis: Explaining supervised machine learning model predictions in drug development.SHAP 分析实用指南:在药物研发中解释有监督机器学习模型预测。
Clin Transl Sci. 2024 Nov;17(11):e70056. doi: 10.1111/cts.70056.
5
A systematic review and network meta-analysis of population-level interventions to tackle smoking behaviour.一项关于解决吸烟行为的人群层面干预措施的系统评价和网状荟萃分析。
Nat Hum Behav. 2024 Dec;8(12):2367-2391. doi: 10.1038/s41562-024-02002-7. Epub 2024 Oct 7.
6
Associations of Close Social Connections With Smoking and Vaping: A Population Study in England.紧密社会关系与吸烟及吸电子烟的关联:一项英国的人群研究。
Nicotine Tob Res. 2025 Feb 24;27(3):447-456. doi: 10.1093/ntr/ntae225.
7
Adult Smoking Cessation - United States, 2022.成人戒烟 - 美国,2022 年。
MMWR Morb Mortal Wkly Rep. 2024 Jul 25;73(29):633-641. doi: 10.15585/mmwr.mm7329a1.
8
Key Risk Factors Associated With Electronic Nicotine Delivery Systems Use Among Adolescents.与青少年使用电子烟相关的主要危险因素。
JAMA Netw Open. 2023 Oct 2;6(10):e2337101. doi: 10.1001/jamanetworkopen.2023.37101.
9
Are the Relevant Risk Factors Being Adequately Captured in Empirical Studies of Smoking Initiation? A Machine Learning Analysis Based on the Population Assessment of Tobacco and Health Study.基于人群烟草与健康研究的机器学习分析:吸烟起始的实证研究中是否充分捕捉到了相关风险因素?
Nicotine Tob Res. 2023 Jul 14;25(8):1481-1488. doi: 10.1093/ntr/ntad066.
10
Association between family or peer views towards tobacco use and past 30-day smoking cessation among adults with mental health problems.有心理健康问题的成年人中,家人或同伴对吸烟的看法与过去30天戒烟之间的关联。
Prev Med Rep. 2022 Jul 5;28:101886. doi: 10.1016/j.pmedr.2022.101886. eCollection 2022 Aug.