Department of Psychiatry and Behavioral Sciences, University of Texas Health Science Center at Houston, Houston, TX.
Oklahoma Tobacco Research Center, Stephenson Cancer Center, Oklahoma City, OK.
Nicotine Tob Res. 2019 Jan 4;21(2):173-179. doi: 10.1093/ntr/ntx201.
Machine learning algorithms such as elastic net regression and backward selection provide a unique and powerful approach to model building given a set of psychosocial predictors of smoking lapse measured repeatedly via ecological momentary assessment (EMA). Understanding these predictors may aid in developing interventions for smoking lapse prevention.
In a randomized-controlled smoking cessation trial, smartphone-based EMAs were collected from 92 participants following a scheduled quit date. This secondary analysis utilized elastic net-penalized cox proportional hazards regression and model approximation via backward elimination to (1) optimize a predictive model of time to first lapse and (2) simplify that model to its core constituent predictors to maximize parsimony and generalizability.
Elastic net proportional hazards regression selected 17 of 26 possible predictors from 2065 EMAs to model time to first lapse. The predictors with the highest magnitude regression coefficients were having consumed alcohol in the past hour, being around and interacting with a smoker, and having cigarettes easily available. This model was reduced using backward elimination, retaining five predictors and approximating to 93.9% of model fit. The retained predictors included those mentioned above as well as feeling irritable and being in areas where smoking is either discouraged or allowed (as opposed to not permitted).
The strongest predictors of smoking lapse were environmental in nature (e.g., being in smoking-permitted areas) as opposed to internal factors such as psychological affect. Interventions may be improved by a renewed focus of interventions on these predictors.
The present study demonstrated the utility of machine learning algorithms to optimize the prediction of time to smoking lapse using EMA data. The two models generated by the present analysis found that environmental factors were most strongly related to smoking lapse. The results support the use of machine learning algorithms to investigate intensive longitudinal data, and provide a foundation for the development of highly tailored, just-in-time interventions that can target on multiple antecedents of smoking lapse.
基于机器学习算法,如弹性网络回归和向后选择,可以为模型构建提供独特而强大的方法,前提是给定一套通过生态瞬时评估(EMA)反复测量的吸烟复发的社会心理预测因子。了解这些预测因子可能有助于开发预防吸烟复发的干预措施。
在一项随机对照戒烟试验中,从 92 名参与者在预定的戒烟日期后,通过智能手机收集基于 EMA 的数据。这项二次分析利用弹性网络惩罚 Cox 比例风险回归和通过向后消除进行模型近似,以(1)优化首次复发时间的预测模型,以及(2)简化该模型以保留其核心组成预测因子,以最大限度地提高简约性和通用性。
弹性网络比例风险回归从 2065 个 EMA 中选择了 26 个可能预测因子中的 17 个,以预测首次复发时间。回归系数最大的预测因子是过去 1 小时内饮酒、与吸烟者共处和互动以及香烟容易获得。使用向后消除法减少模型,保留五个预测因子,并近似拟合度为 93.9%。保留的预测因子包括上述提到的预测因子,以及感到烦躁和处于吸烟被鼓励或允许的区域(而不是不允许)。
吸烟复发的最强预测因子是环境因素(例如,处于允许吸烟的区域),而不是内部因素,如心理影响。干预措施可以通过重新关注这些预测因子来得到改善。
本研究展示了机器学习算法在使用 EMA 数据优化吸烟复发时间预测方面的效用。本分析生成的两个模型发现,环境因素与吸烟复发最密切相关。结果支持使用机器学习算法研究密集的纵向数据,并为开发高度定制的、即时的干预措施提供基础,这些干预措施可以针对多个吸烟复发的前因进行干预。