Bae Sang Won, Suffoletto Brian, Zhang Tongze, Chung Tammy, Ozolcer Melik, Islam Mohammad Rahul, Dey Anind K
Human-Computer Interaction and Human-Centered AI Systems Lab, AI for Healthcare Lab, School of Systems and Enterprises, Stevens Institute of Technology, Hoboken, NJ, United States.
Department of Emergency Medicine, Stanford University, Stanford, CA, United States.
JMIR Form Res. 2023 May 4;7:e39862. doi: 10.2196/39862.
Digital just-in-time adaptive interventions can reduce binge-drinking events (BDEs; consuming ≥4 drinks for women and ≥5 drinks for men per occasion) in young adults but need to be optimized for timing and content. Delivering just-in-time support messages in the hours prior to BDEs could improve intervention impact.
We aimed to determine the feasibility of developing a machine learning (ML) model to accurately predict future, that is, same-day BDEs 1 to 6 hours prior BDEs, using smartphone sensor data and to identify the most informative phone sensor features associated with BDEs on weekends and weekdays to determine the key features that explain prediction model performance.
We collected phone sensor data from 75 young adults (aged 21 to 25 years; mean 22.4, SD 1.9 years) with risky drinking behavior who reported their drinking behavior over 14 weeks. The participants in this secondary analysis were enrolled in a clinical trial. We developed ML models testing different algorithms (eg, extreme gradient boosting [XGBoost] and decision tree) to predict same-day BDEs (vs low-risk drinking events and non-drinking periods) using smartphone sensor data (eg, accelerometer and GPS). We tested various "prediction distance" time windows (more proximal: 1 hour; distant: 6 hours) from drinking onset. We also tested various analysis time windows (ie, the amount of data to be analyzed), ranging from 1 to 12 hours prior to drinking onset, because this determines the amount of data that needs to be stored on the phone to compute the model. Explainable artificial intelligence was used to explore interactions among the most informative phone sensor features contributing to the prediction of BDEs.
The XGBoost model performed the best in predicting imminent same-day BDEs, with 95% accuracy on weekends and 94.3% accuracy on weekdays (F-score=0.95 and 0.94, respectively). This XGBoost model needed 12 and 9 hours of phone sensor data at 3- and 6-hour prediction distance from the onset of drinking on weekends and weekdays, respectively, prior to predicting same-day BDEs. The most informative phone sensor features for BDE prediction were time (eg, time of day) and GPS-derived features, such as the radius of gyration (an indicator of travel). Interactions among key features (eg, time of day and GPS-derived features) contributed to the prediction of same-day BDEs.
We demonstrated the feasibility and potential use of smartphone sensor data and ML for accurately predicting imminent (same-day) BDEs in young adults. The prediction model provides "windows of opportunity," and with the adoption of explainable artificial intelligence, we identified "key contributing features" to trigger just-in-time adaptive intervention prior to the onset of BDEs, which has the potential to reduce the likelihood of BDEs in young adults.
ClinicalTrials.gov NCT02918565; https://clinicaltrials.gov/ct2/show/NCT02918565.
数字即时自适应干预可以减少年轻人的暴饮事件(BDEs;女性每次饮用≥4杯酒,男性每次饮用≥5杯酒),但需要在时间和内容上进行优化。在暴饮事件发生前数小时发送即时支持信息可能会提高干预效果。
我们旨在确定开发一种机器学习(ML)模型的可行性,该模型使用智能手机传感器数据准确预测未来,即暴饮事件发生前1至6小时的当日暴饮事件,并识别与周末和工作日暴饮事件相关的最具信息性的手机传感器特征,以确定解释预测模型性能的关键特征。
我们从75名有危险饮酒行为的年轻人(年龄在21至25岁之间;平均22.4岁,标准差1.9岁)收集手机传感器数据,这些人在14周内报告了他们饮酒行为。参与本次二次分析的参与者参加了一项临床试验。我们开发了ML模型,测试不同算法(如极端梯度提升[XGBoost]和决策树),以使用智能手机传感器数据(如加速度计和全球定位系统)预测当日暴饮事件(与低风险饮酒事件和非饮酒期相比)。我们测试了从饮酒开始的各种“预测距离”时间窗口(更近:1小时;更远:6小时)。我们还测试了各种分析时间窗口(即要分析的数据量),范围从饮酒开始前1至12小时,因为这决定了为计算模型需要存储在手机上的数据量。使用可解释人工智能来探索有助于预测暴饮事件的最具信息性的手机传感器特征之间的相互作用。
XGBoost模型在预测即将发生的当日暴饮事件方面表现最佳,周末准确率为95%,工作日准确率为94.3%(F分数分别为0.95和0.94)。该XGBoost模型在周末和工作日预测当日暴饮事件之前,分别需要在距饮酒开始3小时和6小时的预测距离处获取12小时和9小时的手机传感器数据。用于预测暴饮事件的最具信息性的手机传感器特征是时间(如一天中的时间)和基于全球定位系统的特征,如回转半径(出行指标)。关键特征(如一天中的时间和基于全球定位系统的特征)之间的相互作用有助于预测当日暴饮事件。
我们证明了智能手机传感器数据和机器学习在准确预测年轻人即将发生的(当日)暴饮事件方面的可行性和潜在用途。该预测模型提供了“机会窗口”,通过采用可解释人工智能,我们识别出了“关键贡献特征”,以便在暴饮事件发生前触发即时自适应干预,这有可能降低年轻人发生暴饮事件的可能性。
ClinicalTrials.gov NCT02918565;https://clinicaltrials.gov/ct2/show/NCT02918565 。