Iwamoto Hiroki, Nakano Saki, Tajima Ryotaro, Kiguchi Ryo, Yoshida Yuki, Kitanishi Yoshitake, Aoki Yasunori
Shionogi & Co., Ltd., Osaka, Japan.
Department of Psychiatry, Nippon Life Hospital, Osaka, Japan.
JMIR AI. 2024 Aug 2;3:e55840. doi: 10.2196/55840.
Work characteristics, such as teleworking rate, have been studied in relation to stress. However, the use of work-related data to improve a high-performance stress prediction model that suits an individual's lifestyle has not been evaluated.
This study aims to develop a novel, high-performance algorithm to predict an employee's stress among a group of employees with similar working characteristics.
This prospective observational study evaluated participants' responses to web‑based questionnaires, including attendance records and data collected using a wearable device. Data spanning 12 weeks (between January 17, 2022, and April 10, 2022) were collected from 194 Shionogi Group employees. Participants wore the Fitbit Charge 4 wearable device, which collected data on daily sleep, activity, and heart rate. Daily work shift data included details of working hours. Weekly questionnaire responses included the K6 questionnaire for depression/anxiety, a behavioral questionnaire, and the number of days lunch was missed. The proposed prediction model used a neighborhood cluster (N=20) with working-style characteristics similar to those of the prediction target person. Data from the previous week predicted stress levels the following week. Three models were compared by selecting appropriate training data: (1) single model, (2) proposed method 1, and (3) proposed method 2. Shapley Additive Explanations (SHAP) were calculated for the top 10 extracted features from the Extreme Gradient Boosting (XGBoost) model to evaluate the amount and contribution direction categorized by teleworking rates (mean): low: <0.2 (more than 4 days/week in office), middle: 0.2 to <0.6 (2 to 4 days/week in office), and high: ≥0.6 (less than 2 days/week in office).
Data from 190 participants were used, with a teleworking rate ranging from 0% to 79%. The area under the curve (AUC) of the proposed method 2 was 0.84 (true positive vs false positive: 0.77 vs 0.26). Among participants with low teleworking rates, most features extracted were related to sleep, followed by activity and work. Among participants with high teleworking rates, most features were related to activity, followed by sleep and work. SHAP analysis showed that for participants with high teleworking rates, skipping lunch, working more/less than scheduled, higher fluctuations in heart rate, and lower mean sleep duration contributed to stress. In participants with low teleworking rates, coming too early or late to work (before/after 9 AM), a higher/lower than mean heart rate, lower fluctuations in heart rate, and burning more/fewer calories than normal contributed to stress.
Forming a neighborhood cluster with similar working styles based on teleworking rates and using it as training data improved the prediction performance. The validity of the neighborhood cluster approach is indicated by differences in the contributing features and their contribution directions among teleworking levels.
UMIN UMIN000046394; https://www.umin.ac.jp/ctr/index.htm.
诸如远程工作率等工作特征已与压力相关联进行了研究。然而,尚未评估使用与工作相关的数据来改进适合个人生活方式的高性能压力预测模型。
本研究旨在开发一种新颖的高性能算法,以预测具有相似工作特征的一组员工中的个体压力。
这项前瞻性观察性研究评估了参与者对基于网络问卷的回答,包括出勤记录和使用可穿戴设备收集的数据。收集了194名盐野义集团员工在2022年1月17日至2022年4月10日期间12周的数据。参与者佩戴Fitbit Charge 4可穿戴设备,该设备收集每日睡眠、活动和心率数据。每日工作班次数据包括工作时间细节。每周问卷回答包括用于评估抑郁/焦虑的K6问卷、一份行为问卷以及错过午餐的天数。所提出的预测模型使用了一个与预测目标人员工作方式特征相似的邻域聚类(N = 20)。前一周的数据用于预测下一周的压力水平。通过选择合适的训练数据比较了三种模型:(1)单一模型,(2)所提出的方法1,以及(3)所提出的方法2。对极端梯度提升(XGBoost)模型提取的前10个特征计算了夏普利加法解释(SHAP),以评估按远程工作率(均值)分类的影响量和贡献方向:低:<0.2(每周在办公室工作超过4天),中:0.2至<0.6(每周在办公室工作2至4天),高:≥0.6(每周在办公室工作少于2天)。
使用了190名参与者的数据,远程工作率范围为0%至79%。所提出的方法2的曲线下面积(AUC)为0.84(真阳性与假阳性:0.77对0.26)。在远程工作率低的参与者中,提取的大多数特征与睡眠相关,其次是活动和工作。在远程工作率高的参与者中,大多数特征与活动相关,其次是睡眠和工作。SHAP分析表明,对于远程工作率高的参与者,错过午餐、工作时间多于/少于计划、心率波动较大以及平均睡眠时间较短会导致压力。在远程工作率低的参与者中,上班太早或太晚(上午9点之前/之后)、心率高于/低于均值、心率波动较小以及卡路里消耗比正常多/少会导致压力。
基于远程工作率形成具有相似工作方式的邻域聚类并将其用作训练数据可提高预测性能。远程工作水平之间影响特征及其贡献方向的差异表明了邻域聚类方法的有效性。
UMIN UMIN000046394;https://www.umin.ac.jp/ctr/index.htm。