Faust Louis, Cui Jie, Knepper Camille, Nasseri Mona, Worrell Gregory, Brinkmann Benjamin H
Robert D. and Patricia E. Kern Center for the Science of Health Care Delivery, Mayo Clinic, Rochester, MN 55905, USA.
Brain Neurology and Engineering Laboratory, Department of Neurology, Mayo Clinic, Rochester, MN 55905, USA.
Sensors (Basel). 2025 Sep 6;25(17):5562. doi: 10.3390/s25175562.
: To evaluate the feasibility and effectiveness of wrist-worn wearable devices combined with machine learning (ML) approaches for detecting a diverse array of seizure types beyond generalized tonic-clonic (GTC), including focal, generalized, and subclinical seizures. : Twenty-eight patients undergoing inpatient video-EEG monitoring at Mayo Clinic were concurrently monitored using Empatica E4 wrist-worn devices. These devices captured accelerometry, blood volume pulse, electrodermal activity, skin temperature, and heart rate. Seizures were annotated by neurologists. The data were preprocessed to experiment with various segment lengths (10 s and 60 s) and multiple feature sets. Three ML strategies, XGBoost, deep learning models (LSTM, CNN, Transformer), and ROCKET, were evaluated using leave-one-patient-out cross-validation. Performance was assessed using area under the receiver operating characteristic curve (AUROC), seizure-wise recall (SW-Recall), and false alarms per hour (FA/h). : Detection performance varied by seizure type and model. GTC seizures were detected most reliably (AUROC = 0.86, SW-Recall = 0.81, FA/h = 3.03). Hyperkinetic and tonic seizures showed high SW-Recall but also high FA/h. Subclinical and aware-dyscognitive seizures exhibited the lowest SW-Recall and highest FA/h. MultiROCKET and XGBoost performed best overall, though no single model was optimal for all seizure types. Longer segments (60 s) generally reduced FA/h. Feature set effectiveness varied, with multi-biosignal sets improving performance across seizure types. : Wrist-worn wearables combined with ML can extend seizure detection beyond GTC seizures, though performance remains limited for non-motor types. Optimizing model selection, feature sets, and segment lengths, and minimizing false alarms, are key to clinical utility and real-world adoption.
评估腕戴式可穿戴设备结合机器学习(ML)方法检测除全身强直阵挛性(GTC)发作以外的多种发作类型的可行性和有效性,这些发作类型包括局灶性、全身性和亚临床发作。28名在梅奥诊所接受住院视频脑电图监测的患者同时使用Empatica E4腕戴式设备进行监测。这些设备可采集加速度计数据、血容量脉搏、皮肤电活动、皮肤温度和心率。发作由神经科医生进行标注。对数据进行预处理,以试验不同的片段长度(10秒和60秒)和多个特征集。使用留一患者交叉验证评估三种ML策略,即XGBoost、深度学习模型(长短期记忆网络、卷积神经网络、Transformer)和ROCKET。使用受试者操作特征曲线下面积(AUROC)、逐发作召回率(SW-Recall)和每小时误报率(FA/h)评估性能。检测性能因发作类型和模型而异。GTC发作检测最为可靠(AUROC = 0.86,SW-Recall = 0.81,FA/h = 3.03)。运动过多和强直发作显示出较高的SW-Recall,但FA/h也较高。亚临床发作和认知障碍性发作的SW-Recall最低,FA/h最高。总体而言,MultiROCKET和XGBoost表现最佳,不过没有单一模型对所有发作类型都是最优的。较长的片段(60秒)通常会降低FA/h。特征集的有效性各不相同,多生物信号集可提高各类发作的检测性能。腕戴式可穿戴设备结合ML能够将发作检测扩展到GTC发作以外,不过对于非运动性发作类型,性能仍然有限。优化模型选择、特征集和片段长度以及最大限度减少误报,是实现临床应用和实际应用的关键。