Department of Neurology, Beth Israel Deaconess Medical Center, Boston, Massachusetts, USA.
Harvard Medical School, Boston, Massachusetts, USA.
Ann Neurol. 2020 Sep;88(3):588-595. doi: 10.1002/ana.25812. Epub 2020 Jul 9.
There are no validated methods for predicting the timing of seizures. Using machine learning, we sought to forecast 24-hour risk of self-reported seizure from e-diaries.
Data from 5,419 patients on SeizureTracker.com (including seizure count, type, and duration) were split into training (3,806 patients/1,665,215 patient-days) and testing (1,613 patients/549,588 patient-days) sets with no overlapping patients. An artificial intelligence (AI) program, consisting of recurrent networks followed by a multilayer perceptron ("deep learning" model), was trained to produce risk forecasts. Forecasts were made from a sliding window of 3-month diary history for each day of each patient's diary. After training, the model parameters were held constant and the testing set was scored. A rate-matched random (RMR) forecast was compared to the AI. Comparisons were made using the area under the receiver operating characteristic curve (AUC), a measure of binary discrimination performance, and the Brier score, a measure of forecast calibration. The Brier skill score (BSS) measured the improvement of the AI Brier score compared to the benchmark RMR Brier score. Confidence intervals (CIs) on performance statistics were obtained via bootstrapping.
The AUC was 0.86 (95% CI = 0.85-0.88) for AI and 0.83 (95% CI = 0.81-0.85) for RMR, favoring AI (p < 0.001). Overall (all patients combined), BSS was 0.27 (95% CI = 0.23-0.31), also favoring AI (p < 0.001).
The AI produced a valid forecast superior to a chance forecaster, and provided meaningful forecasts in the majority of patients. Future studies will be needed to quantify the clinical value of these forecasts for patients. ANN NEUROL 2020;88:588-595.
目前尚无验证的方法可预测癫痫发作的时间。我们采用机器学习,旨在通过电子日记预测患者 24 小时自报癫痫发作风险。
将来自 SeizureTracker.com 的 5419 名患者的数据(包括发作次数、类型和持续时间)分为训练集(3806 名患者/1665215 患者日)和测试集(1613 名患者/549588 患者日),两组患者无重叠。人工智能(AI)程序由递归网络和多层感知机(“深度学习”模型)组成,用于生成风险预测。通过每位患者日记中 3 个月的日记记录滑动窗口来进行预测。训练后,固定模型参数并对测试集进行评分。与 AI 进行比较的是速率匹配随机(RMR)预测。使用受试者工作特征曲线下面积(AUC),这是衡量二分类判别性能的指标,以及预测校准的Brier 评分来进行比较。Brier 技能评分(BSS)衡量 AI 的 Brier 评分与基准 RMR 的 Brier 评分相比的改进。通过自举法获得性能统计数据的置信区间(CI)。
AI 的 AUC 为 0.86(95%CI=0.85-0.88),RMR 的 AUC 为 0.83(95%CI=0.81-0.85),AI 更优(p<0.001)。总体而言(所有患者),BSS 为 0.27(95%CI=0.23-0.31),AI 也更优(p<0.001)。
AI 生成了优于机会预测器的有效预测结果,并且为大多数患者提供了有意义的预测。未来的研究需要量化这些预测对患者的临床价值。