MLSPred-bench：将脑电图（EEG）数据集转化为适用于机器学习的癫痫发作预测基准。

MLSPred-bench: Transforming electroencephalography (EEG) datasets into machine learning-ready epileptic seizure prediction benchmarks.

作者信息

Mohammad Umair, Saeed Fahad

机构信息

Electrical Computer and Biomedical Engineering Department, Union College, 807 Union St, Schenectady, NY 12308, US.

Knight Foundation School of Computing and Information Sciences, Florida International University, 11200 SW 8th St, Miami, FL 33199, USA.

出版信息

MethodsX. 2025 Aug 22;15:103574. doi: 10.1016/j.mex.2025.103574. eCollection 2025 Dec.

DOI:10.1016/j.mex.2025.103574

PMID:40949826

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12423417/

Abstract

Predicting epileptic seizures is a significantly more challenging task compared to seizure detection. However, most publicly available electroencephalography (EEG) datasets are geared towards detection because the ictal phase (main symptomatic period) is annotated. In contrast, prediction requires the availability of annotated preictal and interictal phases. To this end, we designed and developed a method called that can be used for converting any EEG big data annotated for detection into ML-ready data suitable for prediction. We apply our methods to the existing EEG data corpus to generate 12 ML-ready benchmarks comprising data for training, validating, and testing seizure prediction models. Our strategy uses different variations of seizure prediction horizon (SPH) and the seizure occurrence period (SOP) to produce >150GB of ML-ready data. To illustrate the usefulness of the generated data, we technically validate all the benchmarks using multiple machine learning (ML) and deep learning (DL) models. We hope that the generated benchmarking data will be utilized by various computational groups for their seizure prediction model development. The work can be summarized as follows:1.Extract short preictal and interictal segments from long-duration annotated EEG montages.2.Generate a comprehensive list of ML-ready benchmarks with varying SPH and SOP.3.Technically validate the generated data with multiple ML and DL models with up-to 88.73 % validation accuracy4.Opensource code and related materials are available at https://github.com/pcdslab/MLSPred-Bench.

摘要

与癫痫发作检测相比，预测癫痫发作是一项更具挑战性的任务。然而，大多数公开可用的脑电图（EEG）数据集都针对检测，因为发作期（主要症状期）已被标注。相比之下，预测需要有标注的发作前期和发作间期阶段的数据。为此，我们设计并开发了一种名为的方法，可用于将任何为检测而标注的EEG大数据转换为适用于预测的机器学习就绪数据。我们将我们的方法应用于现有的EEG数据语料库，以生成12个机器学习就绪基准，包括用于训练、验证和测试癫痫发作预测模型的数据。我们的策略使用癫痫发作预测时域（SPH）和癫痫发作发生期（SOP）的不同变体来生成超过150GB的机器学习就绪数据。为了说明生成数据的有用性，我们使用多个机器学习（ML）和深度学习（DL）模型对所有基准进行了技术验证。我们希望生成的基准数据能被各个计算团队用于他们的癫痫发作预测模型开发。这项工作可总结如下：1. 从长时间标注的EEG蒙太奇中提取短的发作前期和发作间期片段。2. 生成具有不同SPH和SOP的全面的机器学习就绪基准列表。3. 使用多个ML和DL模型对生成的数据进行技术验证，验证准确率高达88.73%。4. 开源代码和相关材料可在https://github.com/pcdslab/MLSPred-Bench获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/486c/12423417/25fa7e981bc2/ga1.jpg

相似文献

MLSPred-bench: Transforming electroencephalography (EEG) datasets into machine learning-ready epileptic seizure prediction benchmarks.MLSPred-bench：将脑电图（EEG）数据集转化为适用于机器学习的癫痫发作预测基准。

MethodsX. 2025 Aug 22;15:103574. doi: 10.1016/j.mex.2025.103574. eCollection 2025 Dec.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

Idiopathic (Genetic) Generalized Epilepsy特发性（遗传性）全身性癫痫

EEG-Based Epileptic Seizure Detection via Machine/Deep Learning Approaches: A Systematic Review.基于脑电图的癫痫发作检测的机器学习/深度学习方法：系统综述。

Comput Intell Neurosci. 2022 Jun 17;2022:6486570. doi: 10.1155/2022/6486570. eCollection 2022.

Prognosis of adults and children following a first unprovoked seizure.首次无诱因发作后成人和儿童的预后。

Cochrane Database Syst Rev. 2023 Jan 23;1(1):CD013847. doi: 10.1002/14651858.CD013847.pub2.

Machine and deep learning methods for epileptic seizure recognition using EEG data: A systematic review.使用脑电图（EEG）数据进行癫痫发作识别的机器学习和深度学习方法：系统综述。

Brain Res. 2025 Oct 1;1864:149797. doi: 10.1016/j.brainres.2025.149797. Epub 2025 Jun 23.

Amplitude-integrated electroencephalography compared with conventional video-electroencephalography for detection of neonatal seizures.振幅整合脑电图与传统视频脑电图在新生儿惊厥检测中的比较。

Cochrane Database Syst Rev. 2025 Aug 11;8(8):CD013546. doi: 10.1002/14651858.CD013546.pub2.

Surgery for epilepsy.癫痫手术

Cochrane Database Syst Rev. 2015 Jul 1(7):CD010541. doi: 10.1002/14651858.CD010541.pub2.

Lamotrigine versus carbamazepine monotherapy for epilepsy: an individual participant data review.拉莫三嗪与卡马西平单药治疗癫痫的疗效比较：个体参与者数据回顾

Cochrane Database Syst Rev. 2018 Jun 28;6(6):CD001031. doi: 10.1002/14651858.CD001031.pub4.

A novel finite spectral entropy: Gated term memory unit recursive network integrated with Ladybug Beetle Optimization algorithm for epileptic seizure detection.一种新型有限谱熵：结合瓢虫优化算法的门控项记忆单元递归网络用于癫痫发作检测。

Int J Numer Method Biomed Eng. 2023 Dec;39(12):e3769. doi: 10.1002/cnm.3769. Epub 2023 Sep 23.

本文引用的文献

Utilizing Pretrained Vision Transformers and Large Language Models for Epileptic Seizure Prediction.利用预训练视觉变换器和大语言模型进行癫痫发作预测。

2025 8th Int Conf Data Sci Mach Learn Appl (2025). 2025 Feb;2025:132-137. doi: 10.1109/cdma61895.2025.00028. Epub 2025 Mar 7.

Lightweight Transformer exhibits comparable performance to LLMs for Seizure Prediction: A case for light-weight models for EEG data.轻量级Transformer在癫痫发作预测方面表现出与大型语言模型相当的性能：关于脑电图数据轻量级模型的一个实例

Proc IEEE Int Conf Big Data. 2024 Dec;2024:4941-4945. doi: 10.1109/bigdata62323.2024.10825319.

Synchronization-based graph spatio-temporal attention network for seizure prediction.用于癫痫发作预测的基于同步的图时空注意力网络。

Sci Rep. 2025 Feb 3;15(1):4080. doi: 10.1038/s41598-025-88492-5.

Comparison between epileptic seizure prediction and forecasting based on machine learning.基于机器学习的癫痫发作预测与预报的比较。

Sci Rep. 2024 Mar 7;14(1):5653. doi: 10.1038/s41598-024-56019-z.

EEG epilepsy seizure prediction: the post-processing stage as a chronology.脑电图癫痫发作预测：后处理阶段作为一个时间顺序。

Sci Rep. 2024 Jan 3;14(1):407. doi: 10.1038/s41598-023-50609-z.

EEG datasets for seizure detection and prediction- A review.用于癫痫检测和预测的 EEG 数据集——综述。

Epilepsia Open. 2023 Jun;8(2):252-267. doi: 10.1002/epi4.12704. Epub 2023 Feb 16.

Power efficient refined seizure prediction algorithm based on an enhanced benchmarking.基于增强基准测试的高能效精细化癫痫发作预测算法。

Sci Rep. 2021 Dec 6;11(1):23498. doi: 10.1038/s41598-021-02798-8.

Chin J Traumatol. 2022 Sep;25(5):272-276. doi: 10.1016/j.cjtee.2021.10.003. Epub 2021 Oct 15.

Energy-Efficient Neural Network for Epileptic Seizure Prediction.用于癫痫发作预测的节能神经网络

IEEE Trans Biomed Eng. 2022 Jan;69(1):401-411. doi: 10.1109/TBME.2021.3095848. Epub 2021 Dec 23.

Efficient Epileptic Seizure Prediction Based on Deep Learning.基于深度学习的高效癫痫发作预测。

IEEE Trans Biomed Circuits Syst. 2019 Oct;13(5):804-813. doi: 10.1109/TBCAS.2019.2929053. Epub 2019 Jul 17.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

MLSPred-bench：将脑电图（EEG）数据集转化为适用于机器学习的癫痫发作预测基准。

MLSPred-bench: Transforming electroencephalography (EEG) datasets into machine learning-ready epileptic seizure prediction benchmarks.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献