• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个扩展的临床 EEG 数据集,包含 15300 个自动标记的记录,用于病理学解码。

An extended clinical EEG dataset with 15,300 automatically labelled recordings for pathology decoding.

机构信息

Neuromedical AI Lab, Department of Neurosurgery, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Engelbergerstr. 21, 79106 Freiburg, Germany; BrainLinks-BrainTools, IMBIT (Institute for Machine-Brain Interfacing Technology), University of Freiburg, Georges-Köhler-Allee 201, 79110 Freiburg, Germany; Autonomous Intelligent Systems, Computer Science Department - University of Freiburg, Faculty of Engineering, University of Freiburg, Georges-Köhler-Allee 80, 79110 Freiburg, Germany.

Neuromedical AI Lab, Department of Neurosurgery, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Engelbergerstr. 21, 79106 Freiburg, Germany; BrainLinks-BrainTools, IMBIT (Institute for Machine-Brain Interfacing Technology), University of Freiburg, Georges-Köhler-Allee 201, 79110 Freiburg, Germany.

出版信息

Neuroimage Clin. 2023;39:103482. doi: 10.1016/j.nicl.2023.103482. Epub 2023 Jul 28.

DOI:10.1016/j.nicl.2023.103482
PMID:37544168
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10432245/
Abstract

Automated clinical EEG analysis using machine learning (ML) methods is a growing EEG research area. Previous studies on binary EEG pathology decoding have mainly used the Temple University Hospital (TUH) Abnormal EEG Corpus (TUAB) which contains approximately 3,000 manually labelled EEG recordings. To evaluate and eventually even improve the generalisation performance of machine learning methods for EEG pathology, decoding larger, publicly available datasets is required. A number of studies addressed the automatic labelling of large open-source datasets as an approach to create new datasets for EEG pathology decoding, but little is known about the extent to which training on larger, automatically labelled dataset affects decoding performances of established deep neural networks. In this study, we automatically created additional pathology labels for the Temple University Hospital (TUH) EEG Corpus (TUEG) based on the medical reports using a rule-based text classifier. We generated a dataset of 15,300 newly labelled recordings, which we call the TUH Abnormal Expansion EEG Corpus (TUABEX), and which is five times larger than the TUAB. Since the TUABEX contains more pathological (75%) than non-pathological (25%) recordings, we then selected a balanced subset of 8,879 recordings, the TUH Abnormal Expansion Balanced EEG Corpus (TUABEXB). To investigate how training on a larger, automatically labelled dataset affects the decoding performance of deep neural networks, we applied four established deep convolutional neural networks (ConvNets) to the task of pathological versus non-pathological classification and compared the performance of each architecture after training on different datasets. The results show that training on the automatically labelled TUABEXB dataset rather than training on the manually labelled TUAB dataset increases accuracies on TUABEXB and even for TUAB itself for some architectures. We argue that automatically labelling of large open-source datasets can be used to efficiently utilise the massive amount of EEG data stored in clinical archives. We make the proposed TUABEXB available open source and thus offer a new dataset for EEG machine learning research.

摘要

使用机器学习 (ML) 方法进行自动临床 EEG 分析是一个不断发展的 EEG 研究领域。之前关于二进制 EEG 病理解码的研究主要使用了包含大约 3000 个手动标记 EEG 记录的 Temple 大学医院 (TUH) 异常 EEG 语料库 (TUAB)。为了评估甚至最终提高 EEG 病理的机器学习方法的泛化性能,需要解码更大的、公开可用的数据集。一些研究解决了自动标记大型开源数据集的问题,将其作为创建 EEG 病理解码新数据集的一种方法,但对于在更大的自动标记数据集上训练对现有深度神经网络解码性能的影响程度知之甚少。在这项研究中,我们使用基于规则的文本分类器根据医疗报告自动为 Temple 大学医院 (TUH) EEG 语料库 (TUEG) 创建了额外的病理标签。我们生成了一个包含 15300 个新标记记录的数据集,我们称之为 Temple 大学医院异常扩展 EEG 语料库 (TUABEX),其大小是 TUAB 的五倍。由于 TUABEX 包含更多的病理记录 (75%) 而不是非病理记录 (25%),因此我们选择了 8879 个记录的平衡子集,即 Temple 大学医院异常扩展平衡 EEG 语料库 (TUABEXB)。为了研究在更大的自动标记数据集上训练如何影响深度神经网络的解码性能,我们将四个已建立的深度卷积神经网络 (ConvNets) 应用于病理与非病理分类任务,并比较了在不同数据集上训练后每个架构的性能。结果表明,在自动标记的 TUABEXB 数据集上进行训练而不是在手动标记的 TUAB 数据集上进行训练会提高某些架构在 TUABEXB 上甚至在 TUAB 本身的准确性。我们认为,自动标记大型开源数据集可以有效地利用存储在临床档案中的大量 EEG 数据。我们开源提供建议的 TUABEXB,并为 EEG 机器学习研究提供一个新的数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/24a10becafdf/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/a81daeeae9f9/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/71456a0be71b/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/635f0f1192de/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/79b382af4fee/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/4ed455721017/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/24a10becafdf/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/a81daeeae9f9/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/71456a0be71b/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/635f0f1192de/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/79b382af4fee/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/4ed455721017/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea8a/10432245/24a10becafdf/gr6.jpg

相似文献

1
An extended clinical EEG dataset with 15,300 automatically labelled recordings for pathology decoding.一个扩展的临床 EEG 数据集,包含 15300 个自动标记的记录,用于病理学解码。
Neuroimage Clin. 2023;39:103482. doi: 10.1016/j.nicl.2023.103482. Epub 2023 Jul 28.
2
Reaching the ceiling? Empirical scaling behaviour for deep EEG pathology classification.触及天花板?深度 EEG 病理分类的经验缩放行为。
Comput Biol Med. 2024 Aug;178:108681. doi: 10.1016/j.compbiomed.2024.108681. Epub 2024 Jun 7.
3
Machine-learning-based diagnostics of EEG pathology.基于机器学习的脑电图病理诊断。
Neuroimage. 2020 Oct 15;220:117021. doi: 10.1016/j.neuroimage.2020.117021. Epub 2020 Jun 10.
4
ConTraNet: A hybrid network for improving the classification of EEG and EMG signals with limited training data.ConTraNet:一种混合网络,用于在有限的训练数据下提高 EEG 和 EMG 信号的分类。
Comput Biol Med. 2024 Jan;168:107649. doi: 10.1016/j.compbiomed.2023.107649. Epub 2023 Nov 2.
5
The NMT Scalp EEG Dataset: An Open-Source Annotated Dataset of Healthy and Pathological EEG Recordings for Predictive Modeling.NMT头皮脑电图数据集:一个用于预测建模的健康和病理性脑电图记录的开源注释数据集。
Front Neurosci. 2022 Jan 5;15:755817. doi: 10.3389/fnins.2021.755817. eCollection 2021.
6
Deep learning with convolutional neural networks for EEG decoding and visualization.基于卷积神经网络的 EEG 解码和可视化深度学习。
Hum Brain Mapp. 2017 Nov;38(11):5391-5420. doi: 10.1002/hbm.23730. Epub 2017 Aug 7.
7
Automated EEG pathology detection based on different convolutional neural network models: Deep learning approach.基于不同卷积神经网络模型的自动 EEG 病理检测:深度学习方法。
Comput Biol Med. 2021 Jun;133:104434. doi: 10.1016/j.compbiomed.2021.104434. Epub 2021 Apr 25.
8
A multi-task and multi-channel convolutional neural network for semi-supervised neonatal artefact detection.用于半监督新生儿伪影检测的多任务多通道卷积神经网络。
J Neural Eng. 2023 Mar 14;20(2). doi: 10.1088/1741-2552/acbc4b.
9
A Channel-Projection Mixed-Scale Convolutional Neural Network for Motor Imagery EEG Decoding.通道投影混合尺度卷积神经网络在运动想象 EEG 解码中的应用。
IEEE Trans Neural Syst Rehabil Eng. 2019 Jun;27(6):1170-1180. doi: 10.1109/TNSRE.2019.2915621. Epub 2019 May 8.
10
Validating Deep Neural Networks for Online Decoding of Motor Imagery Movements from EEG Signals.验证深度神经网络用于从 EEG 信号中在线解码运动想象运动。
Sensors (Basel). 2019 Jan 8;19(1):210. doi: 10.3390/s19010210.

引用本文的文献

1
A comprehensive EEG dataset of laser-evoked potentials for pain research.用于疼痛研究的激光诱发电位综合脑电图数据集。
Sci Data. 2025 Sep 2;12(1):1536. doi: 10.1038/s41597-025-05900-1.
2
Brain age revisited: Investigating the state vs. trait hypotheses of EEG-derived brain-age dynamics with deep learning.重新审视脑龄:运用深度学习研究脑电图衍生脑龄动态的状态与特质假设
Imaging Neurosci (Camb). 2024 Jul 8;2. doi: 10.1162/imag_a_00210. eCollection 2024.
3
Automatic diagnostics of electroencephalography pathology based on multi-domain feature fusion.
基于多域特征融合的脑电图病理学自动诊断
PLoS One. 2025 May 5;20(5):e0310348. doi: 10.1371/journal.pone.0310348. eCollection 2025.
4
Automatic Detection of Abnormal EEG Signals Using WaveNet and LSTM.基于 WaveNet 和 LSTM 的异常 EEG 信号自动检测
Sensors (Basel). 2023 Jun 27;23(13):5960. doi: 10.3390/s23135960.