VocalMind: A Stereotactic EEG Dataset for Vocalized, Mimed, and Imagined Speech in Tonal Language.

Authors

He Tianyu, Wei Mingyi, Wang Ruicong, Wang Renzhi, Du Shiwei, Cai Siqi, Tao Wei, Li Haizhou

Affiliations

School of Data Science, Shenzhen Research Institute of Big Data, The Chinese University of Hong Kong, Shenzhen, Guangdong, 518172, P. R. China.

Department of Neurosurgery, South China Hospital, Medical School, Shenzhen University, Shenzhen, 518116, P. R. China.

Publication information

Sci Data. 2025 Apr 19;12(1):657. doi: 10.1038/s41597-025-04741-2.

DOI:10.1038/s41597-025-04741-2

PMID:40253415

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12009324/

Abstract

Speech BCIs based on implanted electrodes hold significant promise for enhancing spoken communication through high temporal resolution and invasive neural sensing. Despite this potential, acquiring such data is challenging because of its invasive nature, and publicly available datasets, particularly for tonal languages, are limited. In this study, we introduce VocalMind, a stereotactic electroencephalography (sEEG) dataset focused on Mandarin Chinese, a tonal language. The dataset includes sEEG-speech parallel recordings from three distinct speech modes, namely vocalized speech, mimed speech, and imagined speech, at both word and sentence levels, totaling over one hour of intracranial neural recordings related to speech production. This paper also presents a baseline model that serves as a reference for future studies and, at the same time, verifies the integrity of the dataset. The diversity of tasks and the substantial data volume provide a valuable resource for developing advanced algorithms for speech decoding, thereby advancing BCI research for spoken communication.

Similar articles

Chisco: An EEG-based BCI dataset for decoding of imagined speech.

Sci Data. 2024 Nov 21;11(1):1265. doi: 10.1038/s41597-024-04114-1.

Decoding Neural Correlation of Language-Specific Imagined Speech using EEG Signals.

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:1977-1980. doi: 10.1109/EMBC48229.2022.9871721.

A brain-to-text framework for decoding natural tonal sentences.

Cell Rep. 2024 Nov 26;43(11):114924. doi: 10.1016/j.celrep.2024.114924. Epub 2024 Oct 31.

Speech decoding from stereo-electroencephalography (sEEG) signals using advanced deep learning methods.

J Neural Eng. 2024 Jun 27;21(3). doi: 10.1088/1741-2552/ad593a.

Decoding articulatory and phonetic components of naturalistic continuous speech from the distributed language network.

J Neural Eng. 2023 Aug 14;20(4). doi: 10.1088/1741-2552/ace9fb.

EEG-based Classification of Imaginary Mandarin Tones.

Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:3889-3892. doi: 10.1109/EMBC44109.2020.9176608.

Decoding and synthesizing tonal language speech from brain activity.

Sci Adv. 2023 Jun 9;9(23):eadh0478. doi: 10.1126/sciadv.adh0478.

Resting state EEG assisted imagined vowel phonemes recognition by native and non-native speakers using brain connectivity measures.

Phys Eng Sci Med. 2024 Sep;47(3):939-954. doi: 10.1007/s13246-024-01417-w. Epub 2024 Apr 22.

Speech decoding from a small set of spatially segregated minimally invasive intracranial EEG electrodes with a compact and interpretable neural network.

J Neural Eng. 2022 Nov 24;19(6). doi: 10.1088/1741-2552/aca1e1.

