Suppr超能文献

多说话人原始和重建语音产生实时 MRI 视频及 3D 容积图像数据集。

A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images.

机构信息

Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA.

Department of Linguistics, Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, California, USA.

出版信息

Sci Data. 2021 Jul 20;8(1):187. doi: 10.1038/s41597-021-00976-x.

Abstract

Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is however limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. The imaging of the rapidly moving articulators and dynamic airway shaping during speech demands high spatio-temporal resolution and robust reconstruction methods. Further, while reconstructed images have been published, to-date there is no open dataset providing raw multi-coil RT-MRI data from an optimized speech production experimental setup. Such datasets could enable new and improved methods for dynamic image reconstruction, artifact correction, feature extraction, and direct extraction of linguistically-relevant biomarkers. The present dataset offers a unique corpus of 2D sagittal-view RT-MRI videos along with synchronized audio for 75 participants performing linguistically motivated speech tasks, alongside the corresponding public domain raw RT-MRI data. The dataset also includes 3D volumetric vocal tract MRI during sustained speech sounds and high-resolution static anatomical T2-weighted upper airway MRI for each participant.

摘要

实时磁共振成像(RT-MRI)在人类言语产生中的应用正在推动言语科学、语言学、仿生言语技术发展和临床应用的重大进展。然而,RT-MRI 易于访问,并且需要具有广泛访问权限的综合数据集,以促进众多领域的研究。快速运动的发音器官和言语期间动态气道成形的成像需要高时空分辨率和强大的重建方法。此外,虽然已经发布了重建图像,但迄今为止,没有提供来自优化言语产生实验设置的原始多通道 RT-MRI 数据的开放数据集。这样的数据集可以为动态图像重建、伪影校正、特征提取以及语言相关生物标志物的直接提取提供新的和改进的方法。本数据集提供了一个独特的语料库,其中包含 75 名参与者执行语言驱动的言语任务时的二维矢状面 RT-MRI 视频以及同步音频,以及相应的公共领域原始 RT-MRI 数据。该数据集还包括在持续言语声音期间的三维容积性声门 MRI 和每位参与者的高分辨率静态解剖 T2 加权上气道 MRI。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5f1/8292336/0b7ea7ef546d/41597_2021_976_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验