Suppr超能文献

基于MRI数据的对比学习方法用于评估舌癌患者的语音清晰度

Contrastive Learning Approach for Assessment of Phonological Precision in Patients with Tongue Cancer Using MRI Data.

作者信息

Arias-Vergara Tomás, Pérez-Toro Paula Andrea, Liu Xiaofeng, Xing Fangxu, Stone Maureen, Zhuo Jiachen, Prince Jerry L, Schuster Maria, Nöth Elmar, Woo Jonghye, Maier Andreas

机构信息

Pattern Recognition Lab. Friedrich-Alexander University, Erlangen, Germany.

Massachusetts General Hospital - Harvard Medical School, Boston, MA, USA.

出版信息

Interspeech. 2024 Sep;2024:927-931. doi: 10.21437/interspeech.2024-2236.

Abstract

Magnetic Resonance Imaging (MRI) allows analyzing speech production by capturing high-resolution images of the dynamic processes in the vocal tract. In clinical applications, combining MRI with synchronized speech recordings leads to improved patient outcomes, especially if a phonological-based approach is used for assessment. However, when audio signals are unavailable, the recognition accuracy of sounds is decreased when using only MRI data. We propose a contrastive learning approach to improve the detection of phonological classes from MRI data when acoustic signals are not available at inference time. We demonstrate that frame-wise recognition of phonological classes improves from an f1 of 0.74 to 0.85 when the contrastive loss approach is implemented. Furthermore, we show the utility of our approach in the clinical application of using such phonological classes to assess speech disorders in patients with tongue cancer, yielding promising results in the recognition task.

摘要

磁共振成像(MRI)通过捕获声道动态过程的高分辨率图像,能够分析言语产生过程。在临床应用中,将MRI与同步语音记录相结合可改善患者预后,特别是在使用基于音系学的方法进行评估时。然而,当没有音频信号时,仅使用MRI数据时声音的识别准确率会降低。我们提出一种对比学习方法,以在推理时没有声学信号的情况下,提高从MRI数据中检测音系类别的能力。我们证明,当实施对比损失方法时,音系类别的逐帧识别f1值从0.74提高到了0.85。此外,我们展示了我们的方法在临床应用中的效用,即使用此类音系类别来评估舌癌患者的言语障碍,在识别任务中取得了有希望的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b6f/11671147/d99fbc2fb052/nihms-2002850-f0001.jpg

相似文献

3
3D dynamic MRI of the vocal tract during natural speech.自然言语状态下声道的 3D 动态 MRI
Magn Reson Med. 2019 Mar;81(3):1511-1520. doi: 10.1002/mrm.27570. Epub 2018 Nov 3.
9
High-frame-rate full-vocal-tract 3D dynamic speech imaging.高帧率全声道三维动态语音成像
Magn Reson Med. 2017 Apr;77(4):1619-1629. doi: 10.1002/mrm.26248. Epub 2016 Apr 21.
10
Improved imaging of lingual articulation using real-time multislice MRI.使用实时多层 MRI 改善舌位成像。
J Magn Reson Imaging. 2012 Apr;35(4):943-8. doi: 10.1002/jmri.23510. Epub 2011 Nov 29.

本文引用的文献

4
Speech production after glossectomy: methodological aspects.舌切除术后的言语产生:方法学方面
Clin Linguist Phon. 2014 Apr;28(4):241-56. doi: 10.3109/02699206.2013.802015. Epub 2013 Jul 9.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验