• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个用于从实时磁共振图像测量声道形状的开源工具箱。

An open-source toolbox for measuring vocal tract shape from real-time magnetic resonance images.

机构信息

Department of Psychology, Edge Hill University, Ormskirk, UK.

Department of Speech Hearing and Phonetic Sciences, University College London, London, UK.

出版信息

Behav Res Methods. 2024 Mar;56(3):2623-2635. doi: 10.3758/s13428-023-02171-9. Epub 2023 Jul 28.

DOI:10.3758/s13428-023-02171-9
PMID:37507650
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10990993/
Abstract

Real-time magnetic resonance imaging (rtMRI) is a technique that provides high-contrast videographic data of human anatomy in motion. Applied to the vocal tract, it is a powerful method for capturing the dynamics of speech and other vocal behaviours by imaging structures internal to the mouth and throat. These images provide a means of studying the physiological basis for speech, singing, expressions of emotion, and swallowing that are otherwise not accessible for external observation. However, taking quantitative measurements from these images is notoriously difficult. We introduce a signal processing pipeline that produces outlines of the vocal tract from the lips to the larynx as a quantification of the dynamic morphology of the vocal tract. Our approach performs simple tissue classification, but constrained to a researcher-specified region of interest. This combination facilitates feature extraction while retaining the domain-specific expertise of a human analyst. We demonstrate that this pipeline generalises well across datasets covering behaviours such as speech, vocal size exaggeration, laughter, and whistling, as well as producing reliable outcomes across analysts, particularly among users with domain-specific expertise. With this article, we make this pipeline available for immediate use by the research community, and further suggest that it may contribute to the continued development of fully automated methods based on deep learning algorithms.

摘要

实时磁共振成像(rtMRI)是一种提供人体运动中高对比度视频数据的技术。将其应用于声道,它是一种通过对口腔和喉咙内部结构成像来捕捉言语和其他声音行为动态的强大方法。这些图像提供了一种研究言语、歌唱、情感表达和吞咽等生理基础的方法,而这些方法是无法通过外部观察获得的。然而,从这些图像中进行定量测量是非常困难的。我们引入了一种信号处理管道,该管道从嘴唇到喉咙生成声道的轮廓,作为声道动态形态的定量测量。我们的方法执行简单的组织分类,但仅限于研究人员指定的感兴趣区域。这种组合促进了特征提取,同时保留了人类分析师的特定领域专业知识。我们证明,该管道可以很好地适用于涵盖言语、声音夸张、笑声和口哨等行为的数据集,并且在分析师之间,特别是在具有特定领域专业知识的用户之间,能够产生可靠的结果。通过本文,我们将这个管道立即提供给研究社区使用,并进一步建议它可能有助于基于深度学习算法的全自动方法的持续发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/e65f26aec419/13428_2023_2171_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/1244a046a669/13428_2023_2171_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/dbc25abf7d4c/13428_2023_2171_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/da4af20ac3f3/13428_2023_2171_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/7ca0ef26f127/13428_2023_2171_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/ad6551cdc734/13428_2023_2171_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/15f0686af5b6/13428_2023_2171_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/ed4af32af3ba/13428_2023_2171_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/e9a95e8d894a/13428_2023_2171_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/e65f26aec419/13428_2023_2171_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/1244a046a669/13428_2023_2171_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/dbc25abf7d4c/13428_2023_2171_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/da4af20ac3f3/13428_2023_2171_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/7ca0ef26f127/13428_2023_2171_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/ad6551cdc734/13428_2023_2171_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/15f0686af5b6/13428_2023_2171_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/ed4af32af3ba/13428_2023_2171_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/e9a95e8d894a/13428_2023_2171_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/699f/10990993/e65f26aec419/13428_2023_2171_Fig9_HTML.jpg

相似文献

1
An open-source toolbox for measuring vocal tract shape from real-time magnetic resonance images.一个用于从实时磁共振图像测量声道形状的开源工具箱。
Behav Res Methods. 2024 Mar;56(3):2623-2635. doi: 10.3758/s13428-023-02171-9. Epub 2023 Jul 28.
2
Determining the Relevant Criteria for Three-dimensional Vocal Tract Characterization.确定三维声道特征的相关标准。
J Voice. 2018 Mar;32(2):130-142. doi: 10.1016/j.jvoice.2017.04.001. Epub 2017 Jun 21.
3
Region segmentation in the frequency domain applied to upper airway real-time magnetic resonance images.应用于上呼吸道实时磁共振图像的频域区域分割
IEEE Trans Med Imaging. 2009 Mar;28(3):323-38. doi: 10.1109/TMI.2008.928920.
4
Weight-bearing MR imaging as an option in the study of gravitational effects on the vocal tract of untrained subjects in singing phonation.负重磁共振成像作为研究重力对未经训练的歌唱发声受试者声道影响的一种选择。
PLoS One. 2014 Nov 7;9(11):e112405. doi: 10.1371/journal.pone.0112405. eCollection 2014.
5
One-second MRI of a three-dimensional vocal tract to measure dynamic articulator modifications.用于测量动态发音器官变化的三维声道一秒磁共振成像。
J Magn Reson Imaging. 2017 Jul;46(1):94-101. doi: 10.1002/jmri.25561. Epub 2016 Dec 9.
6
Overtone focusing in biphonic tuvan throat singing.双音图瓦喉音的泛音聚焦。
Elife. 2020 Feb 17;9:e50476. doi: 10.7554/eLife.50476.
7
Acceleration of MRI of the vocal tract provides additional insight into articulator modifications.声道磁共振加速可提供对发音器官变化的更多见解。
J Magn Reson Imaging. 2015 Oct;42(4):925-35. doi: 10.1002/jmri.24857. Epub 2015 Feb 3.
8
The Vocal Tract Organ: A New Musical Instrument Using 3-D Printed Vocal Tracts.声道器官:一种使用3D打印声道的新型乐器。
J Voice. 2018 Nov;32(6):660-667. doi: 10.1016/j.jvoice.2017.09.014. Epub 2017 Oct 31.
9
Morphologic differences in the vocal tract resonance cavities of voice professionals: an MRI-based study.嗓音专业人员声道共鸣腔的形态学差异:一项基于 MRI 的研究。
J Voice. 2013 Mar;27(2):132-40. doi: 10.1016/j.jvoice.2012.11.010. Epub 2013 Feb 11.
10
MR imaging of the vocal tract during vowel production.元音发音过程中声道的磁共振成像。
J Magn Reson Imaging. 1991 Jan-Feb;1(1):71-6. doi: 10.1002/jmri.1880010109.

引用本文的文献

1
Real-time magnetic resonance imaging reveals distinct vocal tract configurations during spontaneous and volitional laughter.实时磁共振成像揭示了自发性和随意性笑声期间不同的声道构型。
Philos Trans R Soc Lond B Biol Sci. 2022 Nov 7;377(1863):20210511. doi: 10.1098/rstb.2021.0511. Epub 2022 Sep 21.

本文引用的文献

1
Dark tone quality and vocal tract shaping in soprano song production: Insights from real-time MRI.女高音歌唱发声中的暗色调音质与声道塑造:来自实时MRI的见解
JASA Express Lett. 2021 Jul;1(7):075202. doi: 10.1121/10.0005109. Epub 2021 Jul 9.
2
Speech Movement Variability in People Who Stutter: A Vocal Tract Magnetic Resonance Imaging Study.口吃者言语运动的可变性:一项磁共振成像研究。
J Speech Lang Hear Res. 2021 Jul 16;64(7):2438-2452. doi: 10.1044/2021_JSLHR-20-00507. Epub 2021 Jun 22.
3
Deep-learning-based segmentation of the vocal tract and articulators in real-time magnetic resonance images of speech.
基于深度学习的语音实时磁共振图像中声道和发音器官的分割。
Comput Methods Programs Biomed. 2021 Jan;198:105814. doi: 10.1016/j.cmpb.2020.105814. Epub 2020 Oct 26.
4
Imaging the breastfeeding swallow: Pilot study utilizing real-time MRI.对母乳喂养吞咽动作的成像:利用实时磁共振成像的初步研究。
Laryngoscope Investig Otolaryngol. 2020 May 20;5(3):572-579. doi: 10.1002/lio2.397. eCollection 2020 Jun.
5
Whistling shares a common tongue with speech: bioacoustics from real-time MRI of the human vocal tract.语音与口哨共享同一语言:来自实时 MRI 的人类声道的生物声学。
Proc Biol Sci. 2019 Sep 25;286(1911):20191116. doi: 10.1098/rspb.2019.1116.
6
The impact of B1+ correction on MP2RAGE cortical T and apparent cortical thickness at 7T.B1+ 校正对 7T 下 MP2RAGE 皮质 T1 值和表观皮质厚度的影响。
Hum Brain Mapp. 2018 Jun;39(6):2412-2425. doi: 10.1002/hbm.24011. Epub 2018 Feb 18.
7
Functional brain outcomes of L2 speech learning emerge during sensorimotor transformation.第二语言语音学习的功能大脑结果出现在感觉运动转换期间。
Neuroimage. 2017 Oct 1;159:18-31. doi: 10.1016/j.neuroimage.2017.06.053. Epub 2017 Jun 29.
8
High-frame-rate full-vocal-tract 3D dynamic speech imaging.高帧率全声道三维动态语音成像
Magn Reson Med. 2017 Apr;77(4):1619-1629. doi: 10.1002/mrm.26248. Epub 2016 Apr 21.
9
Real-time magnetic resonance imaging and electromagnetic articulography database for speech production research (TC).用于言语产生研究的实时磁共振成像和电磁关节造影数据库(TC)。
J Acoust Soc Am. 2014 Sep;136(3):1307. doi: 10.1121/1.4890284.
10
High-resolution dynamic speech imaging with joint low-rank and sparsity constraints.具有联合低秩和稀疏约束的高分辨率动态语音成像
Magn Reson Med. 2015 May;73(5):1820-32. doi: 10.1002/mrm.25302. Epub 2014 Jun 9.