• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Prosodic Preferences of Surface Electromyography-based Subvocal Speech for People With Laryngectomy.基于表面肌电图的喉切除患者默读语音的韵律偏好
J Voice. 2024 Dec 5. doi: 10.1016/j.jvoice.2024.10.024.
2
Surface Electromyography-Based Recognition, Synthesis, and Perception of Prosodic Subvocal Speech.基于表面肌电图的韵律性默读语音识别、合成与感知
J Speech Lang Hear Res. 2021 Jun 18;64(6S):2134-2153. doi: 10.1044/2021_JSLHR-20-00257. Epub 2021 May 12.
3
Electromyographic control of a hands-free electrolarynx using neck strap muscles.利用颈部束带肌肉对免提式电子喉进行肌电图控制。
J Commun Disord. 2009 May-Jun;42(3):211-25. doi: 10.1016/j.jcomdis.2008.12.002. Epub 2009 Jan 19.
4
The Effect of Clear Speech on Cantonese Alaryngeal Speakers' Intelligibility.清晰语音对广东失音者可懂度的影响。
Folia Phoniatr Logop. 2022;74(2):103-111. doi: 10.1159/000517676. Epub 2021 Jul 30.
5
Neck and face surface electromyography for prosthetic voice control after total laryngectomy.全喉切除术后用于假体语音控制的颈部和面部表面肌电图
IEEE Trans Neural Syst Rehabil Eng. 2009 Apr;17(2):146-55. doi: 10.1109/TNSRE.2009.2017805. Epub 2009 Mar 16.
6
Long-term average spectral characteristics of Cantonese alaryngeal speech.粤语无喉语音的长期平均频谱特征。
Auris Nasus Larynx. 2009 Oct;36(5):571-7. doi: 10.1016/j.anl.2008.12.005. Epub 2009 Mar 3.
7
Silent Speech Recognition as an Alternative Communication Device for Persons with Laryngectomy.无声语音识别作为喉切除患者的替代交流设备
IEEE/ACM Trans Audio Speech Lang Process. 2017 Dec;25(12):2386-2398. doi: 10.1109/TASLP.2017.2740000. Epub 2017 Nov 28.
8
The Effects of Modulating Fundamental Frequency and Speech Rate on the Intelligibility, Communication Efficiency, and Perceived Naturalness of Synthetic Speech.调节基频和语速对合成语音的可懂度、通信效率及感知自然度的影响
Am J Speech Lang Pathol. 2019 Jul 15;28(2S):875-886. doi: 10.1044/2019_AJSLP-MSC18-18-0052.
9
Electrolarynx in voice rehabilitation.用于嗓音康复的电子喉
Auris Nasus Larynx. 2007 Sep;34(3):327-32. doi: 10.1016/j.anl.2006.11.010. Epub 2007 Jan 18.
10
Listener impressions of alaryngeal communication modalities.听者对喉切除后交流方式的印象。
Int J Speech Lang Pathol. 2021 Oct;23(5):540-547. doi: 10.1080/17549507.2020.1849400. Epub 2021 Jan 27.

本文引用的文献

1
Prediction of Voice Fundamental Frequency and Intensity from Surface Electromyographic Signals of the Face and Neck.基于面部和颈部表面肌电信号预测语音基频和强度
Vibration. 2022 Dec;5(4):692-710. doi: 10.3390/vibration5040041. Epub 2022 Oct 13.
2
Consumer Ratings of the Most Desirable Hearing Aid Attributes.消费者对最理想助听器属性的评价。
J Am Acad Audiol. 2021 Sep;32(8):537-546. doi: 10.1055/s-0041-1732442. Epub 2021 Dec 29.
3
Surface Electromyography-Based Recognition, Synthesis, and Perception of Prosodic Subvocal Speech.基于表面肌电图的韵律性默读语音识别、合成与感知
J Speech Lang Hear Res. 2021 Jun 18;64(6S):2134-2153. doi: 10.1044/2021_JSLHR-20-00257. Epub 2021 May 12.
4
The Effects of Modulating Fundamental Frequency and Speech Rate on the Intelligibility, Communication Efficiency, and Perceived Naturalness of Synthetic Speech.调节基频和语速对合成语音的可懂度、通信效率及感知自然度的影响
Am J Speech Lang Pathol. 2019 Jul 15;28(2S):875-886. doi: 10.1044/2019_AJSLP-MSC18-18-0052.
5
Speech synthesis from ECoG using densely connected 3D convolutional neural networks.使用密集连接的 3D 卷积神经网络进行脑电信号合成。
J Neural Eng. 2019 Jun;16(3):036019. doi: 10.1088/1741-2552/ab0c59. Epub 2019 Mar 4.
6
Communication changes with laryngectomy and impact on quality of life: a review.喉切除术与生活质量变化的交流:综述
Qual Life Res. 2019 Apr;28(4):863-877. doi: 10.1007/s11136-018-2033-y. Epub 2018 Nov 11.
7
Development of sEMG sensors and algorithms for silent speech recognition.用于无声语音识别的表面肌电传感器和算法的开发。
J Neural Eng. 2018 Aug;15(4):046031. doi: 10.1088/1741-2552/aac965. Epub 2018 Jun 1.
8
Silent Speech Recognition as an Alternative Communication Device for Persons with Laryngectomy.无声语音识别作为喉切除患者的替代交流设备
IEEE/ACM Trans Audio Speech Lang Process. 2017 Dec;25(12):2386-2398. doi: 10.1109/TASLP.2017.2740000. Epub 2017 Nov 28.
9
The electrolarynx: voice restoration after total laryngectomy.电子喉:全喉切除术后的语音恢复
Med Devices (Auckl). 2017 Jun 21;10:133-140. doi: 10.2147/MDER.S133225. eCollection 2017.
10
An initial investigation into the real-time conversion of facial surface EMG signals to audible speech.
Annu Int Conf IEEE Eng Med Biol Soc. 2016 Aug;2016:888-891. doi: 10.1109/EMBC.2016.7590843.

基于表面肌电图的喉切除患者默读语音的韵律偏好

Prosodic Preferences of Surface Electromyography-based Subvocal Speech for People With Laryngectomy.

作者信息

Raiff Laura, Turashvili Dea, Heaton James T, De Luca Gianluca, Kline Joshua C, Vojtech Jenny

机构信息

Delsys, Inc., Natick, Massachusetts 01760; Altec, Inc., Natick, Massachusetts 01760.

Delsys, Inc., Natick, Massachusetts 01760; Altec, Inc., Natick, Massachusetts 01760; Department of Biomedical Engineering, Boston University, Boston, Massachusetts, 02215.

出版信息

J Voice. 2024 Dec 5. doi: 10.1016/j.jvoice.2024.10.024.

DOI:10.1016/j.jvoice.2024.10.024
PMID:39643558
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12137685/
Abstract

INTRODUCTION

People who undergo a total laryngectomy lose their natural voice and depend on alaryngeal technologies for communication. However, these technologies are often difficult to use and lack prosody. Surface electromyographic-based silent speech interfaces are novel communication systems that overcome many of the shortcomings of traditional alaryngeal speech and have the potential to seamlessly incorporate individualized prosody. The purpose of this study was to (1) validate the ability of alaryngeal silent speech to effectively incorporate pitch modulations-a key prosodic element in natural speech-into synthesized speech assessed through listening experiments and (2) determine the key features of these communication devices according to core users.

METHODOLOGY

People with laryngectomy (n = 15) and their primary communication partners (n = 5) listened to synthesized sentences with differing prosodic content generated from deep regression neural networks developed in our prior work. Specifically, the fundamental frequency (f) contour of each sentence was manipulated in four ways: (1) flattened to the average f, (2) altered to discrete sentence-level classification of muscle activity, (3) altered to continuous mapping of muscle activity, and (4) filtered to emulate speech from an electrolarynx (EL). Listeners ranked the f contours of each sentence in terms of speech naturalness and the importance of various speech aid features.

RESULTS

Continuous contours rated higher than all other types of contours, and monotonic EL contours rated the lowest. Speech aid features were rated highest to lowest in the following order: sound quality, intelligibility, pitch, delay, volume, hands-free, maintenance, cost, wearability, training, and visibility.

CONCLUSION

These results will help inform future development of silent speech interfaces and shape priorities of communication devices toward the preferences of their users.

摘要

引言

接受全喉切除术的人会失去自然嗓音,依靠人工喉技术进行交流。然而,这些技术通常使用困难且缺乏韵律。基于表面肌电图的无声语音接口是一种新型通信系统,克服了传统人工喉语音的许多缺点,并有可能无缝融入个性化韵律。本研究的目的是:(1)通过听力实验验证人工喉无声语音将音高调制(自然语音中的关键韵律元素)有效融入合成语音的能力;(2)根据核心用户确定这些通信设备的关键特征。

方法

喉切除患者(n = 15)及其主要交流伙伴(n = 5)听取了由我们之前工作中开发的深度回归神经网络生成的具有不同韵律内容的合成句子。具体而言,每个句子的基频(f)轮廓通过四种方式进行处理:(1)平坦化为平均f;(2)改变为肌肉活动的离散句子级分类;(3)改变为肌肉活动的连续映射;(4)滤波以模拟电子喉(EL)的语音。听众根据语音自然度和各种助听功能的重要性对每个句子的f轮廓进行排序。

结果

连续轮廓的评分高于所有其他类型的轮廓,单调的EL轮廓评分最低。助听功能的评分从高到低依次为:音质、可懂度、音高、延迟、音量、免提、维护、成本、可穿戴性、培训和可见性。

结论

这些结果将有助于为无声语音接口的未来发展提供信息,并根据用户偏好确定通信设备的优先事项。