• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

为喉癌患者实现一个统计参数语音合成系统。

Implementing a Statistical Parametric Speech Synthesis System for a Patient with Laryngeal Cancer.

机构信息

Multimedia Department, Polish-Japanese Academy of Information Technology, 02-008 Warsaw, Poland.

出版信息

Sensors (Basel). 2022 Apr 21;22(9):3188. doi: 10.3390/s22093188.

DOI:10.3390/s22093188
PMID:35590877
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9099606/
Abstract

Total laryngectomy, i.e., the surgical removal of the larynx, has a profound influence on a patient's quality of life. The procedure results in a loss of natural voice, which in effect constitutes a significant socio-psychological problem for the patient. The main aim of the study was to develop a statistical parametric speech synthesis system for a patient with laryngeal cancer, on the basis of the patient's speech samples recorded shortly before the surgery and to check if it was possible to generate speech quality close to that of the original recordings. The recording made use of a representative corpus of the Polish language, consisting of 2150 sentences. The recorded voice proved to indicate dysphonia, which was confirmed by the auditory-perceptual RBH scale (roughness, breathiness, hoarseness) and by acoustical analysis using AVQI (The Acoustic Voice Quality Index). The speech synthesis model was trained using the Merlin repository. Twenty-five experts participated in the MUSHRA listening tests, rating the synthetic voice at 69.4 in terms of the professional voice-over talent recording, on a 0-100 scale, which is a very good result. The authors compared the quality of the synthetic voice to another model of synthetic speech trained with the same corpus, but where a voice-over talent provided the recorded speech samples. The same experts rated the voice at 63.63, which means the patient's synthetic voice with laryngeal cancer obtained a higher score than that of the talent-voice recordings. As such, the method enabled for the creation of a statistical parametric speech synthesizer for patients awaiting total laryngectomy. As a result, the solution would improve the quality of life as well as better mental wellbeing of the patient.

摘要

全喉切除术,即喉的外科切除,对患者的生活质量有深远影响。该手术导致自然嗓音丧失,这实际上是患者面临的重大社会心理问题。本研究的主要目的是基于患者在手术前录制的语音样本,为喉癌患者开发一种统计参数语音合成系统,并检查是否有可能生成接近原始录音的语音质量。该录音利用了一个包含 2150 个句子的波兰语代表性语料库。记录的声音表明存在发音障碍,这通过听觉感知 RBH 量表(粗糙度、呼吸声、嘶哑)和使用 AVQI(语音质量指数)的声学分析得到了证实。语音合成模型使用 Merlin 存储库进行训练。二十五位专家参与了 MUSHRA 听力测试,根据 0-100 分制,将合成语音的评分设定为 69.4,与专业旁白录音相比,这是一个非常好的结果。作者将合成语音的质量与另一个使用相同语料库训练的合成语音模型进行了比较,但该模型的语音样本是由旁白演员录制的。同样的专家将该语音评为 63.63,这意味着患有喉癌的患者的合成语音比旁白演员的语音得分更高。因此,该方法为全喉切除术患者创建了统计参数语音合成器。结果,该解决方案将提高患者的生活质量和心理健康。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/167f31b6d897/sensors-22-03188-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/ef09c474bea2/sensors-22-03188-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/f9b4999a01f9/sensors-22-03188-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/ff6de0581d2c/sensors-22-03188-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/bb18fb6b2525/sensors-22-03188-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/67f5eea8477a/sensors-22-03188-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/068fdce0be4a/sensors-22-03188-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/10941d34a62b/sensors-22-03188-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/cfbff09f30d5/sensors-22-03188-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/167f31b6d897/sensors-22-03188-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/ef09c474bea2/sensors-22-03188-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/f9b4999a01f9/sensors-22-03188-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/ff6de0581d2c/sensors-22-03188-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/bb18fb6b2525/sensors-22-03188-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/67f5eea8477a/sensors-22-03188-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/068fdce0be4a/sensors-22-03188-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/10941d34a62b/sensors-22-03188-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/cfbff09f30d5/sensors-22-03188-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8529/9099606/167f31b6d897/sensors-22-03188-g009.jpg

相似文献

1
Implementing a Statistical Parametric Speech Synthesis System for a Patient with Laryngeal Cancer.为喉癌患者实现一个统计参数语音合成系统。
Sensors (Basel). 2022 Apr 21;22(9):3188. doi: 10.3390/s22093188.
2
Validation of the Acoustic Voice Quality Index in the Korean Language.韩语嗓音障碍指数量表的验证。
J Voice. 2019 Nov;33(6):948.e1-948.e9. doi: 10.1016/j.jvoice.2018.06.007. Epub 2018 Jul 31.
3
Validation of the Acoustic Voice Quality Index in the Lithuanian Language.立陶宛语声学语音质量指数的验证。
J Voice. 2017 Mar;31(2):257.e1-257.e11. doi: 10.1016/j.jvoice.2016.06.002. Epub 2016 Jul 15.
4
Automatic intelligibility assessment of speakers after laryngeal cancer by means of acoustic modeling.通过声学建模实现喉癌患者语音可懂度的自动评估。
J Voice. 2012 May;26(3):390-7. doi: 10.1016/j.jvoice.2011.04.010. Epub 2011 Aug 5.
5
[Internal Validation of the Acoustic Voice Quality Index version 03.01 und Acoustic Breathiness Index].[声学嗓音质量指数03.01版及声学呼吸音指数的内部验证]
Laryngorhinootologie. 2018 Sep;97(9):630-635. doi: 10.1055/a-0596-7819. Epub 2018 Apr 10.
6
Acoustic and perceptual evaluation of voice and speech quality: a study of patients with laryngeal cancer treated with laryngectomy vs irradiation.嗓音和语音质量的声学及感知评估:喉癌患者喉切除术与放射治疗的对比研究
Arch Otolaryngol Head Neck Surg. 1999 Feb;125(2):157-63. doi: 10.1001/archotol.125.2.157.
7
A comparison of Dysphonia Severity Index and Acoustic Voice Quality Index measures in differentiating normal and dysphonic voices.嗓音障碍严重程度指数与声学嗓音质量指数在区分正常嗓音和嗓音障碍嗓音方面的比较。
Eur Arch Otorhinolaryngol. 2018 Apr;275(4):949-958. doi: 10.1007/s00405-018-4903-x. Epub 2018 Feb 13.
8
Voice rehabilitation with Provox2 voice prosthesis following total laryngectomy for laryngeal and hypopharyngeal carcinoma.喉癌和下咽癌全喉切除术后使用Provox2发音假体进行语音康复。
Auris Nasus Larynx. 2007 Mar;34(1):65-71. doi: 10.1016/j.anl.2006.09.017. Epub 2006 Nov 29.
9
Voice Outcomes After Radiation for Early-Stage Laryngeal Cancer.早期喉癌放疗后的嗓音结果
J Voice. 2020 May;34(3):460-464. doi: 10.1016/j.jvoice.2018.11.007. Epub 2019 Jan 2.
10
Validation of Acoustic Voice Quality Index Version 3.01 and Acoustic Breathiness Index in Korean Population.验证韩国人群中的嗓音障碍指数 3.01 版和声扰指数。
J Voice. 2021 Jul;35(4):660.e9-660.e18. doi: 10.1016/j.jvoice.2019.10.005. Epub 2019 Nov 7.

引用本文的文献

1
Analytics and Applications of Audio and Image Sensing Techniques.音频和图像感应技术的分析与应用。
Sensors (Basel). 2022 Nov 3;22(21):8443. doi: 10.3390/s22218443.

本文引用的文献

1
Text-to-speech synthesis as an alternative communication means after total laryngectomy.文本转语音合成作为全喉切除术后的一种替代交流手段。
Biomed Pap Med Fac Univ Palacky Olomouc Czech Repub. 2021 Jun;165(2):192-197. doi: 10.5507/bp.2020.016. Epub 2020 Apr 27.
2
The Treatment of Laryngeal Cancer.喉癌的治疗
Oral Maxillofac Surg Clin North Am. 2019 Feb;31(1):1-11. doi: 10.1016/j.coms.2018.09.001.
3
Communication changes with laryngectomy and impact on quality of life: a review.喉切除术与生活质量变化的交流:综述
Qual Life Res. 2019 Apr;28(4):863-877. doi: 10.1007/s11136-018-2033-y. Epub 2018 Nov 11.
4
Laryngeal cancer: United Kingdom National Multidisciplinary guidelines.喉癌:英国国家多学科指南
J Laryngol Otol. 2016 May;130(S2):S75-S82. doi: 10.1017/S0022215116000487.
5
Epidemiological review of laryngeal cancer: An Indian perspective.喉癌的流行病学综述:印度视角
Indian J Med Paediatr Oncol. 2015 Jul-Sep;36(3):154-60. doi: 10.4103/0971-5851.166721.
6
Laryngeal cancer mortality trends in European countries.欧洲国家喉癌死亡率趋势
Int J Cancer. 2016 Feb 15;138(4):833-42. doi: 10.1002/ijc.29833. Epub 2015 Sep 14.
7
The multidimensional impact of total laryngectomy on women.全喉切除术对女性的多维度影响。
J Commun Disord. 2015 Jul-Aug;56:59-75. doi: 10.1016/j.jcomdis.2015.06.008. Epub 2015 Jul 2.
8
Laryngeal replacement with an artificial larynx after total laryngectomy: the possibility of restoring larynx functionality in the future.全喉切除术后用人造喉进行喉再造:未来恢复喉部功能的可能性。
Head Neck. 2014 Nov;36(11):1669-73. doi: 10.1002/hed.23621. Epub 2014 Jun 21.
9
The value of the acoustic voice quality index as a measure of dysphonia severity in subjects speaking different languages.不同语言人群的嗓音声学质量指数与发音障碍严重程度的相关性。
Eur Arch Otorhinolaryngol. 2014 Jun;271(6):1609-19. doi: 10.1007/s00405-013-2730-7. Epub 2013 Oct 26.
10
Reconstructing the voice of an individual following laryngectomy.重建喉切除术后个体的声音。
Augment Altern Commun. 2011 Mar;27(1):61-6. doi: 10.3109/07434618.2010.545078. Epub 2011 Feb 2.