替代语音质量的客观评估。

Objective evaluation of the quality of substitution voices.

作者信息

Moerman Mieke, Pieters Glenn, Martens Jean-Pierre, Van der Borgt Marie-Jeanne, Dejonckere Phillippe

机构信息

Institute of Phoniatrics, University Medical Centre Utrecht, Utrecht, The Netherlands.

出版信息

Eur Arch Otorhinolaryngol. 2004 Nov;261(10):541-7. doi: 10.1007/s00405-003-0681-0. Epub 2004 Jan 15.

DOI:10.1007/s00405-003-0681-0

PMID:14727123

Abstract

This paper describes our first attempts to develop a method for the objective assessment of quality in substitution voices. The objective analysis deals with acoustic parameters characterising short voice and speech samples like a sequence of isolated vowels, a sequence of VCV and CVCVCV syllables, a short sentence, etc. A database of 113 registrations from 68 patients (53 total laryngectomy patients with tracheo-esophageal speech, 14 total laryngectomy patients with esophageal speech and 5 patients with partial frontolateral laryngectomy) and 6 registrations from healthy control persons was collected. Each registration consisted of seven speech utterances and was subjected to an acoustic analysis as well as to a perceptual evaluation, the latter involving eight parameters like "overall impression", "tonicity", etc. Since the goal of our work is to find out the best acoustical measurement for supporting perception and making it precise, it seemed logical to strive for a perceptually based acoustic analysis. We therefore performed the analysis by means of a peripheral auditory model with a built-in fundamental frequency (pitch) extractor. From the frame-level outputs (a frame is 10 ms) of the analyser, global objective parameters, such as (1) the percentage of voiced frames, (2) the average voicing evidence, (3) the voicing length distribution and (4) the fundamental frequency jitter, were computed for the different speech utterances. So as to reduce the parameter variability arising from the nature of the speech utterances (e.g., the presence of pauses in the signal, errors caused by the pitch extractor, etc.), the objective parameters were computed using non-standard averaging schemes involving energy weighting and frame selection. A statistical analysis of the objective parameters confirms that the quality of tracheo-esophageal speech is superior to that of esophageal speech, but inferior to that of normal speech and speech with the preservation of one vocal fold. Correlations between the objective parameters and the perceptual parameters are moderate.

摘要

本文描述了我们首次尝试开发一种客观评估替代语音质量的方法。客观分析涉及表征短语音和言语样本的声学参数，如一系列孤立元音、一系列VCV和CVCVCV音节、一个短句等。收集了来自68名患者（53名全喉切除术后采用气管食管发音的患者、14名全喉切除术后采用食管发音的患者以及5名部分额侧喉切除患者）的113份记录和来自健康对照者的6份记录。每份记录包含七个言语发声，并进行了声学分析以及感知评估，后者涉及“总体印象”“音调”等八个参数。由于我们工作的目标是找出支持感知并使其精确的最佳声学测量方法，因此基于感知进行声学分析似乎是合乎逻辑的。因此，我们借助一个内置基频（音高）提取器的外周听觉模型进行了分析。从分析仪的帧级输出（一帧为10毫秒）中，针对不同的言语发声计算了全局客观参数，如（1）浊音帧的百分比、（2）平均浊音证据、（3）浊音时长分布和（4）基频抖动。为了减少因言语发声的性质（例如信号中存在停顿、音高提取器导致的误差等）而产生的参数变异性，使用了涉及能量加权和帧选择的非标准平均方案来计算客观参数。对客观参数的统计分析证实，气管食管发音的质量优于食管发音，但低于正常发音和保留一侧声带的发音。客观参数与感知参数之间的相关性为中等。

相似文献

Objective evaluation of the quality of substitution voices.

Eur Arch Otorhinolaryngol. 2004 Nov;261(10):541-7. doi: 10.1007/s00405-003-0681-0. Epub 2004 Jan 15.

The intelligibility of tracheoesophageal speech, with an emphasis on the voiced-voiceless distinction.

Logoped Phoniatr Vocol. 2006;31(4):172-81. doi: 10.1080/14015430500515732.

Automatic intelligibility assessment of speakers after laryngeal cancer by means of acoustic modeling.

J Voice. 2012 May;26(3):390-7. doi: 10.1016/j.jvoice.2011.04.010. Epub 2011 Aug 5.

The role of the different neoglottis forms in the development of esophageal voice.

Acta Physiol Hung. 2014 Sep;101(3):291-300. doi: 10.1556/APhysiol.101.2014.004.

Multidimensional assessment of strongly irregular voices such as in substitution voicing and spasmodic dysphonia: a compilation of own research.

Logoped Phoniatr Vocol. 2015 Apr;40(1):24-9. doi: 10.3109/14015439.2014.936497. Epub 2014 Jul 14.

The Influence of Native Language on Auditory-Perceptual Evaluation of Vocal Samples Completed by Brazilian and Canadian SLPs.

J Voice. 2017 Mar;31(2):258.e1-258.e5. doi: 10.1016/j.jvoice.2016.05.021. Epub 2016 Jul 11.

Voice and speech after laryngectomy.

Clin Linguist Phon. 2006 Apr-May;20(2-3):195-203. doi: 10.1080/02699200400026975.

The vocal clarity of female speech-language pathology students: an exploratory study.

J Voice. 2012 Jan;26(1):63-8. doi: 10.1016/j.jvoice.2010.10.008. Epub 2011 Mar 25.

[Acoustic voice analysis in phonatory fistuloplasty after total laryngectomy].

Acta Otorrinolaringol Esp. 1999 Mar;50(2):129-33.

Substitution voicing index: towards improved speech assessment in patients who have undergone laryngeal oncosurgery.

Clin Linguist Phon. 2023 Jul 3;37(7):583-598. doi: 10.1080/02699206.2022.2059398. Epub 2022 Jun 3.

引用本文的文献

Tracheoesophageal Voicing Following Resistance-Based Dysphagia Rehabilitation: An Exploratory Multidimensional Assessment.

Head Neck. 2025 Aug;47(8):2209-2222. doi: 10.1002/hed.28136. Epub 2025 Mar 25.

Development and evaluation of a new intraoral voice assist device called the voice retriever.

Laryngoscope Investig Otolaryngol. 2024 Jan 30;9(1):e1204. doi: 10.1002/lio2.1204. eCollection 2024 Feb.

Subjective Perception and Psychoacoustic Aspects of the Laryngectomee Voice: The Impact on Quality of Life.

J Pers Med. 2023 Mar 22;13(3):570. doi: 10.3390/jpm13030570.

Objective and subjective voice outcomes after total laryngectomy: a systematic review.

Eur Arch Otorhinolaryngol. 2018 Jan;275(1):11-26. doi: 10.1007/s00405-017-4790-6. Epub 2017 Oct 31.

Functional outcomes after supracricoid laryngectomy: what do we not know and what do we need to know?

Eur Arch Otorhinolaryngol. 2016 Nov;273(11):3459-3475. doi: 10.1007/s00405-015-3822-3. Epub 2015 Nov 6.

A pilot study about speech changes after partial Tucker's laryngectomy: the reduction of regressive voicing assimilation.

Eur Arch Otorhinolaryngol. 2015 Dec;272(12):3843-9. doi: 10.1007/s00405-015-3702-x. Epub 2015 Jul 9.

Reliability of the Italian INFVo scale and correlations with objective measures and VHI scores.

Acta Otorhinolaryngol Ital. 2013 Apr;33(2):121-7.

Videolaryngostroboscopy and voice evaluation in patients with rheumatoid arthritis.

Braz J Otorhinolaryngol. 2012 Oct;78(5):121-7. doi: 10.5935/1808-8694.20120019.

Voicing quantification is more relevant than period perturbation in substitution voices: an advanced acoustical study.

Eur Arch Otorhinolaryngol. 2012 Apr;269(4):1205-12. doi: 10.1007/s00405-011-1900-8. Epub 2012 Jan 5.

Tridimensional assessment of adductor spasmodic dysphonia pre- and post-treatment with Botulinum toxin.

Eur Arch Otorhinolaryngol. 2012 Apr;269(4):1195-203. doi: 10.1007/s00405-011-1890-6. Epub 2011 Dec 31.

本文引用的文献

A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Guideline elaborated by the Committee on Phoniatrics of the European Laryngological Society (ELS).

Eur Arch Otorhinolaryngol. 2001 Feb;258(2):77-82. doi: 10.1007/s004050000299.

The dysphonia severity index: an objective measure of vocal quality based on a multiparameter approach.

J Speech Lang Hear Res. 2000 Jun;43(3):796-809. doi: 10.1044/jslhr.4303.796.

Acoustical analysis and perceptual evaluation of tracheoesophageal prosthetic voice.

J Voice. 1998 Jun;12(2):239-48. doi: 10.1016/s0892-1997(98)80044-1.

[Contribution and limits of acoustic analysis of the voice and alaryngeal speech with a computerized system].

Ann Otolaryngol Chir Cervicofac. 1996;113(2):61-8.

Spectrographic differences between tracheal-esophageal and esophageal voice.

Folia Phoniatr Logop. 1996;48(5):255-61. doi: 10.1159/000266416.

Perceptual evaluation of voice quality: review, tutorial, and a framework for future research.

J Speech Hear Res. 1993 Feb;36(1):21-40. doi: 10.1044/jshr.3601.21.

Acoustic analysis of tracheo-oesophageal versus oesophageal speech.

J Laryngol Otol. 1994 Apr;108(4):325-8. doi: 10.1017/s0022215100126660.

Acoustic differentiation of laryngeal, esophageal, and tracheoesophageal speech.

J Speech Hear Res. 1984 Dec;27(4):577-85. doi: 10.1044/jshr.2704.577.

Pitch and voiced/unvoiced determination with an auditory model.

J Acoust Soc Am. 1992 Jun;91(6):3511-26. doi: 10.1121/1.402840.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

替代语音质量的客观评估。

Objective evaluation of the quality of substitution voices.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献