• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于联合谱-时间特征的高斯后验图对唇腭裂语音清晰度的评估

Intelligibility assessment of cleft lip and palate speech using Gaussian posteriograms based on joint spectro-temporal features.

作者信息

Kalita Sishir, Mahadeva Prasanna S R, Dandapat S

机构信息

Department of Electronics and Electrical Engineering, Indian Institute of Technology Guwahati, Guwahati, Assam 781039, India.

出版信息

J Acoust Soc Am. 2018 Oct;144(4):2413. doi: 10.1121/1.5064463.

DOI:10.1121/1.5064463
PMID:30404473
Abstract

Intelligibility is considered as one of the primary measures for speech rehabilitation of individuals with a cleft lip and palate (CLP). Currently, speech processing and machine-learning-based objective methods are gaining more research interest as a way to quantify speech intelligibility. In this work, joint spectro-temporal features computed from a time-frequency representation of speech are explored to derive speech representations based on Gaussian posteriograms. A comparative framework using dynamic time warping (DTW) is used to quantify the intelligibility of child CLP speech. The DTW distance is used to score sentence-level intelligibility and tested for correlation with perceptual intelligibility ratings obtained from expert speech-language pathologists. A baseline DTW system using the conventional Mel-frequency cepstral coefficients (MFCCs) is also developed to compare the performance of the proposed system. Spearman's rank correlation coefficient between the objective intelligibility scores and the perceptual intelligibility rating is studied. A Williams significance test is conducted to assess the statistical significance of the correlation difference between the methods. The results show that the system based on joint spectro-temporal features significantly outperforms the MFCC-based system.

摘要

可懂度被视为唇腭裂(CLP)患者言语康复的主要指标之一。目前,语音处理和基于机器学习的客观方法作为量化言语可懂度的一种方式,正获得越来越多的研究关注。在这项工作中,探索了从语音的时频表示中计算出的联合谱-时间特征,以基于高斯后验图导出语音表示。使用动态时间规整(DTW)的比较框架用于量化儿童CLP语音的可懂度。DTW距离用于对句子级可懂度进行评分,并测试其与从专业言语病理学家获得的感知可懂度评级的相关性。还开发了一个使用传统梅尔频率倒谱系数(MFCC)的基线DTW系统,以比较所提出系统的性能。研究了客观可懂度分数与感知可懂度评级之间的斯皮尔曼等级相关系数。进行威廉姆斯显著性检验,以评估方法之间相关性差异的统计显著性。结果表明,基于联合谱-时间特征的系统明显优于基于MFCC的系统。

相似文献

1
Intelligibility assessment of cleft lip and palate speech using Gaussian posteriograms based on joint spectro-temporal features.基于联合谱-时间特征的高斯后验图对唇腭裂语音清晰度的评估
J Acoust Soc Am. 2018 Oct;144(4):2413. doi: 10.1121/1.5064463.
2
Importance of glottis landmarks for the assessment of cleft lip and palate speech intelligibility.评估腭裂语音清晰度时对声门裂标志的重视。
J Acoust Soc Am. 2018 Nov;144(5):2656. doi: 10.1121/1.5062838.
3
Objective assessment of cleft lip and palate speech intelligibility using articulation and hypernasality measures.使用构音和超鼻音测量客观评估唇腭裂语音清晰度。
J Acoust Soc Am. 2019 Aug;146(2):1164. doi: 10.1121/1.5121310.
4
Evaluation of speech intelligibility for children with cleft lip and palate by means of automatic speech recognition.通过自动语音识别评估唇腭裂儿童的言语清晰度
Int J Pediatr Otorhinolaryngol. 2006 Oct;70(10):1741-7. doi: 10.1016/j.ijporl.2006.05.016. Epub 2006 Jun 30.
5
Children's Attitudes Toward Peers With Unintelligible Speech Associated With Cleft Lip and/or Palate.儿童对伴有唇腭裂且语音不清的同伴的态度。
Cleft Palate Craniofac J. 2017 May;54(3):262-268. doi: 10.1597/15-088. Epub 2016 Mar 31.
6
[Intelligibility of children with bilateral and unilateral cleft lip and palate].[双侧及单侧唇腭裂患儿的语音清晰度]
Laryngorhinootologie. 2009 Nov;88(11):723-8. doi: 10.1055/s-0029-1225639. Epub 2009 Jul 23.
7
Effect of cleft type on overall speech intelligibility and resonance.腭裂类型对整体言语清晰度及共鸣的影响。
Folia Phoniatr Logop. 2002 May-Jun;54(3):158-68. doi: 10.1159/000063411.
8
The correlation between nasalance and a differentiated perceptual rating of speech in Dutch patients with velopharyngeal insufficiency.荷兰腭咽功能不全患者鼻音与语音差异感知评分之间的相关性。
Cleft Palate Craniofac J. 2002 May;39(3):277-84. doi: 10.1597/1545-1569_2002_039_0277_tcbnaa_2.0.co_2.
9
Assessing intelligibility in speakers with cleft palate: a critical review of the literature.评估腭裂患者的言语清晰度:文献综述
Cleft Palate Craniofac J. 2002 Jan;39(1):50-8. doi: 10.1597/1545-1569_2002_039_0050_aiiswc_2.0.co_2.
10
Untrained listeners' ratings of speech disorders in a group with cleft palate: a comparison with speech and language pathologists' ratings.未经训练的听众对腭裂患者语音障碍的评估:与言语语言病理学家评估的比较。
Int J Lang Commun Disord. 2009 Sep-Oct;44(5):656-74. doi: 10.1080/13682820802295203.

引用本文的文献

1
Consonant-Vowel Transition Models Based on Deep Learning for Objective Evaluation of Articulation.基于深度学习的辅音-元音过渡模型用于发音的客观评估。
IEEE/ACM Trans Audio Speech Lang Process. 2023;31:86-95. doi: 10.1109/taslp.2022.3209937. Epub 2022 Oct 10.