Acoustic analysis and detection of pharyngeal fricative in cleft palate speech using correlation of signals in independent frequency bands and octave spectrum prominent peak.

Affiliations

College of Electrical Engineering, Sichuan University, 610065, Chengdu, China.

West China Hospital of Stomatology, Sichuan University, 610041, Chengdu, China.

Publication information

Biomed Eng Online. 2020 May 27;19(1):36. doi: 10.1186/s12938-020-00782-3.

DOI:10.1186/s12938-020-00782-3
PMID:32460765
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7251748/
Abstract

BACKGROUND

Pharyngeal fricative is a typical compensatory articulation error in cleft palate speech. It adversely affects daily communication for the people who produce it. Automatic detection of pharyngeal fricatives in cleft palate speech can give clinicians and speech-language pathologists information to aid diagnosis.

RESULTS

This paper proposes two features for detecting pharyngeal fricative speech: CSIFs (correlation of signals in independent frequency bands) and OSPP (octave spectrum prominent peak). The CSIFs feature captures the distribution of frequency components in pharyngeal fricative speech, which is shaped by the changed place of articulation and the movement of the articulators. The OSPP feature reflects how concentrated the prominent peak of the octave spectrum is, a property closely related to the place of articulation in pharyngeal fricatives. Both features are examined in relation to the altered production process of pharyngeal fricatives. To evaluate how well the two features detect pharyngeal fricatives, we collected a speech database covering all types of initial consonants in which pharyngeal fricatives occur. The classifier that discriminates pharyngeal fricative speech from normal speech is based on ensemble learning.
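The abstract does not give the exact definitions of CSIFs and OSPP, so the following is only a rough sketch of the two ideas it names: correlating the energy of separate frequency bands across frames, and measuring how strongly the energy concentrates in one octave band. All function names, the band edges, and the 62.5 Hz starting frequency are assumptions made for illustration, not the authors' method.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive DFT power spectrum (bins 0..N/2) of one real-valued frame."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))) ** 2
        for k in range(n // 2 + 1)
    ]


def band_energy(spectrum, sample_rate, n_fft, lo, hi):
    """Total power of the bins whose centre frequency lies in [lo, hi)."""
    return sum(p for k, p in enumerate(spectrum) if lo <= k * sample_rate / n_fft < hi)


def octave_band_energies(spectrum, sample_rate, n_fft, f_low=62.5):
    """Energies of octave-wide bands [f, 2f), from f_low (assumed) up to Nyquist."""
    energies, lo, nyquist = [], f_low, sample_rate / 2
    while lo < nyquist:
        energies.append(band_energy(spectrum, sample_rate, n_fft, lo, min(2 * lo, nyquist)))
        lo *= 2
    return energies


def peak_concentration(energies):
    """Share of the total energy held by the strongest octave band (0..1)."""
    total = sum(energies)
    return max(energies) / total if total > 0 else 0.0


def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b) if var_a > 0 and var_b > 0 else 0.0


def band_correlation(frames, sample_rate, band_a, band_b):
    """Correlate the frame-by-frame energy tracks of two non-overlapping bands."""
    track_a, track_b = [], []
    for frame in frames:
        spectrum = power_spectrum(frame)
        track_a.append(band_energy(spectrum, sample_rate, len(frame), *band_a))
        track_b.append(band_energy(spectrum, sample_rate, len(frame), *band_b))
    return pearson(track_a, track_b)


# Usage on a synthetic 1 kHz tone (not real speech data).
fs, n = 8000, 256
tone = [math.sin(2 * math.pi * 1000 * t / fs) for t in range(n)]
bands = octave_band_energies(power_spectrum(tone), fs, n)
print(bands.index(max(bands)), round(peak_concentration(bands), 3))
```

For the pure tone, nearly all energy falls in the single octave band that contains 1 kHz, so the concentration value is close to 1. A broadband noise-like frame would spread its energy across bands and score lower. A real implementation would use an FFT library and the band layout defined in the full paper.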

CONCLUSION

The detection accuracy obtained with CSIFs and OSPP features ranges from 83.5 to 84.5% and from 85 to 87%, respectively. When these two features are combined, the detection accuracy for pharyngeal fricative speech ranges from 88 to 89%, with an AUC (area under the receiver operating characteristic curve) value of 93%.
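As a reminder of what the two reported metrics measure, here is a minimal sketch that computes accuracy and AUC from classifier scores. The labels and scores are made-up toy values, the 0.5 threshold is an assumption, and the function names are hypothetical. None of it reproduces the paper's data or evaluation protocol.

```python
def accuracy(labels, scores, threshold=0.5):
    """Fraction of samples whose thresholded score matches the 0/1 label."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


def roc_auc(labels, scores):
    """AUC as the probability that a positive outscores a negative (ties count 0.5)."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in positives for q in negatives)
    return wins / (len(positives) * len(negatives))


# Toy labels (1 = pharyngeal fricative, 0 = normal) and made-up classifier scores.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(accuracy(labels, scores), roc_auc(labels, scores))  # 0.75 0.75
```

Accuracy depends on the chosen decision threshold, whereas AUC summarizes ranking quality over all thresholds, which is why the two figures in the abstract differ.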

Figures

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/32adf72b472b/12938_2020_782_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/322a11ad34a8/12938_2020_782_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/66f1fd7baafc/12938_2020_782_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/cc3a5f38ab97/12938_2020_782_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/3ce5a534ca62/12938_2020_782_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/323454fed888/12938_2020_782_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/87936d87bec9/12938_2020_782_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/dc596537d9cc/12938_2020_782_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/246fe594d06c/12938_2020_782_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/724b025b1e83/12938_2020_782_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/f13d4abc86d7/12938_2020_782_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/c72151465585/12938_2020_782_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/2f773bbde202/12938_2020_782_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/ac71e03ad18b/12938_2020_782_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2fc/7251748/ee10f323b8d4/12938_2020_782_Fig15_HTML.jpg
