• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Hey Siri: How Effective are Common Voice Recognition Systems at Recognizing Dysphonic Voices?嘿,Siri:常见语音识别系统在识别嗓音障碍者的声音方面效果如何?
Laryngoscope. 2021 Jul;131(7):1599-1607. doi: 10.1002/lary.29082. Epub 2020 Sep 19.
2
Correlation of the Dysphonia Severity Index (DSI), Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V), and Gender in Brazilians With and Without Voice Disorders.巴西有嗓音障碍和无嗓音障碍人群中,嗓音障碍严重程度指数(DSI)、嗓音的共识听觉-感知评估(CAPE-V)与性别的相关性。
J Voice. 2016 Nov;30(6):765.e7-765.e11. doi: 10.1016/j.jvoice.2015.10.013. Epub 2015 Nov 28.
3
Acoustic and Perceptual Classification of Within-sample Normal, Intermittently Dysphonic, and Consistently Dysphonic Voice Types.样本内正常、间歇性发声障碍和持续性发声障碍嗓音类型的声学及感知分类
J Voice. 2017 Mar;31(2):218-228. doi: 10.1016/j.jvoice.2016.04.016. Epub 2016 May 27.
4
Predictive value and discriminant capacity of cepstral- and spectral-based measures during continuous speech.基于倒谱和谱的语音连续语音分析的预测价值和判别能力。
J Voice. 2013 Jul;27(4):393-400. doi: 10.1016/j.jvoice.2013.02.005. Epub 2013 May 16.
5
Multiparameter Voice Assessment in Dysphonics: Correlation Between Objective and Perceptual Parameters.嗓音障碍的多参数语音评估:客观参数与感知参数之间的相关性
J Voice. 2022 May;36(3):335-343. doi: 10.1016/j.jvoice.2020.06.009. Epub 2020 Jul 7.
6
Use of cepstral analysis for differentiating dysphonic from normal voices in children.运用倒谱分析鉴别儿童嗓音障碍与正常嗓音。
Int J Pediatr Otorhinolaryngol. 2019 Jan;116:107-113. doi: 10.1016/j.ijporl.2018.10.029. Epub 2018 Oct 23.
7
A comparison of Dysphonia Severity Index and Acoustic Voice Quality Index measures in differentiating normal and dysphonic voices.嗓音障碍严重程度指数与声学嗓音质量指数在区分正常嗓音和嗓音障碍嗓音方面的比较。
Eur Arch Otorhinolaryngol. 2018 Apr;275(4):949-958. doi: 10.1007/s00405-018-4903-x. Epub 2018 Feb 13.
8
Evaluating iPhone recordings for acoustic voice assessment.评估用于声学语音评估的iPhone录音。
Folia Phoniatr Logop. 2012;64(3):122-30. doi: 10.1159/000335874. Epub 2012 May 15.
9
Validation of the Acoustic Voice Quality Index in the Lithuanian Language.立陶宛语声学语音质量指数的验证。
J Voice. 2017 Mar;31(2):257.e1-257.e11. doi: 10.1016/j.jvoice.2016.06.002. Epub 2016 Jul 15.
10
Quantifying and Improving the Performance of Speech Recognition Systems on Dysphonic Speech.量化并提高语音识别系统对嗓音障碍语音的性能。
Otolaryngol Head Neck Surg. 2023 May;168(5):1130-1138. doi: 10.1002/ohn.170. Epub 2023 Jan 24.

引用本文的文献

1
Participatory Design: Addressing Speech-Language Pathologists' Challenges through Collaborative Interface Design.参与式设计:通过协作界面设计应对言语病理学家的挑战
Des Issues. 2025 Spring;41(2):44-59. doi: 10.1162/desi_a_00805. Epub 2025 Apr 1.
2
Quantification of Automatic Speech Recognition System Performance on d/Deaf and Hard of Hearing Speech.自动语音识别系统对聋人及听力障碍者语音的性能量化
Laryngoscope. 2025 Jan;135(1):191-197. doi: 10.1002/lary.31713. Epub 2024 Aug 19.
3
Physical Activity Evaluation Using a Voice Recognition App: Development and Validation Study.使用语音识别应用程序进行身体活动评估:开发与验证研究。
JMIR Biomed Eng. 2021 Jan 21;6(1):e19088. doi: 10.2196/19088.
4
An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection.多倒谱投影表示策略在嗓音障碍检测中的实验分析。
Sensors (Basel). 2023 May 30;23(11):5196. doi: 10.3390/s23115196.

本文引用的文献

1
Measuring the impact of dysphonia on quality of life using health state preferences.使用健康状况偏好衡量声音障碍对生活质量的影响。
Laryngoscope. 2020 Apr;130(4):E177-E182. doi: 10.1002/lary.28148. Epub 2019 Jun 20.
2
The Effect of Background Noise on Intelligibility of Dysphonic Speech.背景噪声对嗓音障碍言语清晰度的影响。
J Speech Lang Hear Res. 2017 Jul 12;60(7):1919-1929. doi: 10.1044/2017_JSLHR-S-16-0012.
3
Prevalence and Occupation of Patients Presenting With Dysphonia in the United States.美国出现声音障碍患者的患病率及职业情况
J Voice. 2017 Sep;31(5):594-600. doi: 10.1016/j.jvoice.2017.01.011. Epub 2017 Apr 13.
4
Predicting Voice Disorder Status From Smoothed Measures of Cepstral Peak Prominence Using Praat and Analysis of Dysphonia in Speech and Voice (ADSV).使用Praat通过平滑的谐波峰值突出度测量以及言语和嗓音中的发声障碍分析(ADSV)预测嗓音障碍状态。
J Voice. 2017 Sep;31(5):557-566. doi: 10.1016/j.jvoice.2017.01.006. Epub 2017 Feb 4.
5
The Impact of Dysphonic Voices on Healthy Listeners: Listener Reaction Times, Speech Intelligibility, and Listener Comprehension.嗓音障碍对健康听众的影响:听众反应时间、言语清晰度及听众理解度
Am J Speech Lang Pathol. 2016 Nov 1;25(4):561-575. doi: 10.1044/2016_AJSLP-14-0183.
6
A Comparison of Cepstral Peak Prominence Measures From Two Acoustic Analysis Programs.两个声学分析程序的谐波峰值突出度测量比较
J Voice. 2017 May;31(3):387.e1-387.e10. doi: 10.1016/j.jvoice.2016.09.012. Epub 2016 Oct 15.
7
Assessment of Grade of Dysphonia and Correlation With Quality of Life Protocol.嗓音障碍分级评估及其与生活质量协议的相关性
J Voice. 2017 Mar;31(2):243.e21-243.e26. doi: 10.1016/j.jvoice.2016.04.005. Epub 2016 Sep 19.
8
Shifting Fundamental Frequency in Simulated Electric-Acoustic Listening: Effects of F0 Variation.模拟电声聆听中基频的变化:F0 变化的影响
Ear Hear. 2016 Jan-Feb;37(1):e18-25. doi: 10.1097/AUD.0000000000000227.
9
The prevalence of voice problems among adults in the United States.美国成年人嗓音问题的患病率。
Laryngoscope. 2014 Oct;124(10):2359-62. doi: 10.1002/lary.24740. Epub 2014 May 27.
10
Formant analysis in dysphonic patients and automatic Arabic digit speech recognition.嗓音障碍患者的共振峰分析与自动阿拉伯数字语音识别。
Biomed Eng Online. 2011 May 30;10:41. doi: 10.1186/1475-925X-10-41.

嘿,Siri:常见语音识别系统在识别嗓音障碍者的声音方面效果如何?

Hey Siri: How Effective are Common Voice Recognition Systems at Recognizing Dysphonic Voices?

作者信息

Rohlfing Matthew L, Buckley Daniel P, Piraquive Jacquelyn, Stepp Cara E, Tracy Lauren F

机构信息

Department of Otolaryngology-Head and Neck Surgery, Boston Medical Center Boston University School of Medicine, Boston, Massachusetts, U.S.A.

Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts, U.S.A.

出版信息

Laryngoscope. 2021 Jul;131(7):1599-1607. doi: 10.1002/lary.29082. Epub 2020 Sep 19.

DOI:10.1002/lary.29082
PMID:32949415
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9009156/
Abstract

OBJECTIVES/HYPOTHESIS: Interaction with voice recognition systems, such as Siri™ and Alexa™, is an increasingly important part of everyday life. Patients with voice disorders may have difficulty with this technology, leading to frustration and reduction in quality of life. This study evaluates the ability of common voice recognition systems to transcribe dysphonic voices.

STUDY DESIGN

Retrospective evaluation of "Rainbow Passage" voice samples from patients with and without voice disorders.

METHODS

Participants with (n = 30) and without (n = 23) voice disorders were recorded reading the "Rainbow Passage". Recordings were played at standardized intensity and distance-to-dictation programs on Apple iPhone 6S™, Apple iPhone 11 Pro™, and Google Voice™. Word recognition scores were calculated as the proportion of correctly transcribed words. Word recognition scores were compared to auditory-perceptual and acoustic measures.

RESULTS

Mean word recognition scores for participants with and without voice disorders were, respectively, 68.6% and 91.9% for Apple iPhone 6S™ (P < .001), 71.2% and 93.7% for Apple iPhone 11 Pro™ (P < .001), and 68.7% and 93.8% for Google Voice™ (P < .001). There were strong, approximately linear associations between CAPE-V ratings of overall severity of dysphonia and word recognition score, with correlation coefficients (R ) of 0.609 (iPhone 6S™), 0.670 (iPhone 11 Pro™), and 0.619 (Google Voice™). These relationships persisted when controlling for diagnosis, age, gender, fundamental frequency, and speech rate (P < .001 for all systems).

CONCLUSION

Common voice recognition systems function well with nondysphonic voices but are poor at accurately transcribing dysphonic voices. There was a strong negative correlation with word recognition scores and perceptual voice evaluation. As our society increasingly interfaces with automated voice recognition technology, the needs of patients with voice disorders should be considered.

LEVEL OF EVIDENCE

4 Laryngoscope, 131:1599-1607, 2021.

摘要

目的/假设:与语音识别系统(如Siri™和Alexa™)的交互在日常生活中变得越来越重要。语音障碍患者在使用这项技术时可能会遇到困难,从而导致沮丧情绪并降低生活质量。本研究评估了常见语音识别系统转录嗓音障碍患者语音的能力。

研究设计

对有和没有语音障碍患者的“彩虹段落”语音样本进行回顾性评估。

方法

记录了有语音障碍的参与者(n = 30)和无语音障碍的参与者(n = 23)朗读“彩虹段落”的情况。录音在苹果iPhone 6S™、苹果iPhone 11 Pro™和谷歌语音™上以标准化强度和距离听写程序播放。单词识别分数以正确转录单词的比例计算。将单词识别分数与听觉感知和声学测量结果进行比较。

结果

对于苹果iPhone 6S™,有语音障碍和无语音障碍参与者的平均单词识别分数分别为68.6%和91.9%(P <.001);对于苹果iPhone 11 Pro™,分别为71.2%和93.7%(P <.001);对于谷歌语音™,分别为68.7%和93.8%(P <.001)。嗓音障碍总体严重程度的CAPE-V评分与单词识别分数之间存在强且近似线性的关联,相关系数(R)分别为0.609(iPhone 6S™)、0.670(iPhone 11 Pro™)和0.619(谷歌语音™)。在控制诊断、年龄、性别、基频和语速后,这些关系依然存在(所有系统的P均<.001)。

结论

常见语音识别系统对无嗓音障碍的语音功能良好,但在准确转录嗓音障碍语音方面表现不佳。单词识别分数与感知语音评估之间存在很强的负相关性。随着我们的社会越来越多地与自动语音识别技术交互,应考虑语音障碍患者的需求。

证据水平

4 《喉镜》,131:1599 - 1607,2021年。