• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Enhancing the Performance of Pathological Voice Quality Assessment System Through the Attention-Mechanism Based Neural Network.

作者信息

Han Ji-Yan, Hsiao Ching-Ju, Zheng Wei-Zhong, Weng Ko-Cheng, Ho Guan-Min, Chang Chia-Yuan, Wang Chi-Te, Fang Shih-Hau, Lai Ying-Hui

机构信息

National Yang Ming Chiao Tung University, Department of Biomedical Engineering, Taipei, Taiwan.

APrevent Medical Inc., Taipei, Taiwan.

出版信息

J Voice. 2025 Jul;39(4):1033-1043. doi: 10.1016/j.jvoice.2022.12.026. Epub 2023 Jan 31.

DOI:10.1016/j.jvoice.2022.12.026
PMID:36732109
Abstract

OBJECTIVE

Doctors, nowadays, primarily use auditory-perceptual evaluation, such as the grade, roughness, breathiness, asthenia, and strain scale, to evaluate voice quality and determine the treatment. However, the results predicted by individual physicians often differ, because of subjective perceptions, and diagnosis time interval, if the patient's symptoms are hard to judge. Therefore, an accurate computerized pathological voice quality assessment system will improve the quality of assessment.

METHOD

This study proposes a self_attention-based system, with a deep learning technology, named self_attention-based bidirectional long-short term memory (SA BiLSTM). Different pitches [low, normal, high], and vowels [/a/, /i/, /u/], were added into the proposed model, to make it learn how professional doctors evaluate the grade, roughness, breathiness, asthenia, and strain scale, in a high dimension view.

RESULTS

The experimental results showed that the proposed system provided higher performance than the baseline system. More specifically, the macro average of the F1 score, presented as decimal, was used to compare the accuracy of classification. The (G, R, and B) of the proposed system were (0.768±0.011, 0.820±0.009, and 0.815±0.009), which is higher than the baseline systems: deep neural network (0.395±0.010, 0.312±0.019, 0.321±0.014) and convolution neural network (0.421±0.052, 0.306±0.043, 0.3250±0.032) respectively.

CONCLUSIONS

The proposed system, with SA BiLSTM, pitches, and vowels, provides a more accurate way to evaluate the voice. This will be helpful for clinical voice evaluations and will improve patients' benefits from voice therapy.

摘要

相似文献

1
Enhancing the Performance of Pathological Voice Quality Assessment System Through the Attention-Mechanism Based Neural Network.
J Voice. 2025 Jul;39(4):1033-1043. doi: 10.1016/j.jvoice.2022.12.026. Epub 2023 Jan 31.
2
Perceptual and Computational Estimates of Vocal Breathiness and Roughness in Sustained Phonation and Connected Speech.持续发声和连贯言语中嗓音呼吸声和粗糙感的感知与计算评估
J Voice. 2025 Jul;39(4):1131.e31-1131.e43. doi: 10.1016/j.jvoice.2023.02.014. Epub 2023 Mar 16.
3
Voice disorder discrimination using vowel acoustic measures in female speakers.基于元音声学特征的女性嗓音障碍判别。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):2087-2102. doi: 10.1111/1460-6984.13081. Epub 2024 Jun 17.
4
Is Cepstral Peak Prominence a Measure of Vocal Fatigue in Temple Priests: A Pilot Study.谐波峰值突出度能否作为寺庙神职人员嗓音疲劳的一项指标:一项初步研究。
J Voice. 2023 Mar 5. doi: 10.1016/j.jvoice.2023.01.015.
5
The agreement of phonetic transcriptions between paediatric speech and language therapists transcribing a disordered speech sample.儿科言语和语言治疗师转写语音样本的音标转录的一致性。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1981-1995. doi: 10.1111/1460-6984.13043. Epub 2024 Jun 8.
6
Short-Term Memory Impairment短期记忆障碍
7
Effectiveness of voice rehabilitation on vocalisation in postlaryngectomy patients: a systematic review.喉切除术后患者的嗓音康复对发声效果的影响:系统评价。
Int J Evid Based Healthc. 2010 Dec;8(4):256-8. doi: 10.1111/j.1744-1609.2010.00177.x.
8
Automatic GRBAS Scoring of Pathological Voices using Deep Learning and a Small Set of Labeled Voice Data.使用深度学习和少量标记语音数据对病理性嗓音进行自动GRBAS评分
J Voice. 2025 May;39(3):846.e1-846.e23. doi: 10.1016/j.jvoice.2022.10.020. Epub 2022 Nov 25.
9
Comparative Analysis of Two Methods of Perceptual Voice Assessment.两种感知语音评估方法的比较分析
J Voice. 2023 Mar 10. doi: 10.1016/j.jvoice.2023.01.005.
10
Normative Values of Client-Reported Outcome Measures and Self-Ratings of Six Voice Parameters via the VoiceEvalU8 App.通过 VoiceEvalU8 应用程序报告的客户报告结局测量和自我评估的 6 个声音参数的正常值。
J Voice. 2024 Jul;38(4):964.e17-964.e27. doi: 10.1016/j.jvoice.2021.10.026. Epub 2021 Dec 9.