• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

噪声对病理性语音分类的深度学习的影响。

The Effect of Noise on Deep Learning for Classification of Pathological Voice.

机构信息

Department of Otolaryngology, Head and Neck Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan.

出版信息

Laryngoscope. 2024 Aug;134(8):3537-3541. doi: 10.1002/lary.31303. Epub 2024 Jan 27.

DOI:10.1002/lary.31303
PMID:38280184
Abstract

OBJECTIVE

This study aimed to evaluate the significance of background noise in machine learning models assessing the GRBAS scale for voice disorders.

METHODS

A dataset of 1406 voice samples was collected from retrospective data, and a 5-layer 1D convolutional neural network (CNN) model was constructed using TensorFlow. The dataset was divided into training, validation, and test data. Gaussian noise was added to test samples at various intensities to assess the model's noise resilience. The model's performance was evaluated using accuracy, F1 score, and quadratic weighted Cohen's kappa score.

RESULTS

The model's performance on the GRBAS scale generally declined with increasing noise intensities. For the G scale, accuracy dropped from 70.9% (original) to 8.5% (at the highest noise), F1 score from 69.2% to 1.3%, and Cohen's kappa from 0.679 to 0.0. Similar declines were observed for the remaining RBAS components.

CONCLUSION

The model's performance was affected by background noise, with substantial decreases in evaluation metrics as noise levels intensified. Future research should explore noise-tolerant techniques, such as data augmentation, to improve the model's noise resilience in real-world settings.

LEVEL OF EVIDENCE

This study evaluates a machine learning model using a single dataset without comparative controls. Given its non-comparative design and specific focus, it aligns with Level 4 evidence (Case-series) under the 2011 OCEBM guidelines Laryngoscope, 134:3537-3541, 2024.

摘要

目的

本研究旨在评估背景噪声在用于评估嗓音障碍 GRBAS 量表的机器学习模型中的意义。

方法

从回顾性数据中收集了 1406 个语音样本数据集,并使用 TensorFlow 构建了一个 5 层 1D 卷积神经网络(CNN)模型。数据集分为训练、验证和测试数据。向测试样本添加不同强度的高斯噪声,以评估模型的抗噪能力。使用准确性、F1 分数和二次加权 Cohen's kappa 分数评估模型的性能。

结果

随着噪声强度的增加,模型在 GRBAS 量表上的性能普遍下降。对于 G 量表,准确性从原始的 70.9%下降到最高噪声时的 8.5%,F1 分数从 69.2%下降到 1.3%,Cohen's kappa 从 0.679 下降到 0.0。其余 RBAS 分量也观察到类似的下降。

结论

模型的性能受到背景噪声的影响,随着噪声水平的加剧,评估指标显著下降。未来的研究应探索抗噪技术,例如数据增强,以提高模型在实际环境中的抗噪能力。

证据水平

本研究使用单一数据集评估机器学习模型,没有对照组。鉴于其非比较设计和特定重点,根据 2011 年 OCEBM 指南,它与第 4 级证据(病例系列)一致,Laryngoscope, 134:3537-3541, 2024。

相似文献

1
The Effect of Noise on Deep Learning for Classification of Pathological Voice.噪声对病理性语音分类的深度学习的影响。
Laryngoscope. 2024 Aug;134(8):3537-3541. doi: 10.1002/lary.31303. Epub 2024 Jan 27.
2
Classification of Voice Disorders Using a One-Dimensional Convolutional Neural Network.基于一维卷积神经网络的嗓音障碍分类
J Voice. 2022 Jan;36(1):15-20. doi: 10.1016/j.jvoice.2020.02.009. Epub 2020 Mar 13.
3
InsightSleepNet: the interpretable and uncertainty-aware deep learning network for sleep staging using continuous Photoplethysmography.InsightSleepNet:一种可解释且具有不确定性感知能力的深度学习网络,用于使用连续光体积描记法进行睡眠分期。
BMC Med Inform Decis Mak. 2024 Feb 14;24(1):50. doi: 10.1186/s12911-024-02437-y.
4
Objective Assessment of Pathological Voice Using Artificial Intelligence Based on the GRBAS Scale.基于 GRBAS 量表的人工智能在病理性嗓音评估中的客观评估。
J Voice. 2024 May;38(3):561-566. doi: 10.1016/j.jvoice.2021.11.021. Epub 2021 Dec 30.
5
Automatic GRBAS Scoring of Pathological Voices using Deep Learning and a Small Set of Labeled Voice Data.使用深度学习和少量标记语音数据对病理性嗓音进行自动GRBAS评分
J Voice. 2025 May;39(3):846.e1-846.e23. doi: 10.1016/j.jvoice.2022.10.020. Epub 2022 Nov 25.
6
Comparative Analysis of CNN and RNN for Voice Pathology Detection.卷积神经网络(CNN)和循环神经网络(RNN)在语音病理学检测中的比较分析。
Biomed Res Int. 2021 Apr 14;2021:6635964. doi: 10.1155/2021/6635964. eCollection 2021.
7
Automated multi-model deep neural network for sleep stage scoring with unfiltered clinical data.基于未滤波临床数据的睡眠分期评分自动化多模态深度神经网络。
Sleep Breath. 2020 Jun;24(2):581-590. doi: 10.1007/s11325-019-02008-w. Epub 2020 Jan 14.
8
Deep Learning for Voice Gender Identification: Proof-of-concept for Gender-Affirming Voice Care.深度学习在语音性别识别中的应用:用于性别肯定型嗓音护理的概念验证。
Laryngoscope. 2021 May;131(5):E1611-E1615. doi: 10.1002/lary.29281. Epub 2020 Nov 21.
9
Improving skin cancer detection by Raman spectroscopy using convolutional neural networks and data augmentation.利用卷积神经网络和数据增强技术通过拉曼光谱改善皮肤癌检测
Front Oncol. 2024 Jun 19;14:1320220. doi: 10.3389/fonc.2024.1320220. eCollection 2024.
10
The effect of Gaussian noise on pneumonia detection on chest radiographs, using convolutional neural networks.使用卷积神经网络时高斯噪声对胸部X光片肺炎检测的影响。
Radiography (Lond). 2023 Jan;29(1):38-43. doi: 10.1016/j.radi.2022.09.011. Epub 2022 Oct 20.

引用本文的文献

1
Key insights into recommended SMS spam detection datasets.关于推荐的短信垃圾邮件检测数据集的关键见解。
Sci Rep. 2025 Mar 10;15(1):8162. doi: 10.1038/s41598-025-92223-1.
2
A Deep-Learning Model for Multi-class Audio Classification of Vocal Fold Pathologies in Office Stroboscopy.一种用于办公室频闪喉镜检查中声带病变多类别音频分类的深度学习模型。
Laryngoscope. 2025 Jul;135(7):2428-2436. doi: 10.1002/lary.32036. Epub 2025 Feb 5.