• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用语音分析通过人工智能对包括喉癌、良性黏膜疾病和声带麻痹在内的喉部疾病进行分类。

Classification of laryngeal diseases including laryngeal cancer, benign mucosal disease, and vocal cord paralysis by artificial intelligence using voice analysis.

作者信息

Kim Hyun-Bum, Song Jaemin, Park Seho, Lee Yong Oh

机构信息

Department of Otolaryngology-Head and Neck Surgery, The Catholic University of Korea, Seoul, South Korea.

Department of Industrial and Data Engineering, Hongik University, Seoul, South Korea.

出版信息

Sci Rep. 2024 Apr 23;14(1):9297. doi: 10.1038/s41598-024-58817-x.

DOI:10.1038/s41598-024-58817-x
PMID:38654036
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11549102/
Abstract

Voice change is often the first sign of laryngeal cancer, leading to diagnosis through hospital laryngoscopy. Screening for laryngeal cancer solely based on voice could enhance early detection. However, identifying voice indicators specific to laryngeal cancer is challenging, especially when differentiating it from other laryngeal ailments. This study presents an artificial intelligence model designed to distinguish between healthy voices, laryngeal cancer voices, and those of the other laryngeal conditions. We gathered voice samples of individuals with laryngeal cancer, vocal cord paralysis, benign mucosal diseases, and healthy participants. Comprehensive testing was conducted to determine the best mel-frequency cepstral coefficient conversion and machine learning techniques, with results analyzed in-depth. In our tests, laryngeal diseases distinguishing from healthy voices achieved an accuracy of 0.85-0.97. However, when multiclass classification, accuracy ranged from 0.75 to 0.83. These findings highlight the challenges of artificial intelligence-driven voice-based diagnosis due to overlaps with benign conditions but also underscore its potential.

摘要

声音变化往往是喉癌的首个迹象,可通过医院喉镜检查得以诊断。仅基于声音对喉癌进行筛查能够提高早期检测率。然而,识别喉癌特有的声音指标具有挑战性,尤其是在将其与其他喉部疾病区分开来时。本研究提出了一种人工智能模型,旨在区分健康嗓音、喉癌嗓音以及其他喉部疾病的嗓音。我们收集了喉癌患者、声带麻痹患者、良性黏膜疾病患者以及健康参与者的声音样本。进行了全面测试以确定最佳的梅尔频率倒谱系数转换和机器学习技术,并对结果进行了深入分析。在我们的测试中,将喉部疾病与健康嗓音区分开来的准确率达到了0.85至0.97。然而,在多类分类时,准确率范围为0.75至0.83。这些发现凸显了基于人工智能的声音诊断因与良性病症存在重叠而面临的挑战,但同时也强调了其潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/0452f49a7e2f/41598_2024_58817_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/60dea0aa0043/41598_2024_58817_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/206a06f1e4c2/41598_2024_58817_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/3a6a41b2c5a5/41598_2024_58817_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/86b935240b8d/41598_2024_58817_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/06825faeb613/41598_2024_58817_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/0452f49a7e2f/41598_2024_58817_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/60dea0aa0043/41598_2024_58817_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/206a06f1e4c2/41598_2024_58817_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/3a6a41b2c5a5/41598_2024_58817_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/86b935240b8d/41598_2024_58817_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/06825faeb613/41598_2024_58817_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b7d/11549102/0452f49a7e2f/41598_2024_58817_Fig6_HTML.jpg

相似文献

1
Classification of laryngeal diseases including laryngeal cancer, benign mucosal disease, and vocal cord paralysis by artificial intelligence using voice analysis.利用语音分析通过人工智能对包括喉癌、良性黏膜疾病和声带麻痹在内的喉部疾病进行分类。
Sci Rep. 2024 Apr 23;14(1):9297. doi: 10.1038/s41598-024-58817-x.
2
Analysis and Classification of Voice Pathologies Using Glottal Signal Parameters.基于声门信号参数的嗓音病理分析与分类
J Voice. 2016 Sep;30(5):549-56. doi: 10.1016/j.jvoice.2015.06.010. Epub 2015 Oct 23.
3
Hierarchical Classification and System Combination for Automatically Identifying Physiological and Neuromuscular Laryngeal Pathologies.用于自动识别生理性和神经肌肉性喉部病变的分层分类与系统组合
J Voice. 2017 May;31(3):384.e9-384.e14. doi: 10.1016/j.jvoice.2016.09.003. Epub 2016 Oct 12.
4
Impairment of voice quality in paradoxical vocal fold motion dysfunction.反常声门运动障碍导致的嗓音质量受损。
J Voice. 2010 Nov;24(6):724-7. doi: 10.1016/j.jvoice.2009.05.004. Epub 2010 Jan 25.
5
Is a voice-specific instrument more indicative of stroboscopy results than common clinical queries?与常见临床问题相比,专门针对嗓音的仪器是否更能反映频闪喉镜检查的结果?
Laryngoscope. 2020 Apr;130(4):992-999. doi: 10.1002/lary.28207. Epub 2019 Aug 16.
6
The value of varying vocal frequency during stroboscopy for vocal fold masses.频闪喉镜检查声带肿物时改变声音频率的价值。
Ear Nose Throat J. 2003 Oct;82(10):752-3.
7
Sarcoidosis Presenting as Bilateral Vocal Fold Immobility.以双侧声带固定为表现的结节病
J Voice. 2018 May;32(3):359-362. doi: 10.1016/j.jvoice.2017.05.017. Epub 2017 Jul 3.
8
The Impact of a Vocal Loading Task on Voice Characteristics of Female Speakers With Benign Vocal Fold Lesions.良性声带病变女性患者发声负载任务对嗓音特征的影响。
J Voice. 2024 Jul;38(4):964.e1-964.e16. doi: 10.1016/j.jvoice.2021.11.009. Epub 2021 Dec 24.
9
Utility of Laryngeal High-speed Videoendoscopy in Clinical Voice Assessment.喉高速视频内镜检查在临床嗓音评估中的应用价值
J Voice. 2018 Mar;32(2):216-220. doi: 10.1016/j.jvoice.2017.05.002. Epub 2017 Jun 7.
10
[Clinical characteristics of 1330 elderly patients with voice diseases diagnosed by electronic laryngoscope].[1330例老年嗓音疾病患者电子喉镜诊断的临床特征]
Lin Chuang Er Bi Yan Hou Tou Jing Wai Ke Za Zhi. 2022 Dec;36(12):951-955. doi: 10.13201/j.issn.2096-7993.2022.12.012.

引用本文的文献

1
Artificial Intelligence in Medicine: A Specialty-Level Overview of Emerging AI Trends.医学中的人工智能:新兴人工智能趋势的专业层面概述。
JSLS. 2025 Jul-Sep;29(3). doi: 10.4293/JSLS.2025.00041. Epub 2025 Sep 3.
2
Research on automatic assessment of the severity of unilateral vocal cord paralysis based on Mel-spectrogram and convolutional neural networks.基于梅尔频谱图和卷积神经网络的单侧声带麻痹严重程度自动评估研究
Biomed Eng Online. 2025 Jun 21;24(1):76. doi: 10.1186/s12938-025-01401-9.

本文引用的文献

1
Voice disorder classification using convolutional neural network based on deep transfer learning.基于深度迁移学习的卷积神经网络语音障碍分类。
Sci Rep. 2023 May 4;13(1):7264. doi: 10.1038/s41598-023-34461-9.
2
Optimized Deep Learning for the Classification of Parkinson's Disease Based on Voice Features.基于语音特征的帕金森病分类优化深度学习
Crit Rev Biomed Eng. 2022;50(5):1-28. doi: 10.1615/CritRevBiomedEng.2022041442.
3
Laryngeal Anatomy, Molecular Biology, Cause, and Risk Factors for Laryngeal Cancer.喉解剖学、分子生物学、喉癌的病因和危险因素。
Otolaryngol Clin North Am. 2023 Apr;56(2):197-203. doi: 10.1016/j.otc.2022.12.001.
4
Variability in Voice Characteristics of Female Speakers With Phonotraumatic Vocal Fold Lesions.患有发声创伤性声带病变的女性说话者语音特征的变异性。
J Voice. 2025 Jul;39(4):1118-1129. doi: 10.1016/j.jvoice.2023.01.019. Epub 2023 Feb 20.
5
Trends in Incidence and Mortality of Larynx Cancer in the US.美国喉癌发病率和死亡率的趋势。
JAMA Otolaryngol Head Neck Surg. 2023 Jan 1;149(1):34-41. doi: 10.1001/jamaoto.2022.3636.
6
USPSTF Updates Recommendation for Obstructive Sleep Apnea Screening in Adults.美国预防医学工作组更新成人阻塞性睡眠呼吸暂停筛查建议。
JAMA Otolaryngol Head Neck Surg. 2023 Jan 1;149(1):5-6. doi: 10.1001/jamaoto.2022.3844.
7
Using SincNet for Learning Pathological Voice Disorders.基于 SincNet 学习病理性嗓音障碍。
Sensors (Basel). 2022 Sep 2;22(17):6634. doi: 10.3390/s22176634.
8
Diagnosis of Early Glottic Cancer Using Laryngeal Image and Voice Based on Ensemble Learning of Convolutional Neural Network Classifiers.基于卷积神经网络分类器集成学习的喉图像和嗓音用于早期声门癌诊断
J Voice. 2025 Jan;39(1):245-257. doi: 10.1016/j.jvoice.2022.07.007. Epub 2022 Sep 6.
9
Voice outcomes in patients with advanced laryngeal and hypopharyngeal cancer treated with chemo-radiotherapy.接受放化疗的晚期喉和下咽癌患者的嗓音结果。
Acta Otorhinolaryngol Ital. 2022 Jun;42(3):243-249. doi: 10.14639/0392-100X-N1992.
10
Hoarseness.声音嘶哑。
Med Clin North Am. 2021 Sep;105(5):917-938. doi: 10.1016/j.mcna.2021.05.012. Epub 2021 Jul 12.