• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

卷积神经网络可高精度地对喉癌中的病理性声音变化进行分类。

Convolutional Neural Network Classifies Pathological Voice Change in Laryngeal Cancer with High Accuracy.

作者信息

Kim HyunBum, Jeon Juhyeong, Han Yeon Jae, Joo YoungHoon, Lee Jonghwan, Lee Seungchul, Im Sun

机构信息

Department of Otolaryngology-Head and Neck Surgery, Bucheon St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul 06591, Korea.

Department of Mechanical Engineering, Pohang University of Science and Technology (POSTECH), Pohang 37673, Korea.

出版信息

J Clin Med. 2020 Oct 25;9(11):3415. doi: 10.3390/jcm9113415.

DOI:10.3390/jcm9113415
PMID:33113785
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7692693/
Abstract

Voice changes may be the earliest signs in laryngeal cancer. We investigated whether automated voice signal analysis can be used to distinguish patients with laryngeal cancer from healthy subjects. We extracted features using the software package for speech analysis in phonetics (PRAAT) and calculated the Mel-frequency cepstral coefficients (MFCCs) from voice samples of a vowel sound of /a:/. The proposed method was tested with six algorithms: support vector machine (SVM), extreme gradient boosting (XGBoost), light gradient boosted machine (LGBM), artificial neural network (ANN), one-dimensional convolutional neural network (1D-CNN) and two-dimensional convolutional neural network (2D-CNN). Their performances were evaluated in terms of accuracy, sensitivity, and specificity. The result was compared with human performance. A total of four volunteers, two of whom were trained laryngologists, rated the same files. The 1D-CNN showed the highest accuracy of 85% and sensitivity and sensitivity and specificity levels of 78% and 93%. The two laryngologists achieved accuracy of 69.9% but sensitivity levels of 44%. Automated analysis of voice signals could differentiate subjects with laryngeal cancer from those of healthy subjects with higher diagnostic properties than those performed by the four volunteers.

摘要

嗓音变化可能是喉癌最早的症状。我们研究了自动语音信号分析是否可用于区分喉癌患者与健康受试者。我们使用语音学语音分析软件包(PRAAT)提取特征,并从/a:/元音的语音样本中计算梅尔频率倒谱系数(MFCC)。所提出的方法用六种算法进行了测试:支持向量机(SVM)、极端梯度提升(XGBoost)、轻量级梯度提升机(LGBM)、人工神经网络(ANN)、一维卷积神经网络(1D-CNN)和二维卷积神经网络(2D-CNN)。根据准确率、敏感性和特异性对它们的性能进行了评估。结果与人工评判进行了比较。共有四名志愿者,其中两名是训练有素的喉科医生,对相同的文件进行了评分。1D-CNN显示出最高准确率85%,敏感性和特异性水平分别为78%和93%。两名喉科医生的准确率为69.9%,但敏感性水平为44%。语音信号的自动分析能够将喉癌患者与健康受试者区分开来,其诊断性能高于四名志愿者的表现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/2b1a6a974439/jcm-09-03415-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/22fd30330486/jcm-09-03415-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/e79ca9cc9267/jcm-09-03415-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/e818c3978914/jcm-09-03415-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/6d3fe5cf70b5/jcm-09-03415-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/21b7c85bc798/jcm-09-03415-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/b5947130fea2/jcm-09-03415-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/2b1a6a974439/jcm-09-03415-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/22fd30330486/jcm-09-03415-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/e79ca9cc9267/jcm-09-03415-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/e818c3978914/jcm-09-03415-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/6d3fe5cf70b5/jcm-09-03415-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/21b7c85bc798/jcm-09-03415-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/b5947130fea2/jcm-09-03415-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ef9/7692693/2b1a6a974439/jcm-09-03415-g007.jpg

相似文献

1
Convolutional Neural Network Classifies Pathological Voice Change in Laryngeal Cancer with High Accuracy.卷积神经网络可高精度地对喉癌中的病理性声音变化进行分类。
J Clin Med. 2020 Oct 25;9(11):3415. doi: 10.3390/jcm9113415.
2
Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach.基于倒谱向量的病理性嗓音检测:深度学习方法。
J Voice. 2019 Sep;33(5):634-641. doi: 10.1016/j.jvoice.2018.02.003. Epub 2018 Mar 19.
3
Classification of Voice Disorders Using a One-Dimensional Convolutional Neural Network.基于一维卷积神经网络的嗓音障碍分类
J Voice. 2022 Jan;36(1):15-20. doi: 10.1016/j.jvoice.2020.02.009. Epub 2020 Mar 13.
4
Diagnosis of Early Glottic Cancer Using Laryngeal Image and Voice Based on Ensemble Learning of Convolutional Neural Network Classifiers.基于卷积神经网络分类器集成学习的喉图像和嗓音用于早期声门癌诊断
J Voice. 2025 Jan;39(1):245-257. doi: 10.1016/j.jvoice.2022.07.007. Epub 2022 Sep 6.
5
Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning.基于梅尔频率倒谱系数的深度学习在声谱域中的神经退行性疾病诊断
Comput Math Methods Med. 2022 Apr 4;2022:4364186. doi: 10.1155/2022/4364186. eCollection 2022.
6
Deep Neural Network for Automatic Classification of Pathological Voice Signals.深度神经网络在病理嗓音信号自动分类中的应用。
J Voice. 2022 Mar;36(2):288.e15-288.e24. doi: 10.1016/j.jvoice.2020.05.029. Epub 2020 Jul 10.
7
Automated MRI-Based Deep Learning Model for Detection of Alzheimer's Disease Process.基于 MRI 的自动化深度学习模型用于阿尔茨海默病进程的检测。
Int J Neural Syst. 2020 Jun;30(6):2050032. doi: 10.1142/S012906572050032X.
8
Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures.基于深度学习和机器学习的语音分析用于新冠病毒检测:架构建议与比较
Knowl Based Syst. 2022 Oct 11;253:109539. doi: 10.1016/j.knosys.2022.109539. Epub 2022 Jul 28.
9
A temporal dependency feature in lower dimension for lung sound signal classification.肺部声音信号分类的低维时间依赖特征。
Sci Rep. 2022 May 12;12(1):7889. doi: 10.1038/s41598-022-11726-3.
10
Voice pathology detection using optimized convolutional neural networks and explainable artificial intelligence-based analysis.基于优化卷积神经网络和可解释人工智能的语音病理学检测。
Comput Methods Biomech Biomed Engin. 2024 Nov;27(14):2041-2057. doi: 10.1080/10255842.2023.2270102. Epub 2023 Oct 18.

引用本文的文献

1
Artificial Intelligence Assurance in Head and Neck Surgery: Now and Next.头颈外科中的人工智能保障:现状与未来。
Proc IEEE Int Symp Comput Based Med Syst. 2025 Jun;2025:977-982. doi: 10.1109/cbms65348.2025.00195. Epub 2025 Jul 4.
2
Research on automatic assessment of the severity of unilateral vocal cord paralysis based on Mel-spectrogram and convolutional neural networks.基于梅尔频谱图和卷积神经网络的单侧声带麻痹严重程度自动评估研究
Biomed Eng Online. 2025 Jun 21;24(1):76. doi: 10.1186/s12938-025-01401-9.
3
Deep learning-based classification of speech disorder in stroke and hearing impairment.

本文引用的文献

1
Changes in head and neck oncologic practice during the COVID-19 pandemic.COVID-19 大流行期间头颈部肿瘤诊治的变化。
Head Neck. 2020 Jul;42(7):1448-1453. doi: 10.1002/hed.26233. Epub 2020 Jun 11.
2
Evaluating the effects of smoking on the voice and subjective voice problems using a meta-analysis approach.采用荟萃分析方法评估吸烟对嗓音及主观嗓音问题的影响。
Sci Rep. 2020 Mar 13;10(1):4720. doi: 10.1038/s41598-020-61565-3.
3
Individualized prediction of late-onset dysphagia in head and neck cancer survivors.头颈部癌症幸存者迟发性吞咽困难的个体化预测
基于深度学习的中风和听力障碍语音障碍分类
PLoS One. 2025 May 28;20(5):e0315286. doi: 10.1371/journal.pone.0315286. eCollection 2025.
4
Real-time airway monitoring system using binary classification model based on respiratory sounds of rabbits with a tracheostomy tube.基于带气管造口管兔子呼吸音的二元分类模型的实时气道监测系统
Sci Rep. 2025 Apr 29;15(1):15014. doi: 10.1038/s41598-025-98546-3.
5
Vocal Feature Changes for Monitoring Parkinson's Disease Progression-A Systematic Review.用于监测帕金森病进展的嗓音特征变化——一项系统综述
Brain Sci. 2025 Mar 19;15(3):320. doi: 10.3390/brainsci15030320.
6
Bibliometric analysis of the Top 1000 most-cited articles in otolaryngology over the past decade: global research trends and hotspots.过去十年耳鼻喉科领域被引用次数最多的1000篇文章的文献计量分析:全球研究趋势与热点
Front Surg. 2025 Feb 17;12:1552102. doi: 10.3389/fsurg.2025.1552102. eCollection 2025.
7
Laryngeal disease classification using voice data: Octave-band vs. mel-frequency filters.使用语音数据进行喉疾病分类:倍频程滤波器与梅尔频率滤波器
Heliyon. 2024 Nov 30;10(24):e40748. doi: 10.1016/j.heliyon.2024.e40748. eCollection 2024 Dec 30.
8
Application of artificial intelligence in laryngeal lesions: a systematic review and meta-analysis.人工智能在喉部病变中的应用:一项系统评价和荟萃分析。
Eur Arch Otorhinolaryngol. 2025 Mar;282(3):1543-1555. doi: 10.1007/s00405-024-09075-0. Epub 2024 Nov 22.
9
Improving laryngeal cancer detection using chaotic metaheuristics integration with squeeze-and-excitation resnet model.使用混沌元启发式算法与挤压激励残差网络模型相结合来改善喉癌检测
Health Inf Sci Syst. 2024 Jul 12;12(1):38. doi: 10.1007/s13755-024-00296-5. eCollection 2024 Dec.
10
Voice as a Biomarker of Pediatric Health: A Scoping Review.声音作为儿童健康的生物标志物:一项范围综述。
Children (Basel). 2024 Jun 4;11(6):684. doi: 10.3390/children11060684.
Head Neck. 2020 Apr;42(4):708-718. doi: 10.1002/hed.26039. Epub 2020 Feb 7.
4
Cancer statistics, 2020.癌症统计数据,2020 年。
CA Cancer J Clin. 2020 Jan;70(1):7-30. doi: 10.3322/caac.21590. Epub 2020 Jan 8.
5
Multi-modal neuroimaging feature selection with consistent metric constraint for diagnosis of Alzheimer's disease.多模态神经影像学特征选择与一致度量约束相结合,用于阿尔茨海默病的诊断。
Med Image Anal. 2020 Feb;60:101625. doi: 10.1016/j.media.2019.101625. Epub 2019 Dec 2.
6
Factors affecting voice quality in early glottic cancer before and after radiotherapy.放疗前后早期声门型喉癌患者嗓音质量的影响因素。
Radiol Oncol. 2019 Oct 18;53(4):459-464. doi: 10.2478/raon-2019-0050.
7
Computer-aided diagnosis of laryngeal cancer via deep learning based on laryngoscopic images.基于喉镜图像的深度学习辅助喉癌计算机辅助诊断。
EBioMedicine. 2019 Oct;48:92-99. doi: 10.1016/j.ebiom.2019.08.075. Epub 2019 Oct 5.
8
Effects of oral and oropharyngeal cancer on speech intelligibility using acoustic analysis: Systematic review.运用声学分析评估口腔和口咽癌对言语清晰度的影响:系统评价
Head Neck. 2020 Jan;42(1):111-130. doi: 10.1002/hed.25949. Epub 2019 Sep 30.
9
Intelligent Fault Diagnosis of Diesel Engines via Extreme Gradient Boosting and High-Accuracy Time-Frequency Information of Vibration Signals.基于极限梯度提升和振动信号高精度时频信息的柴油机智能故障诊断
Sensors (Basel). 2019 Jul 25;19(15):3280. doi: 10.3390/s19153280.
10
Artificial intelligence in the field of electrodiagnosis - A new threat or heralding a new era in electromyography?电诊断领域中的人工智能——是新威胁还是预示着肌电图学的新时代?
Clin Neurophysiol. 2019 Oct;130(10):1995-1996. doi: 10.1016/j.clinph.2019.06.005. Epub 2019 Jun 24.