• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于卷积神经网络分类器集成学习的喉图像和嗓音用于早期声门癌诊断

Diagnosis of Early Glottic Cancer Using Laryngeal Image and Voice Based on Ensemble Learning of Convolutional Neural Network Classifiers.

作者信息

Kwon Ickhwan, Wang Soo-Geun, Shin Sung-Chan, Cheon Yong-Il, Lee Byung-Joo, Lee Jin-Choon, Lim Dong-Won, Jo Cheolwoo, Cho Youngseuk, Shin Bum-Joo

机构信息

Department of Applied IT and Engineering, Pusan National University, Miryang, Gyeongsangnam-do, South Korea.

Department of Otorhinolaryngology-Head and Neck Surgery, College of Medicine, Pusan National University and Medical Research Institute, Pusan National University Hospital, Busan, South Korea.

出版信息

J Voice. 2025 Jan;39(1):245-257. doi: 10.1016/j.jvoice.2022.07.007. Epub 2022 Sep 6.

DOI:10.1016/j.jvoice.2022.07.007
PMID:36075802
Abstract

OBJECTIVES

The purpose of study is to improve the classification accuracy by comparing the results obtained by applying decision tree ensemble learning, which is one of the methods to increase the classification accuracy for a relatively small dataset, with the results obtained by the convolutional neural network (CNN) algorithm for the diagnosis of glottal cancer.

METHODS

Pusan National University Hospital (PNUH) dataset were used to establish classifiers and Pusan National University Yangsan Hospital (PNUYH) dataset were used to verify the classifier's performance in the generated model. For the diagnosis of glottic cancer, deep learning-based CNN models were established and classified using laryngeal image and voice data. Classification accuracy was obtained by performing decision tree ensemble learning using probability through CNN classification algorithm. In this process, the classification and regression tree (CART) method was used. Then, we compared the classification accuracy of decision tree ensemble learning with CNN individual classifiers by fusing the laryngeal image with the voice decision tree classifier.

RESULTS

We obtained classification accuracy of 81.03 % and 99.18 % in the established laryngeal image and voice classification models using PNUH training dataset, respectively. However, the classification accuracy of CNN classifiers decreased to 73.88 % in voice and 68.92 % in laryngeal image when using an external dataset of PNUYH. To solve this problem, decision tree ensemble learning of laryngeal image and voice was used, and the classification accuracy was improved by integrating data of laryngeal image and voice of the same person. The classification accuracy was 87.88 % and 89.06 % for the individualized laryngeal image and voice decision tree model respectively, and the fusion of the laryngeal image and voice decision tree results represented a classification accuracy of 95.31 %.

CONCLUSION

The results of our study suggest that decision tree ensemble learning aimed at training multiple classifiers is useful to obtain an increased classification accuracy despite a small dataset. Although a large data amount is essential for AI analysis, when an integrated approach is taken by combining various input data high diagnostic classification accuracy can be expected.

摘要

目的

本研究的目的是通过比较应用决策树集成学习(这是提高相对较小数据集分类准确率的方法之一)所获得的结果与卷积神经网络(CNN)算法用于声门癌诊断所获得的结果,来提高分类准确率。

方法

使用釜山国立大学医院(PNUH)数据集建立分类器,并使用釜山国立大学梁山医院(PNUYH)数据集验证生成模型中分类器的性能。对于声门癌的诊断,使用基于深度学习的CNN模型,并利用喉部图像和语音数据进行分类。通过使用CNN分类算法的概率执行决策树集成学习来获得分类准确率。在此过程中,使用了分类与回归树(CART)方法。然后,通过将喉部图像与语音决策树分类器融合,比较决策树集成学习与CNN单个分类器的分类准确率。

结果

在使用PNUH训练数据集建立的喉部图像和语音分类模型中,我们分别获得了81.03%和99.18%的分类准确率。然而,当使用PNUYH的外部数据集时,CNN分类器的分类准确率在语音方面降至73.88%,在喉部图像方面降至68.92%。为了解决这个问题,使用了喉部图像和语音的决策树集成学习,并通过整合同一人的喉部图像和语音数据提高了分类准确率。个性化喉部图像和语音决策树模型的分类准确率分别为87.88%和89.06%,喉部图像和语音决策树结果的融合代表分类准确率为95.31%。

结论

我们的研究结果表明,旨在训练多个分类器的决策树集成学习对于在数据集较小的情况下提高分类准确率是有用的。尽管大量数据对于人工智能分析至关重要,但当采用综合方法结合各种输入数据时,可以预期获得较高的诊断分类准确率。

相似文献

1
Diagnosis of Early Glottic Cancer Using Laryngeal Image and Voice Based on Ensemble Learning of Convolutional Neural Network Classifiers.基于卷积神经网络分类器集成学习的喉图像和嗓音用于早期声门癌诊断
J Voice. 2025 Jan;39(1):245-257. doi: 10.1016/j.jvoice.2022.07.007. Epub 2022 Sep 6.
2
Towards laryngeal cancer diagnosis using Dandelion Optimizer Algorithm with ensemble learning on biomedical throat region images.基于生物医学喉部图像的 Dandelion Optimizer 算法集成学习进行喉癌诊断。
Sci Rep. 2024 Aug 24;14(1):19713. doi: 10.1038/s41598-024-70525-0.
3
AI Detection of Glottic Neoplasm Using Voice Signals, Demographics, and Structured Medical Records.利用语音信号、人口统计学和结构化医疗记录进行声带肿瘤的人工智能检测。
Laryngoscope. 2024 Nov;134(11):4585-4592. doi: 10.1002/lary.31563. Epub 2024 Jun 12.
4
Deep learning assisted detection of glaucomatous optic neuropathy and potential designs for a generalizable model.深度学习辅助青光眼视神经病变检测及通用模型的潜在设计。
PLoS One. 2020 May 14;15(5):e0233079. doi: 10.1371/journal.pone.0233079. eCollection 2020.
5
Construction of prediction model of early glottic cancer based on machine learning.基于机器学习的早期声门癌预测模型构建
Acta Otolaryngol. 2025 Jan;145(1):72-80. doi: 10.1080/00016489.2024.2430613. Epub 2024 Dec 30.
6
Detection of Vocal Fold Image Obstructions in High-Speed Videoendoscopy During Connected Speech in Adductor Spasmodic Dysphonia: A Convolutional Neural Networks Approach.基于卷积神经网络的痉挛性发声障碍患者连接性言语时高速视频内镜下声带图像遮挡的检测。
J Voice. 2024 Jul;38(4):951-962. doi: 10.1016/j.jvoice.2022.01.028. Epub 2022 Mar 16.
7
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
8
Reviewing ensemble classification methods in breast cancer.综述乳腺癌中的集成分类方法。
Comput Methods Programs Biomed. 2019 Aug;177:89-112. doi: 10.1016/j.cmpb.2019.05.019. Epub 2019 May 20.
9
Demographic and Symptomatic Features of Voice Disorders and Their Potential Application in Classification Using Machine Learning Algorithms.嗓音障碍的人口统计学和症状学特征及其在使用机器学习算法进行分类中的潜在应用。
Folia Phoniatr Logop. 2018;70(3-4):174-182. doi: 10.1159/000492327. Epub 2018 Sep 5.
10
Optimized classification of dental implants using convolutional neural networks and pre-trained models with preprocessed data.使用卷积神经网络和带有预处理数据的预训练模型对牙种植体进行优化分类。
BMC Oral Health. 2025 Apr 11;25(1):535. doi: 10.1186/s12903-025-05704-0.

引用本文的文献

1
A Deep-Learning Model for Multi-class Audio Classification of Vocal Fold Pathologies in Office Stroboscopy.一种用于办公室频闪喉镜检查中声带病变多类别音频分类的深度学习模型。
Laryngoscope. 2025 Jul;135(7):2428-2436. doi: 10.1002/lary.32036. Epub 2025 Feb 5.
2
Laryngeal disease classification using voice data: Octave-band vs. mel-frequency filters.使用语音数据进行喉疾病分类:倍频程滤波器与梅尔频率滤波器
Heliyon. 2024 Nov 30;10(24):e40748. doi: 10.1016/j.heliyon.2024.e40748. eCollection 2024 Dec 30.
3
Towards laryngeal cancer diagnosis using Dandelion Optimizer Algorithm with ensemble learning on biomedical throat region images.
基于生物医学喉部图像的 Dandelion Optimizer 算法集成学习进行喉癌诊断。
Sci Rep. 2024 Aug 24;14(1):19713. doi: 10.1038/s41598-024-70525-0.
4
New developments in the application of artificial intelligence to laryngology.人工智能在喉科学中的应用新进展。
Curr Opin Otolaryngol Head Neck Surg. 2024 Dec 1;32(6):391-397. doi: 10.1097/MOO.0000000000000999. Epub 2024 Jul 24.
5
Depression recognition using voice-based pre-training model.基于语音的预训练模型进行抑郁识别。
Sci Rep. 2024 Jun 3;14(1):12734. doi: 10.1038/s41598-024-63556-0.
6
Classification of laryngeal diseases including laryngeal cancer, benign mucosal disease, and vocal cord paralysis by artificial intelligence using voice analysis.利用语音分析通过人工智能对包括喉癌、良性黏膜疾病和声带麻痹在内的喉部疾病进行分类。
Sci Rep. 2024 Apr 23;14(1):9297. doi: 10.1038/s41598-024-58817-x.
7
Automated Laryngeal Cancer Detection and Classification Using Dwarf Mongoose Optimization Algorithm with Deep Learning.基于矮猫鼬优化算法与深度学习的自动喉癌检测与分类
Cancers (Basel). 2023 Dec 29;16(1):181. doi: 10.3390/cancers16010181.
8
Real-time detection of laryngopharyngeal cancer using an artificial intelligence-assisted system with multimodal data.利用多模态数据的人工智能辅助系统实时检测喉咽癌。
J Transl Med. 2023 Oct 7;21(1):698. doi: 10.1186/s12967-023-04572-y.