Tran Bich Anh, Dao Thao Thi Phuong, Dung Ho Dang Quy, Van Ngoc Boi, Ha Chanh Cong, Pham Nam Hoang, Nguyen Tu Cong Huyen Ton Nu Cam, Nguyen Tan-Cong, Pham Minh-Khoi, Tran Mai-Khiem, Tran Truong Minh, Tran Minh-Triet
Otorhinolaryngology Department, Cho Ray Hospital, Ho Chi Minh City, Viet Nam.
University of Science, VNUHCM, Ho Chi Minh City, Viet Nam; John von Neumann Institute, VNUHCM, Ho Chi Minh City, Viet Nam; Vietnam National University, Ho Chi Minh City, Viet Nam; Department of Otolaryngology, Thong Nhat Hospital, Ho Chi Minh City, Viet Nam.
Am J Otolaryngol. 2023 May-Jun;44(3):103800. doi: 10.1016/j.amjoto.2023.103800. Epub 2023 Feb 24.
To collect a dataset of sufficient flexible laryngoscopy images and to identify the appearance of the vocal folds and their lesions in these images using objective deep learning models.
We adopted several state-of-the-art deep learning models and trained them to classify 4549 flexible laryngoscopy images into three classes: no vocal fold, normal vocal folds, and abnormal vocal folds, so that the models learn to recognize the vocal folds and their lesions in these images. We then compared the results of these deep learning models against one another, and compared the computer-aided classification system against ENT doctors.
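As a rough illustration of the classification setup described above, the sketch below builds an ImageNet-pretrained Xception backbone with a three-way softmax head in Keras/TensorFlow. This is a minimal sketch under assumed settings; the input resolution, optimizer, learning rate, and fine-tuning schedule are illustrative choices, not the authors' exact configuration.

```python
# Minimal sketch of a three-class laryngoscopy image classifier (assumed setup,
# not the authors' exact training pipeline).
import tensorflow as tf
from tensorflow.keras import layers, models
from tensorflow.keras.applications import Xception

NUM_CLASSES = 3  # no vocal fold, normal vocal folds, abnormal vocal folds

def build_classifier(input_shape=(299, 299, 3)):
    # Xception backbone pretrained on ImageNet, without its original top layer,
    # followed by global average pooling and a three-way softmax head.
    backbone = Xception(include_top=False, weights="imagenet",
                        input_shape=input_shape, pooling="avg")
    outputs = layers.Dense(NUM_CLASSES, activation="softmax")(backbone.output)
    model = models.Model(backbone.input, outputs)
    model.compile(optimizer=tf.keras.optimizers.Adam(1e-4),
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

model = build_classifier()
model.summary()
```

The other architectures compared in the study would be swapped in the same way, by replacing the backbone while keeping the three-class head.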
This study evaluated the performance of the deep learning models on laryngoscopy images collected from 876 patients. The Xception model was more accurate and more consistent than most of the other models: its accuracies for the no vocal fold, normal vocal fold, and abnormal vocal fold classes were 98.90 %, 97.36 %, and 96.26 %, respectively. Compared with our ENT doctors, the Xception model outperformed a junior doctor and approached the performance of an expert.
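The abstract does not state how the per-class accuracies were computed; one common convention is one-vs-rest accuracy derived from the confusion matrix, sketched below with dummy labels and predictions (not the study's data).

```python
# Sketch of per-class (one-vs-rest) accuracy from a confusion matrix;
# y_true and y_pred are dummy placeholders, not the study's results.
import numpy as np
from sklearn.metrics import confusion_matrix

CLASS_NAMES = ["no vocal fold", "normal vocal folds", "abnormal vocal folds"]

y_true = np.array([0, 0, 1, 1, 2, 2, 2, 1, 0, 2])   # ground-truth labels (dummy)
y_pred = np.array([0, 0, 1, 2, 2, 2, 2, 1, 0, 1])   # model predictions (dummy)

cm = confusion_matrix(y_true, y_pred, labels=[0, 1, 2])
total = cm.sum()
for i, name in enumerate(CLASS_NAMES):
    tp = cm[i, i]                    # correctly predicted as class i
    fn = cm[i].sum() - tp            # class i missed
    fp = cm[:, i].sum() - tp         # other classes predicted as i
    tn = total - tp - fn - fp        # everything else
    acc = (tp + tn) / total          # one-vs-rest accuracy for class i
    print(f"{name}: accuracy = {acc:.2%}")
```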
Our results show that current deep learning models can classify vocal fold images well and can effectively assist physicians in identifying the vocal folds and classifying them as normal or abnormal.