INKA-Innovation Laboratory for Image Guided Therapy, Otto-von-Guericke University Magdeburg, 39120 Magdeburg, Germany.
Chair for Computer Aided Medical Procedures and Augmented Reality, Technical University of Munich, 85748 Munich, Germany.
Sensors (Basel). 2021 Dec 6;21(23):8157. doi: 10.3390/s21238157.
(1) Background: Contact Endoscopy (CE) and Narrow Band Imaging (NBI) are optical imaging modalities that can provide enhanced and magnified visualization of the superficial vascular networks in the laryngeal mucosa. The similarity of vascular structures between benign and malignant lesions causes a challenge in the visual assessment of CE-NBI images. The main objective of this study is to use Deep Convolutional Neural Networks (DCNN) for the automatic classification of CE-NBI images into benign and malignant groups with minimal human intervention. (2) Methods: A pretrained Res-Net50 model combined with the cut-off-layer technique was selected as the DCNN architecture. A dataset of 8181 CE-NBI images was used during the fine-tuning process in three experiments where several models were generated and validated. The accuracy, sensitivity, and specificity were calculated as the performance metrics in each validation and testing scenario. (3) Results: Out of a total of 72 trained and tested models in all experiments, Model 5 showed high performance. This model is considerably smaller than the full ResNet50 architecture and achieved the testing accuracy of 0.835 on the unseen data during the last experiment. (4) Conclusion: The proposed fine-tuned ResNet50 model showed a high performance to classify CE-NBI images into the benign and malignant groups and has the potential to be part of an assisted system for automatic laryngeal cancer detection.
(1) 背景:接触内镜(CE)和窄带成像(NBI)是光学成像方式,可以增强和放大喉黏膜表面血管网络的可视化效果。良性和恶性病变的血管结构相似,这使得 CE-NBI 图像的视觉评估具有挑战性。本研究的主要目的是使用深度卷积神经网络(DCNN),在最小的人为干预下,自动将 CE-NBI 图像分为良性和恶性组。
(2) 方法:选择经过预训练的 Res-Net50 模型结合截止层技术作为 DCNN 架构。在三个实验中,使用了 8181 张 CE-NBI 图像数据集进行微调过程,生成并验证了多个模型。在每个验证和测试场景中,计算准确性、敏感性和特异性作为性能指标。
(3) 结果:在所有实验中,总共训练和测试了 72 个模型,模型 5 表现出较高的性能。该模型明显小于完整的 ResNet50 架构,在最后一个实验中,对未见数据的测试准确率达到 0.835。
(4) 结论:所提出的微调 ResNet50 模型在将 CE-NBI 图像分类为良性和恶性组方面表现出较高的性能,并且有可能成为自动喉癌检测辅助系统的一部分。