Zhang Rui, Lu Miao, Zhang Jiayuan, Chen Xiaoyan, Zhu Fudong, Tian Xiang, Chen Yaowu, Cao Yuqi
Zhejiang Provincial Key Laboratory of Internet Multimedia Technology, College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou 310027, China.
Key Laboratory of Oral Biomedical Research of Zhejiang Province, Cancer Center of Zhejiang University, Stomatology Hospital, School of Stomatology, Zhejiang University School of Medicine, Zhejiang Provincial Clinical Research Center for Oral Diseases, Engineering Research Center of Oral Biomaterials and Devices of Zhejiang Province, Hangzhou 310053, China.
Bioengineering (Basel). 2024 Nov 2;11(11):1107. doi: 10.3390/bioengineering11111107.
Given the complexity of oral mucosal disease diagnosis and the limitations in the precision of traditional object detection methods, this study aims to develop a high-accuracy artificial intelligence-assisted diagnostic approach based on the SegFormer semantic segmentation model. This method is designed to automatically segment lesion areas in white-light images of oral mucosal diseases, providing objective and quantifiable evidence for clinical diagnosis. This study utilized a dataset of oral mucosal diseases provided by the Affiliated Stomatological Hospital of Zhejiang University School of Medicine, comprising 838 high-resolution images of three diseases: oral lichen planus, oral leukoplakia, and oral submucous fibrosis. These images were annotated at the pixel level by oral specialists using Labelme software (v5.5.0) to construct a semantic segmentation dataset. This study designed a SegFormer model based on the Transformer architecture, employed cross-validation to divide training and testing sets, and compared SegFormer models of different capacities with classical segmentation models such as UNet and DeepLabV3. Quantitative metrics including the Dice coefficient and mIoU were evaluated, and a qualitative visual analysis of the segmentation results was performed to comprehensively assess model performance. The SegFormer-B2 model achieved optimal performance on the test set, with a Dice coefficient of 0.710 and mIoU of 0.786, significantly outperforming other comparative algorithms. The visual results demonstrate that this model could accurately segment the lesion areas of three common oral mucosal diseases. The SegFormer model proposed in this study effectively achieves the precise automatic segmentation of three common oral mucosal diseases, providing a reliable auxiliary tool for clinical diagnosis. It shows promising prospects in improving the efficiency and accuracy of oral mucosal disease diagnosis and has potential clinical application value.
鉴于口腔黏膜疾病诊断的复杂性以及传统目标检测方法在精度上的局限性,本研究旨在基于SegFormer语义分割模型开发一种高精度的人工智能辅助诊断方法。该方法旨在自动分割口腔黏膜疾病白光图像中的病变区域,为临床诊断提供客观且可量化的依据。本研究使用了浙江大学医学院附属口腔医院提供的口腔黏膜疾病数据集,其中包含838张三种疾病的高分辨率图像:口腔扁平苔藓、口腔白斑和口腔黏膜下纤维化。这些图像由口腔专家使用Labelme软件(v5.5.0)在像素级别进行标注,以构建语义分割数据集。本研究基于Transformer架构设计了SegFormer模型,采用交叉验证划分训练集和测试集,并将不同容量的SegFormer模型与UNet和DeepLabV3等经典分割模型进行比较。评估了包括Dice系数和mIoU在内的定量指标,并对分割结果进行了定性视觉分析,以全面评估模型性能。SegFormer-B2模型在测试集上取得了最优性能,Dice系数为0.710,mIoU为0.786,显著优于其他对比算法。视觉结果表明,该模型能够准确分割三种常见口腔黏膜疾病的病变区域。本研究提出的SegFormer模型有效地实现了三种常见口腔黏膜疾病的精确自动分割,为临床诊断提供了可靠的辅助工具。它在提高口腔黏膜疾病诊断的效率和准确性方面显示出广阔前景,具有潜在的临床应用价值。