Zhang Han, Jiang Rongrong, Yang Tao, Gao Jiayi, Wang Yi, Zhang Junfeng
School of Artificial Intelligence and Information Technology, Nanjing University of Chinese Medicine, Nanjing, China.
School of Nursing, Nanjing University of Chinese Medicine, Nanjing, China.
Evid Based Complement Alternat Med. 2022 Mar 8;2022:3943920. doi: 10.1155/2022/3943920. eCollection 2022.
Tongue image segmentation is a base work of TCM tongue processing. Nowadays, deep learning methods are widely used on tongue segmentation, which has better performance than conventional methods. However, when the tongue color is close to the color of the adjoining area, the contour of tongue segmentation by deep learning may be coarse which could influence the subsequent analysis. Here a novel tongue image segmentation model based on a convolutional neural network fused with superpixel was proposed to solve the problem. . On the basis of a convolutional neural network fused with superpixel, the novel tongue image segmentation model SpurNet was proposed in this study. The residual structure of ResNet18 was introduced as the feature extraction layer on the encoding path, to construct the first stage processing module UrNet of SpurNet. The superpixel segmentation was fused with UrNet to form the second stage process of SpurNet. To verify the effect of SpurNet. The models before and after fusion with superpixel, classical image segmentation models FCN and DeepLab were compared with SpurNet on the dataset of 367 manually labeled tongue images. . The SpurNet model performance test with 10-fold cross-validation showed PA of 0.9145 ± 0.0043, MPA of 0.9168 ± 0.0048, MIoU of 0.8417 ± 0.0072 and FWIoU of 0.8454 ± 0.0072. Relative to FCN, DeepLab and their superpixel fused models, the SpurNet model was superior in tongue image segmentation and could increase PA by 1.91%-3.17%, MPA by 1.38%-2.61%, MIoU by 3.09%-5.07%, and FWIoU by 3.11%-5.08%. Compared to UrNet, the first stage processing module, the SpurNet model also increased the PA, MPA, MIoU and FWIoU by 0.15%, 0.09%, 0.24% and 0.24%, respectively. . The SpurNet model, after fusing with superpixel image segmentation, can better accomplish the task of tongue image segmentation, more accurately process the margins of tongue and resolve the over-segmentation and under-segmentation. The thought of this study is a new exploration in the field of tongue image segmentation, which could provide a reference for the modern research on TCM tongue images.
舌象分割是中医舌诊处理的基础工作。如今,深度学习方法在舌象分割中被广泛应用,其性能优于传统方法。然而,当舌色与相邻区域颜色相近时,深度学习进行舌象分割得到的轮廓可能会比较粗糙,这会影响后续分析。在此提出一种基于卷积神经网络融合超像素的新型舌象分割模型来解决该问题。本研究基于卷积神经网络融合超像素,提出了新型舌象分割模型SpurNet。引入ResNet18的残差结构作为编码路径上的特征提取层,构建SpurNet的第一阶段处理模块UrNet。将超像素分割与UrNet融合形成SpurNet的第二阶段处理。为验证SpurNet的效果,在367张人工标注的舌象图像数据集上,将融合超像素前后的模型、经典图像分割模型FCN和DeepLab与SpurNet进行比较。SpurNet模型10折交叉验证的性能测试显示,像素精度(PA)为0.9145±0.0043,平均像素精度(MPA)为0.9168±0.0048,交并比(MIoU)为0.8417±0.0072,频率加权交并比(FWIoU)为0.8454±0.0072。相对于FCN、DeepLab及其融合超像素的模型,SpurNet模型在舌象分割方面表现更优,PA可提高1.91%-3.17%,MPA可提高1.38%-2.61%,MIoU可提高3.09%-5.07%,FWIoU可提高3.11%-5.08%。与第一阶段处理模块UrNet相比,SpurNet模型的PA、MPA、MIoU和FWIoU也分别提高了0.15%、0.09%、0.24%和0.24%。SpurNet模型融合超像素图像分割后,能更好地完成舌象分割任务,更准确地处理舌象边缘,解决过分割和欠分割问题。本研究思路是舌象分割领域的新探索,可为中医舌象图像的现代研究提供参考。