Deng Feilin, Li Shangxuan, Yang Zizhu, Zhou Wu
School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, China.
Front Physiol. 2025 Jun 11;16:1598850. doi: 10.3389/fphys.2025.1598850. eCollection 2025.
Recognizing a tooth-marked tongue has important clinical diagnostic value in traditional Chinese medicine. Current deep learning methods for tooth mark detection require extensive manual labeling and tongue segmentation, which is labor-intensive. Therefore, we propose a weakly supervised multipleinstance active learning model for tooth-marked tongue recognition, aiming to eliminate preprocessing segmentation and reduce the annotation workload while maintaining diagnostic accuracy.
We propose a one-stage method tongenerate tooth mark instances that eliminates the need for pre-segmentation of the tongue. To make full use of unlabeled data, we introduce a semisupervised learning paradigm to pseudo-label unlabeled tongue images with high model confidence in active learning and incorporate them into the training set to improve the training efficiency of the active learning model. In addition, we propose an instance-level hybrid query method considering the diversity of tooth marks.
Experimental results on clinical tongue images verify the effectiveness of the proposed method, which achieves an accuracy of 93.88% for tooth-marked tongue recognition, outperforming the recently introduced weakly supervised approaches.
The proposed method is effective with only a small amount of image-level annotation, and its performance is comparable to that of image-level annotation, instance-level annotation and pixel-level annotation, which require a large number of tooth markers. Our method significantly reduces the annotation cost of the binary classification task of traditional Chinese medicine tooth mark recognition.
在中医中,识别齿痕舌具有重要的临床诊断价值。当前用于齿痕检测的深度学习方法需要大量的人工标注和舌部分割,这是一项劳动密集型工作。因此,我们提出了一种用于齿痕舌识别的弱监督多实例主动学习模型,旨在消除预处理分割并减少标注工作量,同时保持诊断准确性。
我们提出了一种单阶段方法来生成齿痕实例,从而无需对舌头进行预分割。为了充分利用未标注数据,我们引入了一种半监督学习范式,在主动学习中用模型置信度高的伪标签标注未标注的舌图像,并将它们纳入训练集以提高主动学习模型的训练效率。此外,我们提出了一种考虑齿痕多样性的实例级混合查询方法。
在临床舌图像上的实验结果验证了所提方法的有效性,该方法在齿痕舌识别中达到了93.88%的准确率,优于最近引入的弱监督方法。
所提方法仅需少量图像级标注就有效,其性能与需要大量齿痕标注的图像级标注、实例级标注和像素级标注相当。我们的方法显著降低了中医齿痕识别二分类任务的标注成本。