School of Informatics, Xiamen University, Xiamen, Fujian, China.
Department of Stomatology, Zhongshan Hospital (Xiamen), Fudan University, Xiamen, Fujian, China.
Technol Health Care. 2024;32(S1):465-475. doi: 10.3233/THC-248041.
Oral cancer is a malignant tumor that usually occurs within the tissues of the mouth. This type of cancer mainly includes tumors in the lining of the mouth, tongue, lips, buccal mucosa and gums. Oral cancer is on the rise globally, especially in some specific risk groups. The early stage of oral cancer is usually asymptomatic, while the late stage may present with ulcers, lumps, bleeding, etc.
The objective of this paper is to propose an effective and accurate method for the identification and classification of oral cancer.
We applied two deep learning methods, CNN and Transformers. First, we propose a new CANet classification model for oral cancer, which uses attention mechanisms combined with neglected location information to explore the complex combination of attention mechanisms and deep networks, and fully tap the potential of attention mechanisms. Secondly, we design a classification model based on Swim transform. The image is segmented into a series of two-dimensional image blocks, which are then processed by multiple layers of conversion blocks.
The proposed classification model was trained and predicted on Kaggle Oral Cancer Images Dataset, and satisfactory results were obtained. The average accuracy, sensitivity, specificity and F1-Socre of Swin transformer architecture are 94.95%, 95.37%, 95.52% and 94.66%, respectively. The average accuracy, sensitivity, specificity and F1-Score of CANet model were 97.00%, 97.82%, 97.82% and 96.61%, respectively.
We studied different deep learning algorithms for oral cancer classification, including convolutional neural networks, converters, etc. Our Attention module in CANet leverages the benefits of channel attention to model the relationships between channels while encoding precise location information that captures the long-term dependencies of the network. The model achieves a high classification effect with an accuracy of 97.00%, which can be used in the automatic recognition and classification of oral cancer.
口腔癌是一种恶性肿瘤,通常发生在口腔组织内。这种癌症主要包括口腔内、舌、唇、颊粘膜和牙龈的肿瘤。口腔癌在全球呈上升趋势,特别是在一些特定的高危人群中。口腔癌早期通常无症状,晚期可能出现溃疡、肿块、出血等症状。
本研究旨在提出一种有效的口腔癌识别和分类方法。
我们应用了两种深度学习方法,CNN 和 Transformer。首先,我们提出了一种新的 CANet 口腔癌分类模型,该模型使用注意力机制结合被忽略的位置信息,探索注意力机制与深度网络的复杂组合,充分挖掘注意力机制的潜力。其次,我们设计了一种基于 Swim 变换的分类模型。该图像被分割成一系列二维图像块,然后由多个转换块进行处理。
我们在 Kaggle 口腔癌图像数据集上对所提出的分类模型进行了训练和预测,得到了令人满意的结果。Swin 转换器架构的平均准确率、灵敏度、特异性和 F1-Score 分别为 94.95%、95.37%、95.52%和 94.66%。CANet 模型的平均准确率、灵敏度、特异性和 F1-Score 分别为 97.00%、97.82%、97.82%和 96.61%。
我们研究了不同的深度学习算法用于口腔癌分类,包括卷积神经网络、转换器等。我们的 CANet 中的注意力模块利用通道注意力的优势来建立通道之间的关系,同时编码捕捉网络长期依赖关系的精确位置信息。该模型的分类效果非常好,准确率达到 97.00%,可用于口腔癌的自动识别和分类。