Wei Wei, Zhang Xiao-Lei, Wang Hong-Zhen, Wang Lin-Lin, Wen Jing-Li, Han Xin, Liu Qian
Department of Oncology, Dongying People's Hospital, Dongying 257091, Shandong Province, China.
Department of Pathology, Dongying People's Hospital, Dongying 257091, Shandong Province, China.
World J Gastroenterol. 2025 May 21;31(19):104897. doi: 10.3748/wjg.v31.i19.104897.
Esophageal cancer is the sixth most common cancer worldwide, with a high mortality rate. Early prognosis of esophageal abnormalities can improve patient survival rates. The progression of esophageal cancer follows a sequence from esophagitis to non-dysplastic Barrett's esophagus, dysplastic Barrett's esophagus, and eventually esophageal adenocarcinoma (EAC). This study explored the application of deep learning technology in the precise diagnosis of pathological classification and staging of EAC to enhance diagnostic accuracy and efficiency.
To explore the application of deep learning models, particularly Wave-Vision Transformer (Wave-ViT), in the pathological classification and staging of esophageal cancer to enhance diagnostic accuracy and efficiency.
We applied several deep learning models, including multi-layer perceptron, residual network, transformer, and Wave-ViT, to a dataset of clinically validated esophageal pathology images. The models were trained to identify pathological features and assist in the classification and staging of different stages of esophageal cancer. The models were compared based on accuracy, computational complexity, and efficiency.
The Wave-ViT model demonstrated the highest accuracy at 88.97%, surpassing the transformer (87.65%), residual network (85.44%), and multi-layer perceptron (81.17%). Additionally, Wave-ViT exhibited low computational complexity with significantly reduced parameter size, making it highly efficient for real-time clinical applications.
Deep learning technology, particularly the Frequency-Domain Transformer model, shows promise in improving the precision of pathological classification and staging of EAC. The application of the Frequency-Domain Transformer model enhances the automation of the diagnostic process and may support early detection and treatment of EAC. Future research may further explore the potential of this model in broader medical image analysis applications, particularly in the field of precision medicine.
食管癌是全球第六大常见癌症,死亡率很高。食管异常的早期预后可提高患者生存率。食管癌的发展遵循从食管炎到非发育异常的巴雷特食管、发育异常的巴雷特食管,最终发展为食管腺癌(EAC)的顺序。本研究探讨了深度学习技术在EAC病理分类和分期的精确诊断中的应用,以提高诊断准确性和效率。
探讨深度学习模型,特别是Wave-Vision Transformer(Wave-ViT)在食管癌病理分类和分期中的应用,以提高诊断准确性和效率。
我们将包括多层感知器、残差网络、Transformer和Wave-ViT在内的几种深度学习模型应用于经过临床验证的食管病理图像数据集。对这些模型进行训练,以识别病理特征,并协助对食管癌不同阶段进行分类和分期。根据准确性、计算复杂度和效率对这些模型进行比较。
Wave-ViT模型的准确率最高,为88.97%,超过了Transformer(87.65%)、残差网络(85.44%)和多层感知器(81.17%)。此外,Wave-ViT的计算复杂度较低,参数规模显著减小,使其在实时临床应用中具有很高的效率。
深度学习技术,特别是频域Transformer模型,在提高EAC病理分类和分期的精度方面显示出前景。频域Transformer模型的应用提高了诊断过程的自动化程度,并可能支持EAC的早期检测和治疗。未来的研究可以进一步探索该模型在更广泛的医学图像分析应用中的潜力,特别是在精准医学领域。