Xue Zhiyun, Pearlman Paul C, Yu Kelly, Pal Anabik, Chen Tseng-Cheng, Hua Chun-Hung, Kang Chung Jan, Chien Chih-Yen, Tsai Ming-Hsui, Wang Cheng-Ping, Chaturvedi Anil K, Antani Sameer
Lister Hill National Center for Biomedical Communications, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894.
Center for Global Health, National Cancer Institute, National Institutes of Health, Rockville, MD 20850.
Proc SPIE Int Soc Opt Eng. 2022 Feb-Mar;12037. doi: 10.1117/12.2611541. Epub 2022 Apr 4.
Oral cavity cancer is a common cancer that can result in breathing, swallowing, drinking, eating problems as well as speech impairment, and there is high mortality for the advanced stage. Its diagnosis is confirmed through histopathology. It is of critical importance to determine the need for biopsy and identify the correct location. Deep learning has demonstrated great promise/success in several image-based medical screening/diagnostic applications. However, automated visual evaluation of oral cavity lesions has received limited attention in the literature. Since the disease can occur in different parts of the oral cavity, a first step is to identify the images of different anatomical sites. We automatically generate labels for six sites which will help in lesion detection in a subsequent analytical module. We apply a recently proposed network called ResNeSt that incorporates channel-wise attention with multi-path representation and demonstrate high performance on the test set. The average F1-score for all classes and accuracy are both 0.96. Moreover, we provide a detailed discussion on class activation maps obtained from both correct and incorrect predictions to analyze algorithm behavior. The highlighted regions in the class activation maps generally correlate considerably well with the region of interest perceived and expected by expert human observers. The insights and knowledge gained from the analysis are helpful in not only algorithm improvement, but also aiding the development of the other key components in the process of computer assisted oral cancer screening.
口腔癌是一种常见癌症,可导致呼吸、吞咽、饮水、进食问题以及言语障碍,晚期死亡率很高。其诊断需通过组织病理学确认。确定活检需求并识别正确位置至关重要。深度学习在一些基于图像的医学筛查/诊断应用中已展现出巨大潜力/取得了成功。然而,口腔病变的自动视觉评估在文献中受到的关注有限。由于该疾病可发生在口腔的不同部位,第一步是识别不同解剖部位的图像。我们自动为六个部位生成标签,这将有助于在后续分析模块中进行病变检测。我们应用了一种最近提出的名为ResNeSt的网络,该网络将通道注意力与多路径表示相结合,并在测试集上展示了高性能。所有类别的平均F1分数和准确率均为0.96。此外,我们对从正确和错误预测中获得的类激活映射进行了详细讨论,以分析算法行为。类激活映射中的突出显示区域通常与专业人类观察者感知和预期的感兴趣区域有相当好的相关性。从分析中获得的见解和知识不仅有助于算法改进,还有助于计算机辅助口腔癌筛查过程中其他关键组件的开发。