Otorhinolaryngology Hospital, The First Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 510080, Guangdong, China.
School of Computer Science and Engineering, Guangdong Province Key Lab of Computational Science, Sun Yat-Sen University, Guangzhou, 510006, Guangdong, China.
J Transl Med. 2023 Oct 7;21(1):698. doi: 10.1186/s12967-023-04572-y.
Laryngopharyngeal cancer (LPC) includes laryngeal and hypopharyngeal cancer, whose early diagnosis can significantly improve the prognosis and quality of life of patients. Pathological biopsy of suspicious cancerous tissue under the guidance of laryngoscopy is the gold standard for diagnosing LPC. However, this subjective examination largely depends on the skills and experience of laryngologists, which increases the possibility of missed diagnoses and repeated unnecessary biopsies. We aimed to develop and validate a deep convolutional neural network-based Laryngopharyngeal Artificial Intelligence Diagnostic System (LPAIDS) for real-time automatically identifying LPC in both laryngoscopy white-light imaging (WLI) and narrow-band imaging (NBI) images to improve the diagnostic accuracy of LPC by reducing diagnostic variation among on-expert laryngologists.
All 31,543 laryngoscopic images from 2382 patients were categorised into training, verification, and test sets to develop, validate, and internal test LPAIDS. Another 25,063 images from five other hospitals were used as external tests. Overall, 551 videos were used to evaluate the real-time performance of the system, and 200 randomly selected videos were used to compare the diagnostic performance of the LPAIDS with that of laryngologists. Two deep-learning models using either WLI (model W) or NBI (model N) images were constructed to compare with LPAIDS.
LPAIDS had a higher diagnostic performance than models W and N, with accuracies of 0·956 and 0·949 in the internal image and video tests, respectively. The robustness and stability of LPAIDS were validated in external sets with the area under the receiver operating characteristic curve values of 0·965-0·987. In the laryngologist-machine competition, LPAIDS achieved an accuracy of 0·940, which was comparable to expert laryngologists and outperformed other laryngologists with varying qualifications.
LPAIDS provided high accuracy and stability in detecting LPC in real-time, which showed great potential for using LPAIDS to improve the diagnostic accuracy of LPC by reducing diagnostic variation among on-expert laryngologists.
喉咽癌(LPC)包括喉癌和下咽癌,其早期诊断可显著改善患者的预后和生活质量。在喉镜引导下对可疑癌组织进行病理活检是诊断 LPC 的金标准。然而,这种主观检查在很大程度上依赖于喉镜医生的技能和经验,这增加了漏诊和不必要的重复活检的可能性。我们旨在开发和验证一种基于深度卷积神经网络的喉咽人工智能诊断系统(LPAIDS),以便实时自动识别喉镜白光成像(WLI)和窄带成像(NBI)图像中的 LPC,从而通过减少专家喉镜医生之间的诊断差异来提高 LPC 的诊断准确性。
将 2382 名患者的 31543 张喉镜图像分为训练集、验证集和测试集,以开发、验证和内部测试 LPAIDS。另外 5 家医院的 25063 张图像用于外部测试。总共使用 551 个视频来评估系统的实时性能,使用 200 个随机选择的视频来比较 LPAIDS 与喉镜医生的诊断性能。构建了两个使用 WLI(模型 W)或 NBI(模型 N)图像的深度学习模型,与 LPAIDS 进行比较。
LPAIDS 的诊断性能优于模型 W 和 N,内部图像和视频测试的准确率分别为 0.956 和 0.949。在外部数据集的稳健性和稳定性验证中,受试者工作特征曲线下面积值分别为 0.965-0.987。在喉镜医生与机器的竞争中,LPAIDS 的准确率为 0.940,与专家喉镜医生相当,优于不同资质的其他喉镜医生。
LPAIDS 实时检测 LPC 的准确率高且稳定性好,有望通过减少专家喉镜医生之间的诊断差异来提高 LPC 的诊断准确性。