利用多模态数据的人工智能辅助系统实时检测喉咽癌。

Real-time detection of laryngopharyngeal cancer using an artificial intelligence-assisted system with multimodal data.

机构信息

Otorhinolaryngology Hospital, The First Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 510080, Guangdong, China.

School of Computer Science and Engineering, Guangdong Province Key Lab of Computational Science, Sun Yat-Sen University, Guangzhou, 510006, Guangdong, China.

出版信息

J Transl Med. 2023 Oct 7;21(1):698. doi: 10.1186/s12967-023-04572-y.

DOI:10.1186/s12967-023-04572-y

PMID:37805551

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10559609/

Abstract

BACKGROUND

Laryngopharyngeal cancer (LPC) includes laryngeal and hypopharyngeal cancer, whose early diagnosis can significantly improve the prognosis and quality of life of patients. Pathological biopsy of suspicious cancerous tissue under the guidance of laryngoscopy is the gold standard for diagnosing LPC. However, this subjective examination largely depends on the skills and experience of laryngologists, which increases the possibility of missed diagnoses and repeated unnecessary biopsies. We aimed to develop and validate a deep convolutional neural network-based Laryngopharyngeal Artificial Intelligence Diagnostic System (LPAIDS) for real-time automatically identifying LPC in both laryngoscopy white-light imaging (WLI) and narrow-band imaging (NBI) images to improve the diagnostic accuracy of LPC by reducing diagnostic variation among on-expert laryngologists.

METHODS

All 31,543 laryngoscopic images from 2382 patients were categorised into training, verification, and test sets to develop, validate, and internal test LPAIDS. Another 25,063 images from five other hospitals were used as external tests. Overall, 551 videos were used to evaluate the real-time performance of the system, and 200 randomly selected videos were used to compare the diagnostic performance of the LPAIDS with that of laryngologists. Two deep-learning models using either WLI (model W) or NBI (model N) images were constructed to compare with LPAIDS.

RESULTS

LPAIDS had a higher diagnostic performance than models W and N, with accuracies of 0·956 and 0·949 in the internal image and video tests, respectively. The robustness and stability of LPAIDS were validated in external sets with the area under the receiver operating characteristic curve values of 0·965-0·987. In the laryngologist-machine competition, LPAIDS achieved an accuracy of 0·940, which was comparable to expert laryngologists and outperformed other laryngologists with varying qualifications.

CONCLUSIONS

LPAIDS provided high accuracy and stability in detecting LPC in real-time, which showed great potential for using LPAIDS to improve the diagnostic accuracy of LPC by reducing diagnostic variation among on-expert laryngologists.

摘要

背景

喉咽癌（LPC）包括喉癌和下咽癌，其早期诊断可显著改善患者的预后和生活质量。在喉镜引导下对可疑癌组织进行病理活检是诊断 LPC 的金标准。然而，这种主观检查在很大程度上依赖于喉镜医生的技能和经验，这增加了漏诊和不必要的重复活检的可能性。我们旨在开发和验证一种基于深度卷积神经网络的喉咽人工智能诊断系统（LPAIDS），以便实时自动识别喉镜白光成像（WLI）和窄带成像（NBI）图像中的 LPC，从而通过减少专家喉镜医生之间的诊断差异来提高 LPC 的诊断准确性。

方法

将 2382 名患者的 31543 张喉镜图像分为训练集、验证集和测试集，以开发、验证和内部测试 LPAIDS。另外 5 家医院的 25063 张图像用于外部测试。总共使用 551 个视频来评估系统的实时性能，使用 200 个随机选择的视频来比较 LPAIDS 与喉镜医生的诊断性能。构建了两个使用 WLI（模型 W）或 NBI（模型 N）图像的深度学习模型，与 LPAIDS 进行比较。

结果

LPAIDS 的诊断性能优于模型 W 和 N，内部图像和视频测试的准确率分别为 0.956 和 0.949。在外部数据集的稳健性和稳定性验证中，受试者工作特征曲线下面积值分别为 0.965-0.987。在喉镜医生与机器的竞争中，LPAIDS 的准确率为 0.940，与专家喉镜医生相当，优于不同资质的其他喉镜医生。

结论

LPAIDS 实时检测 LPC 的准确率高且稳定性好，有望通过减少专家喉镜医生之间的诊断差异来提高 LPC 的诊断准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f65/10559609/e9f034f2dc79/12967_2023_4572_Fig1_HTML.jpg

相似文献

Real-time detection of laryngopharyngeal cancer using an artificial intelligence-assisted system with multimodal data.

J Transl Med. 2023 Oct 7;21(1):698. doi: 10.1186/s12967-023-04572-y.

Multi-Instance Learning for Vocal Fold Leukoplakia Diagnosis Using White Light and Narrow-Band Imaging: A Multicenter Study.

Laryngoscope. 2024 Oct;134(10):4321-4328. doi: 10.1002/lary.31537. Epub 2024 May 27.

Convolutional neural network based anatomical site identification for laryngoscopy quality control: A multicenter study.

Am J Otolaryngol. 2023 Mar-Apr;44(2):103695. doi: 10.1016/j.amjoto.2022.103695. Epub 2022 Nov 24.

Real-time artificial intelligence for detection of upper gastrointestinal cancer by endoscopy: a multicentre, case-control, diagnostic study.

Lancet Oncol. 2019 Dec;20(12):1645-1654. doi: 10.1016/S1470-2045(19)30637-0. Epub 2019 Oct 4.

Application of artificial intelligence using a convolutional neural network for diagnosis of early gastric cancer based on magnifying endoscopy with narrow-band imaging.

J Gastroenterol Hepatol. 2021 Feb;36(2):482-489. doi: 10.1111/jgh.15190. Epub 2020 Jul 28.

Comparative study on artificial intelligence systems for detecting early esophageal squamous cell carcinoma between narrow-band and white-light imaging.

World J Gastroenterol. 2021 Jan 21;27(3):281-293. doi: 10.3748/wjg.v27.i3.281.

Clinical utility and effectiveness of a training programme in the application of a new classification of narrow-band imaging for vocal cord leukoplakia: A multicentre study.

Clin Otolaryngol. 2019 Sep;44(5):729-735. doi: 10.1111/coa.13361. Epub 2019 Jun 19.

Diagnostic Accuracies of Laryngeal Diseases Using a Convolutional Neural Network-Based Image Classification System.

Laryngoscope. 2021 Nov;131(11):2558-2566. doi: 10.1002/lary.29595. Epub 2021 May 17.

Automated detection of glottic laryngeal carcinoma in laryngoscopic images from a multicentre database using a convolutional neural network.

Clin Otolaryngol. 2023 May;48(3):436-441. doi: 10.1111/coa.14029. Epub 2023 Jan 20.

Comparison of Convolutional Neural Network Models for Determination of Vocal Fold Normality in Laryngoscopic Images.

J Voice. 2022 Sep;36(5):590-598. doi: 10.1016/j.jvoice.2020.08.003. Epub 2020 Aug 30.

引用本文的文献

Comparative Evaluation of High-Speed Videoendoscopy and Laryngovideostroboscopy for Functional Laryngeal Assessment in Clinical Practice.

J Clin Med. 2025 Mar 4;14(5):1723. doi: 10.3390/jcm14051723.

本文引用的文献

Accuracy of narrow-band imaging for diagnosing malignant transformation of vocal cord leukoplakia: A systematic review and meta-analysis.

Laryngoscope Investig Otolaryngol. 2023 Mar 29;8(2):508-517. doi: 10.1002/lio2.1049. eCollection 2023 Apr.

Diagnosis of Early Glottic Cancer Using Laryngeal Image and Voice Based on Ensemble Learning of Convolutional Neural Network Classifiers.

J Voice. 2025 Jan;39(1):245-257. doi: 10.1016/j.jvoice.2022.07.007. Epub 2022 Sep 6.

A deep convolutional neural network-based method for laryngeal squamous cell carcinoma diagnosis.

Ann Transl Med. 2021 Dec;9(24):1797. doi: 10.21037/atm-21-6458.

Real-time automated diagnosis of colorectal cancer invasion depth using a deep learning model with multimodal data (with video).

Gastrointest Endosc. 2022 Jun;95(6):1186-1194.e3. doi: 10.1016/j.gie.2021.11.049. Epub 2021 Dec 14.

Real-time use of artificial intelligence for diagnosing early gastric cancer by magnifying image-enhanced endoscopy: a multicenter diagnostic study (with videos).

Gastrointest Endosc. 2022 Apr;95(4):671-678.e4. doi: 10.1016/j.gie.2021.11.040. Epub 2021 Dec 8.

Deep Learning Applied to White Light and Narrow Band Imaging Videolaryngoscopy: Toward Real-Time Laryngeal Cancer Detection.

Laryngoscope. 2022 Sep;132(9):1798-1806. doi: 10.1002/lary.29960. Epub 2021 Nov 25.

Deep learning for diagnosis and survival prediction in soft tissue sarcoma.

Ann Oncol. 2021 Sep;32(9):1178-1187. doi: 10.1016/j.annonc.2021.06.007. Epub 2021 Jun 15.

A deep-learning model to assist thyroid nodule diagnosis and management.

Lancet Digit Health. 2021 Jul;3(7):e410. doi: 10.1016/S2589-7500(21)00108-4. Epub 2021 Jun 10.

Diagnostic Accuracies of Laryngeal Diseases Using a Convolutional Neural Network-Based Image Classification System.

Laryngoscope. 2021 Nov;131(11):2558-2566. doi: 10.1002/lary.29595. Epub 2021 May 17.

Artificial intelligence in the diagnosis of gastric precancerous conditions by image-enhanced endoscopy: a multicenter, diagnostic study (with video).

Gastrointest Endosc. 2021 Sep;94(3):540-548.e4. doi: 10.1016/j.gie.2021.03.013. Epub 2021 Mar 17.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用多模态数据的人工智能辅助系统实时检测喉咽癌。

Real-time detection of laryngopharyngeal cancer using an artificial intelligence-assisted system with multimodal data.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献