Lee Hannah, Chung Jun-Won, Yun Sung-Cheol, Jung Sung Woo, Yoon Yeong Jun, Kim Ji Hee, Cha Boram, Kayasseh Mohd Azzam, Kim Kyoung Oh
Division of Gastroenterology, Department of Internal Medicine, Gachon University, Gil Medical Center, Incheon 21565, Republic of Korea.
Division of Biostatistics, Center for Medical Research and Information, University of Ulsan College of Medicine, Seoul 05505, Republic of Korea.
Diagnostics (Basel). 2024 Nov 30;14(23):2706. doi: 10.3390/diagnostics14232706.
BACKGROUND/OBJECTIVES: Gastric cancer ranks fifth for incidence and fourth in the leading causes of mortality worldwide. In this study, we aimed to validate previously developed artificial intelligence (AI) computer-aided detection (CADe) algorithm, called ALPHAON in detecting gastric neoplasm.
We used the retrospective data of 500 still images, including 5 benign gastric ulcers, 95 with gastric cancer, and 400 normal images. Thereby we validated the CADe algorithm measuring accuracy, sensitivity, and specificity with the result of receiver operating characteristic curves (ROC) and area under curve (AUC) in addition to comparing the diagnostic performance status of four expert endoscopists, four trainees, and four beginners from two university-affiliated hospitals with CADe algorithm. After a washing-out period of over 2 weeks, endoscopists performed gastric detection on the same dataset of the 500 endoscopic images again marked by ALPHAON.
The CADe algorithm presented high validity in detecting gastric neoplasm with accuracy (0.88, 95% CI: 0.85 to 0.91), sensitivity (0.93, 95% CI: 0.88 to 0.98), specificity (0.87, 95% CI: 0.84 to 0.90), and AUC (0.962). After a washing-out period of over 2 weeks, overall validity improved in the trainee and beginner groups with the assistance of ALPHAON. Significant improvement was present, especially in the beginner group (accuracy 0.94 (0.93 to 0.96) < 0.001, sensitivity 0.87 (0.82 to 0.92) < 0.001, specificity 0.96 (0.95 to 0.97) < 0.001).
The high validation performance state of the CADe algorithm system was verified. Also, ALPHAON has demonstrated its potential to serve as an endoscopic educator for beginners improving and making progress in sensitivity and specificity.
背景/目的:胃癌的发病率在全球排名第五,是主要死因中的第四位。在本研究中,我们旨在验证先前开发的名为ALPHAON的人工智能(AI)计算机辅助检测(CADe)算法在检测胃肿瘤方面的性能。
我们使用了500张静态图像的回顾性数据,其中包括5例良性胃溃疡、95例胃癌以及400例正常图像。通过这些数据,我们验证了CADe算法的测量准确性、敏感性和特异性,并得出了受试者工作特征曲线(ROC)和曲线下面积(AUC)的结果。此外,我们还比较了来自两家大学附属医院的四位专家内镜医师、四位实习生和四位初学者在使用CADe算法时的诊断性能状况。在超过两周的洗脱期后,内镜医师再次对由ALPHAON标记的500张内镜图像的同一数据集进行胃部检测。
CADe算法在检测胃肿瘤方面表现出较高的有效性,准确性为0.88(95%CI:0.85至0.91),敏感性为0.93(95%CI:0.88至0.98),特异性为0.87(95%CI:0.84至0.90),AUC为0.962。在超过两周的洗脱期后,在ALPHAON的辅助下,实习生和初学者组的整体有效性有所提高。尤其是初学者组有显著改善(准确性0.94(0.93至0.96)<0.001,敏感性0.87(0.82至0.92)<0.001,特异性0.96(0.95至0.97)<0.001)。
验证了CADe算法系统的高验证性能状态。此外,ALPHAON已证明其有潜力作为初学者的内镜教育工具,提高敏感性和特异性并取得进步。