Jin Tao, Jiang Yancai, Mao Boneng, Wang Xing, Lu Bo, Qian Ji, Zhou Hutao, Ma Tieliang, Zhang Yefei, Li Sisi, Shi Yun, Yao Zhendong
Department of Gastroenterology, The Affiliated Yixing Hospital of Jiangsu University, Yixing, China.
Microsoft Ltd Co., Suzhou, China.
Front Oncol. 2022 Aug 16;12:953090. doi: 10.3389/fonc.2022.953090. eCollection 2022.
Convolutional neural networks (CNNs) are increasingly being applied to the diagnosis of gastric cancer. However, the effect of the data proportions within the training set on test results has not been sufficiently studied. Here, we constructed an artificial intelligence (AI) system called EGC-YOLOV4, based on the YOLO-v4 algorithm, to explore the optimal training-set ratio for diagnosing early gastric cancer.
A total of 220,918 gastroscopic images from Yixing People's Hospital were collected. Seven training-set models were established and evaluated on four test sets. For each model, sensitivity, specificity, Youden index, accuracy, and the corresponding thresholds were measured, and ROC curves were plotted.
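As an illustration of the metrics named above, the following minimal sketch computes sensitivity, specificity, accuracy, and the Youden index (sensitivity + specificity - 1) at a given score threshold, and picks the threshold that maximizes the Youden index. The function names, the per-image score representation, and the toy data are assumptions made for illustration; the abstract does not describe how the authors implemented their evaluation.

```python
def metrics_at(scores, labels, threshold):
    """Compute diagnostic metrics when images with score >= threshold are called positive.

    scores: hypothetical per-image model confidences in [0, 1]
    labels: ground truth, 1 = early gastric cancer, 0 = non-cancer
    """
    tp = fp = tn = fn = 0
    for score, label in zip(scores, labels):
        predicted_positive = score >= threshold
        if predicted_positive and label == 1:
            tp += 1
        elif predicted_positive and label == 0:
            fp += 1
        elif not predicted_positive and label == 0:
            tn += 1
        else:
            fn += 1
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": (tp + tn) / len(labels),
        "youden": sensitivity + specificity - 1.0,
    }


def best_threshold(scores, labels):
    """Return the candidate threshold with the highest Youden index.

    Candidates are the distinct observed scores; ties go to the lowest threshold.
    """
    candidates = sorted(set(scores))
    return max(candidates, key=lambda t: metrics_at(scores, labels, t)["youden"])


# Toy data, invented for illustration only (not from the study).
toy_scores = [0.1, 0.2, 0.7, 0.9]
toy_labels = [0, 0, 1, 1]
chosen = best_threshold(toy_scores, toy_labels)
summary = metrics_at(toy_scores, toy_labels, chosen)
```

Sweeping the threshold over all candidates and recording (1 - specificity, sensitivity) at each one yields the points of an ROC curve.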
EGC-YOLOV4 can quickly and accurately identify early gastric cancer lesions in gastroscopic images and generalizes well. The proportion of positive to negative samples in the training set affects the overall diagnostic performance of the AI. In this study, the optimal ratio of positive to negative samples in the training set was 1:1 to 1:2.
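One way to assemble a training set with a chosen positive-to-negative ratio is to keep all positive images and randomly subsample the negatives. The sketch below assumes this approach; the abstract does not state how the seven training sets were actually built, and the function name, file names, and `neg_per_pos` parameter are hypothetical.

```python
import random


def build_training_set(positives, negatives, neg_per_pos, seed=0):
    """Return a shuffled list of (image, label) pairs with a 1:neg_per_pos ratio.

    positives / negatives: lists of image identifiers (hypothetical file names)
    neg_per_pos: number of negative samples to keep per positive sample
    """
    rng = random.Random(seed)  # fixed seed so the subsample is reproducible
    wanted = round(len(positives) * neg_per_pos)
    n_neg = min(len(negatives), wanted)  # cannot draw more negatives than exist
    sampled_negatives = rng.sample(negatives, n_neg)
    combined = [(img, 1) for img in positives] + [(img, 0) for img in sampled_negatives]
    rng.shuffle(combined)
    return combined


# Toy identifiers, invented for illustration only.
pos_images = [f"egc_{i}.jpg" for i in range(10)]
neg_images = [f"normal_{i}.jpg" for i in range(50)]
train_1_to_2 = build_training_set(pos_images, neg_images, neg_per_pos=2)
```

With 10 positives and `neg_per_pos=2`, the result holds 10 positive and 20 negative samples, corresponding to the 1:2 end of the range reported above.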