Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:2025-2028. doi: 10.1109/EMBC48229.2022.9870992.
This work focuses on detection of upper gas-trointestinal (GI) landmarks, which are important anatomical areas of the upper GI tract digestive system that should be photodocumented during endoscopy to guarantee a complete examination. The aim of this work consisted in testing new automatic algorithms, specifically based on convolutional neural network (CNN) systems, able to detect upper GI landmarks, that can help to avoid the presence of blind spots during esophagogastroduodenoscopy. We tested pre-trained CNN architectures, such as the ResNet-50 and VGG-16, in conjunction with different training approaches, including the use of class weights, batch normalization, dropout, and data augmentation. The ResNet-50 model trained with class weights was the best performing CNN, achieving an accuracy of 71.79% and a Mathews Correlation Coefficient (MCC) of 65.06%. The combination of supervised and unsupervised learning was also explored to increase classification performance. In particular, convolutional autoencoder architectures trained with unlabeled GI images were used to extract representative features. Such features were then concatenated with those extracted by the pre-trained ResNet-50 architecture. This approach achieved a classification accuracy of 72.45% and an MCC of 65.08%. Clinical relevance- Esophagogastroduodenoscopy (EGD) photodocumentation is essential to guarantee that all areas of the upper GI system are examined avoiding blind spots. This work has the objective to help the EGD photodocumentation monitorization by testing new CNN-based systems able to detect EGD landmarks.
这项工作专注于检测上消化道 (GI) 标志,这些标志是上消化道消化系统的重要解剖区域,在进行内窥镜检查时应进行拍照记录,以保证全面检查。这项工作的目的在于测试新的自动算法,特别是基于卷积神经网络 (CNN) 系统的算法,这些算法能够检测上消化道标志,可以帮助避免在进行食管胃十二指肠镜检查时出现盲点。我们测试了预训练的 CNN 架构,如 ResNet-50 和 VGG-16,并结合了不同的训练方法,包括使用类权重、批量归一化、随机失活和数据增强。使用类权重训练的 ResNet-50 模型是表现最好的 CNN,达到了 71.79%的准确率和 65.06%的马修斯相关系数 (MCC)。还探索了监督学习和无监督学习的结合,以提高分类性能。特别是,使用未标记的 GI 图像训练的卷积自动编码器架构用于提取代表性特征。然后将这些特征与预训练的 ResNet-50 架构提取的特征串联起来。这种方法达到了 72.45%的分类准确率和 65.08%的 MCC。临床相关性- 食管胃十二指肠镜检查 (EGD) 的拍照记录对于保证检查时上消化道系统的所有区域都被检查到避免出现盲点至关重要。这项工作旨在通过测试新的基于 CNN 的系统来帮助 EGD 拍照记录监测,这些系统能够检测 EGD 标志。