文献检索，用中文搜 PubMed

BACKGROUND

Construct deep learning models for colonoscopy quality control using different architectures and explore their decision-making mechanisms.

METHODS

A total of 4,189 colonoscopy images were collected from two medical centers, covering different levels of bowel cleanliness, the presence of polyps, and the cecum. Using these data, eight pre-trained models based on CNN and Transformer architectures underwent transfer learning and fine-tuning. The models' performance was evaluated using metrics such as AUC, Precision, and F1 score. Perceptual hash functions were employed to detect image changes, enabling real-time monitoring of colonoscopy withdrawal speed. Model interpretability was analyzed using techniques such as Grad-CAM and SHAP. Finally, the best-performing model was converted to ONNX format and deployed on device terminals.

RESULTS

The EfficientNetB2 model outperformed other architectures on the validation set, achieving an accuracy of 0.992. It surpassed models based on other CNN and Transformer architectures. The model's precision, recall, and F1 score were 0.991, 0.989, and 0.990, respectively. On the test set, the EfficientNetB2 model achieved an average AUC of 0.996, with a precision of 0.948 and a recall of 0.952. Interpretability analysis showed the specific image regions the model used for decision-making. The model was converted to ONNX format and deployed on device terminals, achieving an average inference speed of over 60 frames per second.

CONCLUSIONS

The AI-assisted quality system, based on the EfficientNetB2 model, integrates four key quality control indicators for colonoscopy. This integration enables medical institutions to comprehensively manage and enhance these indicators using a single model, showcasing promising potential for clinical applications.

BACKGROUND

Construct deep learning models for colonoscopy quality control using different architectures and explore their decision-making mechanisms.

METHODS

RESULTS

CONCLUSIONS

背景

使用不同的架构构建结肠镜检查质量控制的深度学习模型，并探索其决策机制。

方法

从两个医学中心共收集了 4189 张结肠镜图像，涵盖了不同程度的肠道清洁度、息肉存在和盲肠。使用这些数据，对基于 CNN 和 Transformer 架构的八个预训练模型进行了迁移学习和微调。使用 AUC、精度和 F1 分数等指标评估模型的性能。使用感知哈希函数来检测图像变化，从而实现结肠镜退出速度的实时监测。使用 Grad-CAM 和 SHAP 等技术分析模型的可解释性。最后，将性能最佳的模型转换为 ONNX 格式并部署在设备终端上。

结果

在验证集上，EfficientNetB2 模型的表现优于其他架构，准确率达到 0.992。它优于基于其他 CNN 和 Transformer 架构的模型。该模型的精度、召回率和 F1 分数分别为 0.991、0.989 和 0.990。在测试集上，EfficientNetB2 模型的平均 AUC 为 0.996，精度为 0.948，召回率为 0.952。可解释性分析显示了模型用于决策的特定图像区域。该模型被转换为 ONNX 格式并部署在设备终端上，平均推理速度超过 60 帧/秒。