School of Medical Instrument and Food Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China.
Department of Rehabilitation, Shanghai Putuo People's Hospital, Shanghai 200060, China.
Rev Sci Instrum. 2021 Sep 1;92(9):094102. doi: 10.1063/5.0054161.
The wireless capsule endoscopy (WCE) procedure produces tens of thousands of images of the digestive tract, for which the use of the manual reading process is full of challenges. Convolutional neural networks are used to automatically detect lesions in WCE images. However, studies on clinical multilesion detection are scarce, and it is difficult to effectively balance the sensitivity to multiple lesions. A strategy for detecting multiple lesions is proposed, wherein common vascular and inflammatory lesions can be automatically and quickly detected on capsule endoscopic images. Based on weakly supervised learning, EfficientNet is fine-tuned to extract the endoscopic image features. Combining spatial features and channel features, the proposed attention network is then used as a classifier to obtain three classifications. The accuracy and speed of the model were compared with those of the ResNet121 and InceptionNetV4 models. It was tested on a public WCE image dataset obtained from 4143 subjects. On the computer-assisted diagnosis for capsule endoscopy database, the method gives a sensitivity of 96.67% for vascular lesions and 93.33% for inflammatory lesions. The precision for vascular lesions was 92.80%, and that for inflammatory lesions was 95.73%. The accuracy was 96.11%, which is 1.11% higher than that of the latest InceptionNetV4 network. Prediction for an image only requires 14 ms, which balances the accuracy and speed comparatively better. This strategy can be used as an auxiliary diagnostic method for specialists for the rapid reading of clinical capsule endoscopes.
无线胶囊内镜(WCE)检查会生成数以万计的消化道图像,而手动阅读过程充满了挑战。卷积神经网络被用于自动检测 WCE 图像中的病变。然而,针对临床多病变检测的研究很少,并且很难有效地平衡对多种病变的敏感性。提出了一种用于检测多种病变的策略,其中可以自动快速地检测胶囊内镜图像上的常见血管和炎症性病变。基于弱监督学习,对 EfficientNet 进行了微调以提取内镜图像特征。然后,结合空间特征和通道特征,使用所提出的注意力网络作为分类器,以获得三种分类。将模型的准确性和速度与 ResNet121 和 InceptionNetV4 模型进行了比较。在来自 4143 名受试者的公共 WCE 图像数据库上进行了测试。在计算机辅助胶囊内镜诊断数据库上,该方法对血管病变的敏感性为 96.67%,对炎症性病变的敏感性为 93.33%。血管病变的精确率为 92.80%,炎症性病变的精确率为 95.73%。准确性为 96.11%,比最新的 InceptionNetV4 网络高 1.11%。预测一张图像仅需 14ms,在准确性和速度之间实现了较好的平衡。该策略可以作为专家快速阅读临床胶囊内镜的辅助诊断方法。