Hwang Ji-Hee, Lim Minyoung, Han Gyeongjin, Park Heejin, Kim Yong-Bum, Park Jinseok, Jun Sang-Yeop, Lee Jaeku, Cho Jae-Woo
Toxicologic Pathology Research Group, Department of Advanced Toxicology Research, Korea Institute of Toxicology, Daejeon, 34114 Republic of Korea.
Department of Advanced Toxicology Research, Korea Institute of Toxicology, Daejeon, 34114 Republic of Korea.
Toxicol Res. 2023 Apr 6;39(3):399-408. doi: 10.1007/s43188-023-00173-5. eCollection 2023 Jul.
Deep learning has recently become one of the most popular methods of image analysis. In non-clinical studies, several tissue slides are generated to investigate the toxicity of a test compound. These are converted into digital image data using a slide scanner, which is then studied by researchers to investigate abnormalities, and the deep learning method has been started to adopt in this study. However, comparative studies evaluating different deep learning algorithms for analyzing abnormal lesions are scarce. In this study, we applied three algorithms, SSD, Mask R-CNN, and DeepLabV3, to detect hepatic necrosis in slide images and determine the best deep learning algorithm for analyzing abnormal lesions. We trained each algorithm on 5750 images and 5835 annotations of hepatic necrosis including validation and test, augmented with 500 image tiles of 448 × 448 pixels. Precision, recall, and accuracy were calculated for each algorithm based on the prediction results of 60 test images of 2688 × 2688 pixels. The two segmentation algorithms, DeepLabV3 and Mask R-CNN, showed over 90% of accuracy (0.94 and 0.92, respectively), whereas SSD, an object detection algorithm, showed lower accuracy. The trained DeepLabV3 outperformed all others in recall while also successfully separating hepatic necrosis from other features in the test images. It is important to localize and separate the abnormal lesion of interest from other features to investigate it on a slide level. Therefore, we suggest that segmentation algorithms are more appropriate than object detection algorithms for use in the pathological analysis of images in non-clinical studies.
The online version contains supplementary material available at 10.1007/s43188-023-00173-5.
深度学习最近已成为最流行的图像分析方法之一。在非临床研究中,会生成几张组织切片来研究受试化合物的毒性。使用载玻片扫描仪将这些切片转换为数字图像数据,然后研究人员对其进行研究以调查异常情况,并且深度学习方法已开始应用于这项研究。然而,评估用于分析异常病变的不同深度学习算法的比较研究却很少。在本研究中,我们应用了三种算法,即SSD、Mask R-CNN和DeepLabV3,来检测载玻片图像中的肝坏死,并确定用于分析异常病变的最佳深度学习算法。我们在5750张图像和5835个肝坏死注释(包括验证和测试)上训练了每种算法,并使用500个448×448像素的图像块进行了增强。基于2688×2688像素的60张测试图像的预测结果,计算了每种算法的精确率、召回率和准确率。两种分割算法DeepLabV3和Mask R-CNN的准确率均超过90%(分别为0.94和0.92),而目标检测算法SSD的准确率较低。经过训练的DeepLabV3在召回率方面优于所有其他算法,同时还成功地在测试图像中将肝坏死与其他特征区分开来。在载玻片水平上研究感兴趣的异常病变时,将其与其他特征进行定位和分离很重要。因此,我们建议在非临床研究的图像病理分析中,分割算法比目标检测算法更合适。
在线版本包含可在10.1007/s43188-023-00173-5获取的补充材料。