Department of Computer Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, Thailand; Chula Intelligent and Complex Systems, Faculty of Science, Chulalongkorn University, Bangkok, Thailand.
Department of Pathology, King Chulalongkorn Memorial Hospital and Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.
Artif Intell Med. 2023 Jan;135:102462. doi: 10.1016/j.artmed.2022.102462. Epub 2022 Nov 25.
Mitotic count (MC) is an important histological parameter for cancer diagnosis and grading, but the manual process for obtaining MC from whole-slide histopathological images is very time-consuming and prone to error. Therefore, deep learning models have been proposed to facilitate this process. Existing approaches utilize a two-stage pipeline: the detection stage for identifying the locations of potential mitotic cells and the classification stage for refining prediction confidences. However, this pipeline formulation can lead to inconsistencies in the classification stage due to the poor prediction quality of the detection stage and the mismatches in training data distributions between the two stages. In this study, we propose a Refine Cascade Network (ReCasNet), an enhanced deep learning pipeline that mitigates the aforementioned problems with three improvements. First, window relocation was used to reduce the number of poor quality false positives generated during the detection stage. Second, object re-cropping was performed with another deep learning model to adjust poorly centered objects. Third, improved data selection strategies were introduced during the classification stage to reduce the mismatches in training data distributions. ReCasNet was evaluated on two large-scale mitotic figure recognition datasets, canine cutaneous mast cell tumor (CCMCT) and canine mammary carcinoma (CMC), which resulted in up to 4.8% percentage point improvements in the F1 scores for mitotic cell detection and 44.1% reductions in mean absolute percentage error (MAPE) for MC prediction. Techniques that underlie ReCasNet can be generalized to other two-stage object detection pipeline and should contribute to improving the performances of deep learning models in broad digital pathology applications.
有丝分裂计数 (MC) 是癌症诊断和分级的重要组织学参数,但从全切片组织病理学图像中获取 MC 的手动过程非常耗时且容易出错。因此,已经提出了深度学习模型来促进这一过程。现有的方法利用两阶段流水线:检测阶段用于识别潜在有丝分裂细胞的位置,分类阶段用于细化预测置信度。然而,由于检测阶段的预测质量较差以及两个阶段的训练数据分布不匹配,这种流水线形式可能会导致分类阶段的不一致。在这项研究中,我们提出了一种改进的深度学习流水线 Refine Cascade Network (ReCasNet),该流水线通过三个改进来解决上述问题。首先,使用窗口重新定位减少了检测阶段产生的大量低质量假阳性。其次,使用另一个深度学习模型进行对象重新裁剪,以调整中心不佳的对象。最后,在分类阶段引入了改进的数据选择策略,以减少训练数据分布的不匹配。ReCasNet 在两个大规模有丝分裂图像识别数据集——犬皮肤肥大细胞瘤 (CCMCT) 和犬乳腺肿瘤 (CMC) 上进行了评估,在有丝分裂细胞检测的 F1 分数方面提高了高达 4.8%,在 MC 预测的平均绝对百分比误差 (MAPE) 方面降低了 44.1%。ReCasNet 所基于的技术可以推广到其他两阶段目标检测流水线,并有助于提高深度学习模型在广泛的数字病理学应用中的性能。