Li Jin, Wang Peng, Zhou Yang, Liang Hong, Luan Kuan
College of Intelligent System Science and Engineering, Harbin Engineering University, Harbin, China.
Department of Radiology, Harbin Medical University Cancer Hospital, Harbin, China.
Front Bioeng Biotechnol. 2021 Jan 14;8:620257. doi: 10.3389/fbioe.2020.620257. eCollection 2020.
The classification of colorectal cancer (CRC) lymph node metastasis (LNM) is a vital clinical issue related to recurrence and design of treatment plans. However, it remains unclear which method is effective in automatically classifying CRC LNM. Hence, this study compared the performance of existing classification methods, i.e., machine learning, deep learning, and deep transfer learning, to identify the most effective method. A total of 3,364 samples (1,646 positive and 1,718 negative) from Harbin Medical University Cancer Hospital were collected. All patches were manually segmented by experienced radiologists, and the image size was based on the lesion to be intercepted. Two classes of global features and one class of local features were extracted from the patches. These features were used in eight machine learning algorithms, while the other models used raw data. Experiment results showed that deep transfer learning was the most effective method with an accuracy of 0.7583 and an area under the curve of 0.7941. Furthermore, to improve the interpretability of the results from the deep learning and deep transfer learning models, the classification heat-map features were used, which displayed the region of feature extraction by superposing with raw data. The research findings are expected to promote the use of effective methods in CRC LNM detection and hence facilitate the design of proper treatment plans.
结直肠癌(CRC)淋巴结转移(LNM)的分类是一个与复发和治疗方案设计相关的重要临床问题。然而,目前尚不清楚哪种方法能有效地自动对CRC LNM进行分类。因此,本研究比较了现有分类方法(即机器学习、深度学习和深度迁移学习)的性能,以确定最有效的方法。我们收集了哈尔滨医科大学附属肿瘤医院的3364个样本(1646个阳性和1718个阴性)。所有图像块均由经验丰富的放射科医生手动分割,图像大小基于要截取的病变。从图像块中提取了两类全局特征和一类局部特征。这些特征被用于八种机器学习算法,而其他模型使用原始数据。实验结果表明,深度迁移学习是最有效的方法,准确率为0.7583,曲线下面积为0.7941。此外,为了提高深度学习和深度迁移学习模型结果的可解释性,使用了分类热图特征,通过与原始数据叠加来显示特征提取区域。研究结果有望促进有效方法在CRC LNM检测中的应用,从而有助于制定合适的治疗方案。