Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, 610065, P.R.China.
Department of Respiratory and Critical Care Medicine, West China School/West China Hospital, Sichuan University, Chengdu, Sichuan Province 610041, PR China.
Med Image Anal. 2020 Oct;65:101772. doi: 10.1016/j.media.2020.101772. Epub 2020 Jul 8.
The accurate identification of malignant lung nodules using computed tomography (CT) screening images is vital for the early detection of lung cancer. It also offers patients the best chance of cure, because non-invasive CT imaging has the ability to capture intra-tumoral heterogeneity. Deep learning methods have obtained promising results for the malignancy identification problem; however, two substantial challenges still remain. First, small datasets cannot insufficiently train the model and tend to overfit it. Second, category imbalance in the data is a problem. In this paper, we propose a method called MSCS-DeepLN that evaluates lung nodule malignancy and simultaneously solves these two problems. Three light models are trained and combined to evaluate the malignancy of a lung nodule. Three-dimensional convolutional neural networks (CNNs) are employed as the backbone of each light model to extract the lung nodule features from CT images and preserve lung nodule spatial heterogeneity. Multi-scale input cropped from CT images enables the sub-networks to learn the multi-level contextual features and preserve diverse. To tackle the imbalance problem, our proposed method employs an AUC approximation as the penalty term. During training, the error in this penalty term is generated from each major and minor class pair, so that negatives and positives can contribute equally to updating this model. Based on these methods, we obtain state-of-the-art results on the LIDC-IDRI dataset. Furthermore, we constructed a new dataset collected from a grade-A tertiary hospital and annotated using biopsy-based cytological analysis to verify the performance of our method in clinical practice.
使用计算机断层扫描 (CT) 筛查图像准确识别恶性肺结节对于早期发现肺癌至关重要。它还为患者提供了最佳的治愈机会,因为无创 CT 成像有能力捕获肿瘤内异质性。深度学习方法在恶性肿瘤识别问题上取得了有希望的结果;然而,仍然存在两个重大挑战。首先,小数据集不能充分训练模型,并且容易过拟合。其次,数据中的类别不平衡是一个问题。在本文中,我们提出了一种称为 MSCS-DeepLN 的方法,用于评估肺结节的恶性程度,并同时解决这两个问题。训练了三个轻量级模型并进行组合,以评估肺结节的恶性程度。三维卷积神经网络 (CNN) 被用作每个轻量级模型的骨干,从 CT 图像中提取肺结节特征并保留肺结节空间异质性。从 CT 图像裁剪的多尺度输入使子网络能够学习多层次的上下文特征并保留多样性。为了解决不平衡问题,我们提出的方法采用 AUC 逼近作为惩罚项。在训练过程中,该惩罚项的误差来自每个主要和次要类别的每一对,以便负数和正数可以平等地为更新此模型做出贡献。基于这些方法,我们在 LIDC-IDRI 数据集上取得了最先进的结果。此外,我们构建了一个新的数据集,该数据集是从一家甲级三甲医院收集的,并使用基于活检的细胞学分析进行了注释,以验证我们的方法在临床实践中的性能。