Paul Rahul, Schabath Matthew B, Gillies Robert, Hall Lawrence O, Goldgof Dmitry B
University of South Florida, Department of Computer Science and Engineering, Tampa, Florida, United States.
H. L. Moffitt Cancer Center and Research Institute, Department of Cancer Epidemiology, Tampa, Florida, United States.
J Med Imaging (Bellingham). 2020 Mar;7(2):024502. doi: 10.1117/1.JMI.7.2.024502. Epub 2020 Apr 6.
: Due to the high incidence and mortality rates of lung cancer worldwide, early detection of a precancerous lesion is essential. Low-dose computed tomography is a commonly used technique for screening, diagnosis, and prognosis of non-small-cell lung cancer. Recently, convolutional neural networks (CNN) had shown great potential in lung nodule classification. Clinical information (family history, gender, and smoking history) together with nodule size provide information about lung cancer risk. Large nodules have greater risk than small nodules. A subset of cases from the National Lung Screening Trial was chosen as a dataset in our study. We divided the nodules into large and small nodules based on different clinical guideline thresholds and then analyzed the groups individually. Similarly, we also analyzed clinical features by dividing them into groups. CNNs were designed and trained over each of these groups individually. To our knowledge, this is the first study to incorporate nodule size and clinical features for classification using CNN. We further made a hybrid model using an ensemble with the CNN models of clinical and size information to enhance malignancy prediction. From our study, we obtained 0.9 AUC and 83.12% accuracy, which was a significant improvement over our previous best results. In conclusion, we found that dividing the nodules by size and clinical information for building predictive models resulted in improved malignancy predictions. Our analysis also showed that appropriately integrating clinical information and size groups could further improve risk prediction.
由于全球肺癌的高发病率和死亡率,癌前病变的早期检测至关重要。低剂量计算机断层扫描是用于非小细胞肺癌筛查、诊断和预后的常用技术。最近,卷积神经网络(CNN)在肺结节分类中显示出巨大潜力。临床信息(家族史、性别和吸烟史)以及结节大小提供了有关肺癌风险的信息。大结节比小结节风险更高。在我们的研究中,选择了国家肺癌筛查试验的一部分病例作为数据集。我们根据不同的临床指南阈值将结节分为大结节和小结节,然后分别对这些组进行分析。同样,我们也通过分组分析临床特征。针对这些组分别设计并训练了卷积神经网络。据我们所知,这是第一项将结节大小和临床特征纳入使用卷积神经网络进行分类的研究。我们进一步使用临床和大小信息的卷积神经网络模型集成构建了一个混合模型,以增强恶性肿瘤预测。从我们的研究中,我们获得了0.9的曲线下面积(AUC)和83.12%的准确率,这比我们之前的最佳结果有了显著提高。总之,我们发现按大小和临床信息对结节进行划分以构建预测模型可提高恶性肿瘤预测。我们的分析还表明,适当地整合临床信息和大小组可以进一步改善风险预测。