Department of Cardiothoracic Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pa.
Department of Radiology, The Affiliated Zhongshan Hospital of Dalian University, Dalian, China.
J Thorac Cardiovasc Surg. 2022 Apr;163(4):1496-1505.e10. doi: 10.1016/j.jtcvs.2021.02.010. Epub 2021 Feb 16.
The study objective was to investigate if machine learning algorithms can predict whether a lung nodule is benign, adenocarcinoma, or its preinvasive subtype from computed tomography images alone.
A dataset of chest computed tomography scans containing lung nodules was collected with their pathologic diagnosis from several sources. The dataset was split randomly into training (70%), internal validation (15%), and independent test sets (15%) at the patient level. Two machine learning algorithms were developed, trained, and validated. The first algorithm used the support vector machine model, and the second used deep learning technology: a convolutional neural network. Receiver operating characteristic analysis was used to evaluate the performance of the classification on the test dataset.
The support vector machine/convolutional neural network-based models classified nodules into 6 categories resulting in an area under the curve of 0.59/0.65 when differentiating atypical adenomatous hyperplasia versus adenocarcinoma in situ, 0.87/0.86 with minimally invasive adenocarcinoma versus invasive adenocarcinoma, 0.76/0.72 atypical adenomatous hyperplasia + adenocarcinoma in situ versus minimally invasive adenocarcinoma, 0.89/0.87 atypical adenomatous hyperplasia + adenocarcinoma in situ versus minimally invasive adenocarcinoma + invasive adenocarcinoma, and 0.93/0.92 atypical adenomatous hyperplasia + adenocarcinoma in situ + minimally invasive adenocarcinoma versus invasive adenocarcinoma. Classifying benign versus atypical adenomatous hyperplasia + adenocarcinoma in situ + minimally invasive adenocarcinoma versus invasive adenocarcinoma resulted in a micro-average area under the curve of 0.93/0.94 for the support vector machine/convolutional neural network models, respectively. The convolutional neural network-based methods had higher sensitivities than the support vector machine-based methods but lower specificities and accuracies.
The machine learning algorithms demonstrated reasonable performance in differentiating benign versus preinvasive versus invasive adenocarcinoma from computed tomography images alone. However, the prediction accuracy varies across its subtypes. This holds the potential for improved diagnostic capabilities with less-invasive means.
本研究旨在探讨机器学习算法是否可以仅通过 CT 图像预测肺结节的良恶性、腺癌或其癌前病变亚型。
从多个来源收集包含肺结节的胸部 CT 扫描数据集,并记录其病理诊断。数据集在患者水平上随机分为训练集(70%)、内部验证集(15%)和独立测试集(15%)。开发并验证了两种机器学习算法。第一种算法使用支持向量机模型,第二种算法使用深度学习技术:卷积神经网络。使用受试者工作特征分析评估分类器在测试数据集上的性能。
支持向量机/卷积神经网络模型将结节分为 6 类,在区分非典型腺瘤样增生与原位腺癌、微浸润性腺癌与浸润性腺癌、非典型腺瘤样增生+原位腺癌与微浸润性腺癌、非典型腺瘤样增生+原位腺癌与微浸润性腺癌+浸润性腺癌、非典型腺瘤样增生+原位腺癌+微浸润性腺癌与浸润性腺癌方面,曲线下面积分别为 0.59/0.65、0.87/0.86、0.76/0.72、0.89/0.87 和 0.93/0.92。将良性与非典型腺瘤样增生+原位腺癌+微浸润性腺癌+浸润性腺癌进行分类,支持向量机/卷积神经网络模型的平均曲线下面积分别为 0.93/0.94。基于卷积神经网络的方法比基于支持向量机的方法具有更高的敏感性,但特异性和准确性较低。
机器学习算法在单独使用 CT 图像区分良性、癌前病变与浸润性腺癌方面表现出了合理的性能。然而,其在预测各亚型方面的准确性存在差异。这为利用更微创的手段提高诊断能力提供了潜力。