School of Medical Imaging, Weifang Medical University.
Department of Radiology, Changzheng Hospital, Naval Medical University, Shanghai.
J Thorac Imaging. 2023 Sep 1;38(5):304-314. doi: 10.1097/RTI.0000000000000725. Epub 2023 Jul 10.
Reliable prediction of volume doubling time (VDT) is essential for the personalized management of pulmonary ground-glass nodules (GGNs). We aimed to determine the optimal VDT prediction method by comparing different machine learning methods only based on the baseline chest computed tomography (CT) images.
Seven classical machine learning methods were evaluated in terms of their stability and performance for VDT prediction. The VDT, calculated by the preoperative and baseline CT, was divided into 2 groups with a cutoff value of 400 days. A total of 90 GGNs from 3 hospitals constituted the training set, and 86 GGNs from the fourth hospital served as the external validation set. The training set was used for feature selection and model training, and the validation set was used to evaluate the predictive performance of the model independently.
The eXtreme Gradient Boosting showed the highest predictive performance (accuracy: 0.890±0.128 and area under the ROC curve (AUC): 0.896±0.134), followed by the neural network (NNet) (accuracy: 0.865±0.103 and AUC: 0.886±0.097). While regarding stability, the NNet showed the highest robustness against data perturbation (relative SDs [%] of mean AUC: 10.9%). Therefore, the NNet was chosen as the final model, achieving high accuracy of 0.756 in the external validation set.
The NNet is a promising machine learning method to predict the VDT of GGNs, which would assist in the personalized follow-up and treatment strategies for GGNs reducing unnecessary follow-up and radiation dose.
可靠的体积倍增时间(VDT)预测对于肺部磨玻璃结节(GGN)的个体化管理至关重要。我们旨在通过仅基于基线胸部计算机断层扫描(CT)图像比较不同的机器学习方法来确定最佳的 VDT 预测方法。
评估了 7 种经典机器学习方法在 VDT 预测中的稳定性和性能。通过术前和基线 CT 计算的 VDT,以 400 天的截止值分为 2 组。来自 3 家医院的 90 个 GGN 构成训练集,来自第 4 家医院的 86 个 GGN 作为外部验证集。训练集用于特征选择和模型训练,验证集用于独立评估模型的预测性能。
极端梯度提升(eXtreme Gradient Boosting)显示出最高的预测性能(准确性:0.890±0.128 和 ROC 曲线下面积(AUC):0.896±0.134),其次是神经网络(NNet)(准确性:0.865±0.103 和 AUC:0.886±0.097)。然而,在稳定性方面,NNet 对数据扰动显示出最高的稳健性(平均 AUC 的相对标准偏差 [%]:10.9%)。因此,选择 NNet 作为最终模型,在外部验证集中实现了 0.756 的高精度。
NNet 是一种很有前途的预测 GGN VDT 的机器学习方法,它将有助于对 GGN 进行个性化随访和治疗策略,减少不必要的随访和辐射剂量。