Department of Radiology, The General Hospital of Central Theater Command, PLA, Wuhan, 430070, Hubei, China.
School of Computer Science and Information Engineering, Hubei University, Wuhan, 430062, Hubei, China.
Sci Rep. 2023 Apr 11;13(1):5853. doi: 10.1038/s41598-023-32979-6.
To study the classification efficiency of using texture feature machine learning method in distinguishing solid lung adenocarcinoma (SADC) and tuberculous granulomatous nodules (TGN) that appear as solid nodules (SN) in non-enhanced CT images. 200 patients with SADC and TGN who underwent thoracic non-enhanced CT examination from January 2012 to October 2019 were included in the study, 490 texture eigenvalues of 6 categories were extracted from the lesions in the non-enhanced CT images of these patients for machine learning, the classification prediction model is established by using relatively the best classifier selected according to the fitting degree of learning curve in the process of machine learning, and the effectiveness of the model was tested and verified. The logistic regression model of clinical data (including demographic data and CT parameters and CT signs of solitary nodules) was used for comparison. The prediction model of clinical data was established by logistic regression, and the classifier was established by machine learning of radiologic texture features. The area under the curve was 0.82 and 0.65 for the prediction model based on clinical CT and only CT parameters and CT signs, and 0.870 based on Radiomics characteristics. The machine learning prediction model developed by us can improve the differentiation efficiency of SADC and TGN with SN, and provide appropriate support for treatment decisions.
研究纹理特征机器学习方法在区分非增强 CT 图像中表现为实性结节 (SN) 的实性肺腺癌 (SADC) 和结核性肉芽肿性结节 (TGN) 中的分类效率。纳入 2012 年 1 月至 2019 年 10 月间经胸部非增强 CT 检查的 200 例 SADC 和 TGN 患者,从这些患者的非增强 CT 图像中提取病变的 6 类 490 个纹理特征值,用于机器学习,通过使用根据学习曲线拟合度在机器学习过程中选择的相对最佳分类器,建立分类预测模型,并进行模型有效性的测试和验证。比较临床数据(包括人口统计学数据和 CT 参数及 CT 征象)的逻辑回归模型。通过逻辑回归建立临床数据的预测模型,通过放射组学特征的机器学习建立分类器。基于临床 CT 及仅 CT 参数和 CT 征象的预测模型的曲线下面积分别为 0.82 和 0.65,基于放射组学特征的为 0.870。我们建立的机器学习预测模型可以提高 SADC 和 TGN 伴 SN 的鉴别效率,为治疗决策提供适当的支持。