Department of Radiology, Perelman School of Medicine and Hospital of the University of Pennsylvania, University of Pennsylvania, 3400 Spruce Street, Philadelphia, PA, 19104, USA.
Sci Rep. 2023 Feb 4;13(1):2040. doi: 10.1038/s41598-023-29058-1.
High-throughput extraction of radiomic features from low-dose CT scans can characterize the heterogeneity of the lung parenchyma and potentially aid in identifying subpopulations that may have higher risk of lung diseases, such as COPD, and lung cancer due to inflammation or obstruction of the airways. We aim to determine the feasibility of a lung radiomics phenotyping approach in a lung cancer screening cohort, while quantifying the effect of different CT reconstruction algorithms on phenotype robustness. We identified low-dose CT scans (n = 308) acquired with Siemens Healthineers scanners from patients who completed low-dose CT within our lung cancer screening program between 2015 and 2018 and had two different sets of image reconstructions kernel available (i.e., medium (I30f.), sharp (I50f.)) for the same acquisition. Following segmentation of the lung field, a total of 26 radiomic features were extracted from the entire 3D lung-field using a previously validated fully-automated lattice-based software pipeline, adapted for low-dose CT scans. The lattice in-house software was used to extract features including gray-level histogram, co-occurrence, and run-length descriptors. The lattice approach uses non-overlapping windows for traversing along pixels of images and calculates different features. Each feature was averaged for each scan within a range of lattice window sizes (W) of 4, 8 and 20 mm. The extracted imaging features from both datasets were harmonized to correct for differences in image acquisition parameters. Subsequently, unsupervised hierarchical clustering was applied on the extracted features to identify distinct phenotypic patterns of the lung parenchyma, where consensus clustering was used to identify the optimal number of clusters (K = 2). Differences between phenotypes for demographic and clinical covariates including sex, age, BMI, pack-years of smoking, Lung-RADS and cancer diagnosis were assessed for each phenotype cluster, and then compared across clusters for the two different CT reconstruction algorithms using the cluster entanglement metric, where a lower entanglement coefficient corresponds to good cluster alignment. Furthermore, an independent set of low-dose CT scans (n = 88) from patients with available pulmonary function data on lung obstruction were analyzed using the identified optimal clusters to assess associations to lung obstruction and validate the lung phenotyping paradigm. Heatmaps generated by radiomic features identified two distinct lung parenchymal phenotype patterns across different feature extraction window sizes, for both reconstruction algorithms (P < 0.05 with K = 2). Associations of radiomic-based clusters with clinical covariates showed significant differences for BMI and pack-years of smoking (P < 0.05) for both reconstruction kernels. Radiomic phenotype patterns were more similar across the two reconstructed kernels, when smaller window sizes (W = 4 and 8 mm) were used for radiomic feature extraction, as deemed by their entanglement coefficient. Validation of clustering approaches using cluster mapping for the independent sample with lung obstruction also showed two statistically significant phenotypes (P < 0.05) with significant difference for BMI and smoking pack-years. Radiomic analysis can be used to characterize lung parenchymal phenotypes from low-dose CT scans, which appear reproducible for different reconstruction kernels. Further work should seek to evaluate the effect of additional CT acquisition parameters and validate these phenotypes in characterizing lung cancer screening populations, to potentially better stratify disease patterns and cancer risk.
从低剂量 CT 扫描中高通量提取放射组学特征可以描述肺实质的异质性,并有可能有助于识别由于气道炎症或阻塞而可能患 COPD 和肺癌等肺部疾病风险更高的亚群。我们旨在确定肺癌筛查队列中肺放射组学表型方法的可行性,同时量化不同 CT 重建算法对表型稳健性的影响。我们从 2015 年至 2018 年期间在我们的肺癌筛查计划中完成低剂量 CT 检查的患者中确定了低剂量 CT 扫描(n=308),这些患者具有两种不同的可用图像重建内核(即中等(I30f.),锐利(I50f.))相同采集。在对肺野进行分割后,使用先前经过验证的完全自动化的基于晶格的软件管道从整个 3D 肺野中提取了总共 26 个放射组学特征,该软件管道适用于低剂量 CT 扫描。使用内部晶格软件提取灰度直方图、共生和游程长度描述符等特征。晶格方法使用非重叠窗口沿图像的像素进行遍历,并计算不同的特征。对于晶格窗口大小(W)为 4、8 和 20 毫米的范围内的每个扫描,将每个特征进行平均。从两个数据集提取的成像特征被协调以校正图像采集参数的差异。随后,对提取的特征应用无监督层次聚类,以识别肺实质的不同表型模式,其中共识聚类用于识别最佳聚类数量(K=2)。然后,使用聚类纠缠度量评估每个表型簇的性别、年龄、BMI、吸烟包年数、Lung-RADS 和癌症诊断等人口统计学和临床协变量之间的表型差异,并使用聚类纠缠度量在两个不同的 CT 重建算法之间比较表型簇之间的差异,其中较低的纠缠系数对应于良好的聚类对齐。此外,使用具有可用肺阻塞功能数据的低剂量 CT 扫描(n=88)的独立数据集,使用识别出的最佳聚类来评估与肺阻塞的关联,并验证肺表型范例。不同特征提取窗口大小下的两种重建算法(K=2 时 P<0.05)生成的热图确定了两种不同的肺实质表型模式。基于放射组学的聚类与临床协变量的关联显示出 BMI 和吸烟包年数的显著差异(P<0.05)。当使用较小的窗口大小(W=4 和 8 毫米)进行放射组学特征提取时,放射组学表型模式在两个重建内核之间更相似,这是通过它们的纠缠系数来判断的。使用具有肺阻塞的独立样本的聚类映射进行聚类方法的验证也显示了两种具有统计学意义的表型(P<0.05),BMI 和吸烟包年数存在显著差异。放射组学分析可用于从低剂量 CT 扫描中描述肺实质表型,对于不同的重建内核似乎具有可重复性。进一步的工作应寻求评估其他 CT 采集参数的影响,并在表征肺癌筛查人群中验证这些表型,以潜在地更好地分层疾病模式和癌症风险。