• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

良性和恶性肺结节的放射组学特征标准化技术评估

Evaluation of radiomic feature harmonization techniques for benign and malignant pulmonary nodules.

作者信息

Huchthausen Claire, Shi Menglin, de Sousa Gabriel L A, Colen Jonathan, Shelley Emery, Larner James, Janowski Einsley, Wijesooriya Krishni

机构信息

Department of Physics, University of Virginia.

Department of Biomedical Engineering, Northwestern University.

出版信息

ArXiv. 2025 Jan 15:arXiv:2412.16758v2.

PMID:39876938
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11774441/
Abstract

BACKGROUND

Conventional methods for detecting lung cancer early are often qualitative and subject to interpretation. Radiomics provides quantitative characteristics of pulmonary nodules (PNs) in medical images, but variability in medical image acquisition is an obstacle to consistent clinical application of these quantitative features. Correcting radiomic features' dependency on acquisition parameters is problematic when combining data from benign and malignant PNs, as is necessary when the goal is to diagnose lung cancer, because acquisition effects may differ between them due to their biological differences.

PURPOSE

We evaluated whether we must account for biological differences between benign and malignant PNs when correcting the dependency of radiomic features on acquisition parameters, and we compared methods of doing this using ComBat harmonization.

METHODS

This study used a dataset of 567 clinical chest CT scans containing both malignant and benign PNs. Scans were grouped as benign, malignant, or lung cancer screening (mixed benign and malignant). Preprocessing and feature extraction from ROIs were performed using PyRadiomics. Optimized Permutation Nested ComBat harmonization was performed on extracted features to account for variability in four imaging protocols: contrast enhancement, scanner manufacturer, acquisition voltage, focal spot size. Three methods were compared: harmonizing all data collectively in the standard manner, harmonizing all data with a covariate to preserve distinctions between subgroups, and harmonizing subgroups separately. A significant ( ≤ 0.05) Kruskal-Wallis test determined whether harmonization removed a feature's dependency on an acquisition parameter. A LASSO-SVM pipeline was trained using acquisition-independent radiomic features to predict whether PNs were malignant or benign. To evaluate the predictive information made available by each harmonization method, the trained harmonization estimators and predictive model were applied to a corresponding unseen test set. Harmonization and predictive performance metrics were assessed over 10 trials of 5-fold cross validation.

RESULTS

Kruskal-Wallis defined an average 2.1% of features (95% CI: 1.9-2.4%) as acquisition-independent when data were harmonized collectively, 27.3% of features (95% CI: 25.7-28.9%) as acquisition-independent when harmonized with a covariate, and 90.9% of features (95% CI: 90.4-91.5%) as acquisition-independent when harmonized separately. LASSO-SVM models trained on data harmonized separately or with a covariate had higher ROC-AUC for lung cancer screening scans than models trained on data harmonized without distinction between benign and malignant tissues (Delong test, Holm-Bonferroni adjusted ≤ 0.05). There was not a conclusive difference in ROC-AUC between models trained on data harmonized separately and models trained on data harmonized with a covariate.

CONCLUSIONS

Radiomic features of benign and malignant PNs require different corrective transformations to recover acquisition-independent distributions. This can be done using separate harmonization or harmonization with a covariate. Separate harmonization enabled the greatest number of predictive features to be used in a machine learning model to retrospectively detect lung cancer. Features harmonized separately and features harmonized with a covariate enabled predictive models to achieve similar performance on lung cancer screening scans.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/650c/11774441/0ab48de4d02a/nihpp-2412.16758v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/650c/11774441/ff43548ee54d/nihpp-2412.16758v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/650c/11774441/6f3ccd8269e5/nihpp-2412.16758v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/650c/11774441/0ab48de4d02a/nihpp-2412.16758v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/650c/11774441/ff43548ee54d/nihpp-2412.16758v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/650c/11774441/6f3ccd8269e5/nihpp-2412.16758v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/650c/11774441/0ab48de4d02a/nihpp-2412.16758v2-f0003.jpg
摘要

背景

传统的早期肺癌检测方法通常是定性的,且易受主观解读影响。放射组学可提供医学图像中肺结节(PNs)的定量特征,但医学图像采集的变异性是这些定量特征在临床中一致应用的障碍。当结合来自良性和恶性PNs的数据时,校正放射组学特征对采集参数的依赖性存在问题,而在旨在诊断肺癌时这是必要的,因为由于它们的生物学差异,它们之间的采集效应可能不同。

目的

我们评估了在校正放射组学特征对采集参数的依赖性时是否必须考虑良性和恶性PNs之间的生物学差异,并使用ComBat归一化比较了实现此目的的方法。

方法

本研究使用了包含恶性和良性PNs的567例临床胸部CT扫描数据集。扫描被分为良性、恶性或肺癌筛查(良性和恶性混合)。使用PyRadiomics对感兴趣区域进行预处理和特征提取。对提取的特征进行优化排列嵌套ComBat归一化,以考虑四种成像协议中的变异性:对比增强、扫描仪制造商、采集电压、焦点尺寸。比较了三种方法:以标准方式统一所有数据、使用协变量统一所有数据以保留亚组之间的差异、分别统一亚组。显著(≤0.05)的Kruskal-Wallis检验确定归一化是否消除了特征对采集参数的依赖性。使用与采集无关的放射组学特征训练LASSO-SVM管道,以预测PNs是恶性还是良性。为了评估每种归一化方法提供的预测信息,将训练好的归一化估计器和预测模型应用于相应的未见过的测试集。在10次5折交叉验证试验中评估归一化和预测性能指标。

结果

当数据统一处理时,Kruskal-Wallis将平均2.1%的特征(95%CI:1.9-2.4%)定义为与采集无关;当使用协变量统一时,27.3%的特征(95%CI:25.7-28.9%)为与采集无关;当分别统一时,90.9%的特征(95%CI:90.4-91.5%)为与采集无关。在肺癌筛查扫描中,基于分别统一或使用协变量统一的数据训练的LASSO-SVM模型比基于未区分良性和恶性组织统一的数据训练的模型具有更高的ROC-AUC(Delong检验,Holm-Bonferroni校正≤0.05)。基于分别统一的数据训练的模型和基于使用协变量统一的数据训练的模型在ROC-AUC上没有决定性差异。

结论

良性和恶性PNs的放射组学特征需要不同的校正变换以恢复与采集无关的分布。这可以通过分别归一化或使用协变量归一化来实现。分别归一化使得在机器学习模型中能够使用最多数量的预测特征来回顾性检测肺癌。分别归一化的特征和使用协变量归一化的特征使预测模型在肺癌筛查扫描中具有相似的性能。

相似文献

1
Evaluation of radiomic feature harmonization techniques for benign and malignant pulmonary nodules.良性和恶性肺结节的放射组学特征标准化技术评估
ArXiv. 2025 Jan 15:arXiv:2412.16758v2.
2
External validation of radiomics-based predictive models in low-dose CT screening for early lung cancer diagnosis.基于影像组学的预测模型在低剂量CT筛查早期肺癌诊断中的外部验证
Med Phys. 2020 Sep;47(9):4125-4136. doi: 10.1002/mp.14308. Epub 2020 Jun 23.
3
Differential Radiomics-Based Signature Predicts Lung Cancer Risk Accounting for Imaging Parameters in NLST Cohort.基于放射组学差异的签名可预测 NLST 队列中考虑成像参数的肺癌风险。
Cancer Med. 2024 Oct;13(20):e70359. doi: 10.1002/cam4.70359.
4
How can we combat multicenter variability in MR radiomics? Validation of a correction procedure.如何应对磁共振影像组学的多中心变异性?一种校正程序的验证。
Eur Radiol. 2021 Apr;31(4):2272-2280. doi: 10.1007/s00330-020-07284-9. Epub 2020 Sep 25.
5
Phenotyping the Histopathological Subtypes of Non-Small-Cell Lung Carcinoma: How Beneficial Is Radiomics?非小细胞肺癌组织病理学亚型的表型分析:影像组学有多大益处?
Diagnostics (Basel). 2023 Mar 18;13(6):1167. doi: 10.3390/diagnostics13061167.
6
Radiomics analysis of pulmonary nodules in low-dose CT for early detection of lung cancer.低剂量 CT 肺结节放射组学分析用于肺癌早期检测。
Med Phys. 2018 Apr;45(4):1537-1549. doi: 10.1002/mp.12820. Epub 2018 Mar 12.
7
Generalized ComBat harmonization methods for radiomic features with multi-modal distributions and multiple batch effects.具有多模态分布和多个批次效应的放射组学特征的广义 ComBat 协调方法。
Sci Rep. 2022 Mar 16;12(1):4493. doi: 10.1038/s41598-022-08412-9.
8
Preliminary report on harmonization of features extraction process using the ComBat tool in the multi-center "Blue Sky Radiomics" study on stage III unresectable NSCLC.在多中心“蓝天放射组学”研究中,使用ComBat工具对III期不可切除非小细胞肺癌特征提取过程进行标准化的初步报告。
Insights Imaging. 2022 Mar 7;13(1):38. doi: 10.1186/s13244-022-01171-1.
9
A transfer learning approach to facilitate ComBat-based harmonization of multicentre radiomic features in new datasets.一种迁移学习方法,用于促进基于 ComBat 的多中心放射组学特征在新数据集上的协调。
PLoS One. 2021 Jul 1;16(7):e0253653. doi: 10.1371/journal.pone.0253653. eCollection 2021.
10
The impact of the combat method on radiomics feature compensation and analysis of scanners from different manufacturers.战斗方法对不同制造商扫描仪的放射组学特征补偿和分析的影响。
BMC Med Imaging. 2024 Jun 6;24(1):137. doi: 10.1186/s12880-024-01306-4.

本文引用的文献

1
Reproducibility in Radiomics: A Comparison of Feature Extraction Methods and Two Independent Datasets.放射组学中的可重复性:特征提取方法与两个独立数据集的比较
Appl Sci (Basel). 2024 Feb 20;166(1). doi: 10.3390/app13127291.
2
Cancer statistics, 2023.癌症统计数据,2023 年。
CA Cancer J Clin. 2023 Jan;73(1):17-48. doi: 10.3322/caac.21763.
3
Improved generalized ComBat methods for harmonization of radiomic features.改进的广义 ComBat 方法用于协调放射组学特征。
Sci Rep. 2022 Nov 8;12(1):19009. doi: 10.1038/s41598-022-23328-0.
4
Development of a robust radiomic biomarker of progression-free survival in advanced non-small cell lung cancer patients treated with first-line immunotherapy.开发一种稳健的放射组学生存无进展标志物,用于一线免疫治疗治疗的晚期非小细胞肺癌患者。
Sci Rep. 2022 Jun 15;12(1):9993. doi: 10.1038/s41598-022-14160-7.
5
Radiomic Phenotypes for Improving Early Prediction of Survival in Stage III Non-Small Cell Lung Cancer Adenocarcinoma after Chemoradiation.用于改善 III 期非小细胞肺癌腺癌放化疗后生存早期预测的放射组学表型
Cancers (Basel). 2022 Jan 29;14(3):700. doi: 10.3390/cancers14030700.
6
A Guide to ComBat Harmonization of Imaging Biomarkers in Multicenter Studies.多中心研究中成像生物标志物的 ComBat 均衡化处理指南。
J Nucl Med. 2022 Feb;63(2):172-179. doi: 10.2967/jnumed.121.262464. Epub 2021 Sep 16.
7
Reproducibility of CT-based radiomic features against image resampling and perturbations for tumour and healthy kidney in renal cancer patients.基于 CT 的放射组学特征在肾癌患者的肿瘤和健康肾脏的图像重采样和干扰方面的可重复性。
Sci Rep. 2021 Jun 2;11(1):11542. doi: 10.1038/s41598-021-90985-y.
8
Exploring the variability of radiomic features of lung cancer lesions on unenhanced and contrast-enhanced chest CT imaging.探讨非增强和增强胸部 CT 成像中肺癌病变的放射组学特征的可变性。
Phys Med. 2021 Feb;82:321-331. doi: 10.1016/j.ejmp.2021.02.014. Epub 2021 Mar 12.
9
Radiographic Texture Reproducibility: The Impact of Different Materials, their Arrangement, and Focal Spot Size.射线照相纹理再现性:不同材料、其排列方式及焦点尺寸的影响
J Med Signals Sens. 2020 Nov 11;10(4):275-285. doi: 10.4103/jmss.JMSS_64_19. eCollection 2020 Oct-Dec.
10
Minimizing acquisition-related radiomics variability by image resampling and batch effect correction to allow for large-scale data analysis.通过图像重采样和批处理效应校正来最小化获取相关的放射组学变异性,从而实现大规模数据分析。
Eur Radiol. 2021 Mar;31(3):1460-1470. doi: 10.1007/s00330-020-07174-0. Epub 2020 Sep 9.