Bhattacharya Indrani, Stacke Karin, Chan Emily, Lee Jeong Hoon, Tse Justin R, Liang Tie, Brooks James D, Sonn Geoffrey A, Rusu Mirabela
Department of Radiology, Stanford University, Stanford, California, USA.
Sectra, Linköping, Sweden.
Med Phys. 2025 Jan;52(1):300-320. doi: 10.1002/mp.17476. Epub 2024 Oct 24.
Renal cell carcinoma (RCC) is a common cancer that varies in clinical behavior. Clear cell RCC (ccRCC) is the most common RCC subtype, with both aggressive and indolent manifestations. Indolent ccRCC is often low-grade without necrosis and can be monitored without treatment. Aggressive ccRCC is often high-grade and can cause metastasis and death if not promptly detected and treated. While most RCCs are detected on computed tomography (CT) scans, aggressiveness classification is based on pathology images acquired from invasive biopsy or surgery.
CT imaging-based aggressiveness classification would be an important clinical advance, as it would facilitate non-invasive risk stratification and treatment planning. Here, we present a novel machine learning method, Correlated Feature Aggregation By Region (CorrFABR), for CT-based aggressiveness classification of ccRCC.
CorrFABR is a multimodal fusion algorithm that learns from radiology and pathology images, and clinical variables in a clinically-relevant manner. CorrFABR leverages registration-independent radiology (CT) and pathology image correlations using features from vision transformer-based foundation models to facilitate aggressiveness assessment on CT images. CorrFABR consists of three main steps: (a) Feature aggregation where region-level features are extracted from radiology and pathology images at widely varying image resolutions, (b) Fusion where radiology features correlated with pathology features (pathology-informed CT biomarkers) are learned, and (c) Classification where the learned pathology-informed CT biomarkers, together with clinical variables of tumor diameter, gender, and age, are used to distinguish aggressive from indolent ccRCC using multi-layer perceptron-based classifiers. Pathology images are only required in the first two steps of CorrFABR, and are not required in the prediction module. Therefore, CorrFABR integrates information from CT images, pathology images, and clinical variables during training, but for inference, it relies solely on CT images and clinical variables, ensuring its clinical applicability. CorrFABR was trained with heterogenous, publicly-available data from 298 ccRCC tumors (136 indolent tumors, 162 aggressive tumors) in a five-fold cross-validation setup and evaluated on an independent test set of 74 tumors with a balanced distribution of aggressive and indolent tumors. Ablation studies were performed to test the utility of each component of CorrFABR.
CorrFABR outperformed the other classification methods, achieving an ROC-AUC (area under the curve) of 0.855 ± 0.0005 (95% confidence interval: 0.775, 0.947), F1-score of 0.793 ± 0.029, sensitivity of 0.741 ± 0.058, and specificity of 0.876 ± 0.032 in classifying ccRCC as aggressive or indolent subtypes. It was found that pathology-informed CT biomarkers learned through registration-independent correlation learning improves classification performance over using CT features alone, irrespective of the kind of features or the classification model used. Tumor diameter, gender, and age provide complementary clinical information, and integrating pathology-informed CT biomarkers with these clinical variables further improves performance.
CorrFABR provides a novel method for CT-based aggressiveness classification of ccRCC by enabling the identification of pathology-informed CT biomarkers, and integrating them with clinical variables. CorrFABR enables learning of these pathology-informed CT biomarkers through a novel registration-independent correlation learning module that considers unaligned radiology and pathology images at widely varying image resolutions.
肾细胞癌(RCC)是一种临床行为各异的常见癌症。透明细胞肾细胞癌(ccRCC)是最常见的RCC亚型,具有侵袭性和惰性两种表现形式。惰性ccRCC通常为低级别,无坏死,可在不进行治疗的情况下进行监测。侵袭性ccRCC通常为高级别,如果不及时检测和治疗,可能会导致转移和死亡。虽然大多数RCC是在计算机断层扫描(CT)上检测到的,但侵袭性分类是基于从侵入性活检或手术获取的病理图像。
基于CT成像的侵袭性分类将是一项重要的临床进展,因为它将有助于进行非侵入性风险分层和治疗规划。在此,我们提出一种新的机器学习方法,即基于区域的相关特征聚合(CorrFABR),用于基于CT的ccRCC侵袭性分类。
CorrFABR是一种多模态融合算法,以临床相关的方式从放射学和病理图像以及临床变量中学习。CorrFABR利用基于视觉Transformer的基础模型的特征,利用与配准无关的放射学(CT)和病理图像相关性,以促进对CT图像的侵袭性评估。CorrFABR包括三个主要步骤:(a)特征聚合,从具有广泛不同图像分辨率的放射学和病理图像中提取区域级特征;(b)融合,学习与病理特征相关的放射学特征(病理信息CT生物标志物);(c)分类,使用基于多层感知器的分类器,将学习到的病理信息CT生物标志物与肿瘤直径、性别和年龄等临床变量一起用于区分侵袭性和惰性ccRCC。病理图像仅在CorrFABR的前两个步骤中需要,在预测模块中不需要。因此,CorrFABR在训练期间整合了来自CT图像、病理图像和临床变量的信息,但在推理时,它仅依赖于CT图像和临床变量,确保了其临床适用性。CorrFABR在五折交叉验证设置中使用来自298个ccRCC肿瘤(136个惰性肿瘤,162个侵袭性肿瘤)的异质公开可用数据进行训练,并在一个包含74个肿瘤的独立测试集上进行评估,该测试集侵袭性和惰性肿瘤分布均衡。进行了消融研究以测试CorrFABR每个组件的效用。
在将ccRCC分类为侵袭性或惰性亚型方面,CorrFABR优于其他分类方法,实现了0.855±0.0005(95%置信区间:0.7