Chen Meixu, Wang Kai, Kapur Payal, Brugarolas James, Hannan Raquibul, Wang Jing
Medical Artificial Intelligence and Automation (MAIA) Lab, Department of Radiation Oncology, University of Texas Southwestern Medical Center, Dallas, TX, 75235, USA.
Department of Radiation Oncology, University of Maryland Medical Center, Baltimore, MD 21201, USA.
ArXiv. 2024 Dec 10:arXiv:2412.07136v1.
A reliable and comprehensive cancer prognosis model for clear cell renal cell carcinoma (ccRCC) could better assist in personalizing treatment. In this work, we developed a multi-modal ensemble model (MMEM) which integrates pretreatment clinical information, multi-omics data, and histopathology whole slide image (WSI) data to learn complementary information to predict overall survival (OS) and disease-free survival (DFS) for patients with ccRCC.
We collected 226 patients from The Cancer Genome Atlas Kidney Renal Clear Cell Carcinoma dataset (TCGA-KIRC). These patients have OS and DFS follow up data available and five different data modalities provided, including clinical information, pathology data in the form of WSI, and three multi-omics data, which comprise mRNA expression, miRNA expression (miRSeq), and DNA methylation data. Five sets of separate survival prediction models were constructed separately for OS and DFS. We used a traditional Cox-proportional hazards (CPH) model with iterative forward feature selection for clinical and multi-omics data. Four different types of pre-trained encoder models, comprising ResNet and three recently developed general purpose foundation models for computational pathology, were utilized to extract features from processed WSI patches. A deep learning-based CPH model was constructed to predict survival outcomes using these encoded WSI features. For each of the survival outcomes of interest, we weigh and combine the predicted risk scores from all the five models to generate the final prediction. Model weighting was based on the training performance. Five-fold cross validation was performed to train and test the proposed workflow.
We employed the concordance index (C-index) and area under the receiver operating characteristic curve (AUROC) metrics to assess the performance of our models for time-to-event prediction and time-specific binary prediction, respectively. Among the sub-models, the clinical feature based CPH model has the highest weight for both prediction tasks. For WSI-based prediction, the encoded feature using an image-based general purpose foundation model (UNI) showed the best prediction performance over other pretrained feature encoders. Our final model outperformed corresponding single-modality models on all prediction labels, achieving C-indices of 0.820 and 0.833 for OS and DFS, respectively. The AUROC values for binary prediction at follow-up of 3 year were 0.831 and 0.862 for patient death and cancer recurrence, respectively. Using the medians of predicted risks as thresholds to identify high-risk and low-risk patient groups, we performed log-rank tests, which revealed improved performance in both OS and DFS compared to single-modality models.
We developed the first multi-modal prediction model MMEM for ccRCC patients that integrates features across five different data modalities. Our model demonstrated better prognostic ability compared with corresponding single-modality models for both prediction targets. If findings are independently reproduced, it has the potential to assist in management of ccRCC patients.
一个可靠且全面的透明细胞肾细胞癌(ccRCC)癌症预后模型能够更好地辅助个性化治疗。在本研究中,我们开发了一种多模态集成模型(MMEM),该模型整合了治疗前临床信息、多组学数据和组织病理学全切片图像(WSI)数据,以学习互补信息来预测ccRCC患者的总生存期(OS)和无病生存期(DFS)。
我们从癌症基因组图谱肾透明细胞癌数据集(TCGA-KIRC)中收集了226例患者。这些患者有可用的OS和DFS随访数据,并提供了五种不同的数据模态,包括临床信息、WSI形式的病理数据以及三种多组学数据,即mRNA表达、miRNA表达(miRSeq)和DNA甲基化数据。针对OS和DFS分别构建了五组独立的生存预测模型。对于临床和多组学数据,我们使用带有迭代向前特征选择的传统Cox比例风险(CPH)模型。利用四种不同类型的预训练编码器模型,包括ResNet和最近开发的三种用于计算病理学的通用基础模型,从处理后的WSI切片中提取特征。构建了一个基于深度学习的CPH模型,使用这些编码的WSI特征来预测生存结果。对于每个感兴趣的生存结果,我们对所有五个模型预测的风险分数进行加权和组合,以生成最终预测。模型加权基于训练性能。进行了五折交叉验证以训练和测试所提出的工作流程。
我们分别采用一致性指数(C-index)和受试者操作特征曲线下面积(AUROC)指标来评估我们的模型在事件发生时间预测和特定时间二元预测方面的性能。在子模型中,基于临床特征的CPH模型在两个预测任务中权重最高。对于基于WSI的预测,使用基于图像的通用基础模型(UNI)编码的特征在其他预训练特征编码器中表现出最佳预测性能。我们的最终模型在所有预测标签上均优于相应的单模态模型,OS和DFS的C-index分别达到0.820和0.833。在3年随访时患者死亡和癌症复发的二元预测的AUROC值分别为0.831和0.862。使用预测风险的中位数作为阈值来识别高危和低危患者组,我们进行了对数秩检验,结果显示与单模态模型相比,OS和DFS的性能均有所提高。
我们为ccRCC患者开发了首个整合五种不同数据模态特征的多模态预测模型MMEM。与相应的单模态模型相比,我们的模型在两个预测目标上均表现出更好的预后能力。如果研究结果能够独立重现,则有可能辅助ccRCC患者的管理。