使用带注意力融合的掩码自动编码器和视觉变换器的多模态深度学习对头颈部癌症进行分期分类

Head and neck squamous cell carcinoma (HNSCC) is a prevalent and aggressive cancer, and accurate staging using the AJCC system is essential for treatment planning. This study aims to enhance AJCC staging by integrating both clinical and imaging data using a multimodal deep learning pipeline. We propose a framework that employs a VGG16-based masked autoencoder (MAE) for self-supervised visual feature learning, enhanced by attention mechanisms (CBAM and BAM), and fuses image and clinical features using an attention-weighted fusion network. The models, benchmarked on the HNSCC and HN1 datasets, achieved approximately 80% accuracy (four classes) and ~66% accuracy (five classes), with notable AUC improvements, especially under BAM. The integration of clinical features significantly enhances stage-classification performance, setting a precedent for robust multimodal pipelines in radiomics-based oncology applications.

头颈部鳞状细胞癌（HNSCC）是一种常见且侵袭性强的癌症，使用美国癌症联合委员会（AJCC）系统进行准确分期对于治疗规划至关重要。本研究旨在通过使用多模态深度学习管道整合临床和影像数据来改进AJCC分期。我们提出了一个框架，该框架采用基于VGG16的掩码自动编码器（MAE）进行自监督视觉特征学习，并通过注意力机制（CBAM和BAM）进行增强，同时使用注意力加权融合网络融合图像和临床特征。在HNSCC和HN1数据集上进行基准测试的模型，实现了约80%的准确率（四类）和约66%的准确率（五类），曲线下面积（AUC）有显著提高，尤其是在BAM下。临床特征的整合显著提高了分期分类性能，为基于放射组学的肿瘤学应用中强大的多模态管道树立了先例。

新学期，新优惠

Suppr 超能文献

新学期，新优惠

Suppr 超能文献

Multimodal Deep Learning for Stage Classification of Head and Neck Cancer Using Masked Autoencoders and Vision Transformers with Attention-Based Fusion.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

推荐工具