Computer and Information Engineering Department, Tianjin Chengjian University, Tianjin, China.
J Appl Clin Med Phys. 2024 Nov;25(11):e14527. doi: 10.1002/acm2.14527. Epub 2024 Sep 16.
Accurate segmentation of brain tumors from multimodal magnetic resonance imaging (MRI) holds significant importance in clinical diagnosis and surgical intervention, while current deep learning methods cope with situations of multimodal MRI by an early fusion strategy that implicitly assumes that the modal relationships are linear, which tends to ignore the complementary information between modalities, negatively impacting the model's performance. Meanwhile, long-range relationships between voxels cannot be captured due to the localized character of the convolution procedure.
Aiming at this problem, we propose a multimodal segmentation network based on a late fusion strategy that employs multiple encoders and a decoder for the segmentation of brain tumors. Each encoder is specialized for processing distinct modalities. Notably, our framework includes a feature fusion module based on a 3D discrete wavelet transform aimed at extracting complementary features among the encoders. Additionally, a 3D global context-aware module was introduced to capture the long-range dependencies of tumor voxels at a high level of features. The decoder combines fused and global features to enhance the network's segmentation performance.
Our proposed model is experimented on the publicly available BraTS2018 and BraTS2021 datasets. The experimental results show competitiveness with state-of-the-art methods.
The results demonstrate that our approach applies a novel concept for multimodal fusion within deep neural networks and delivers more accurate and promising brain tumor segmentation, with the potential to assist physicians in diagnosis.
从多模态磁共振成像(MRI)中准确分割脑肿瘤在临床诊断和手术干预中具有重要意义,而当前的深度学习方法通过早期融合策略来应对多模态 MRI 情况,该策略隐含地假设模态关系是线性的,这往往忽略了模态之间的互补信息,从而影响模型的性能。同时,由于卷积过程的局部性质,无法捕捉体素之间的长程关系。
针对这一问题,我们提出了一种基于晚期融合策略的多模态分割网络,该网络使用多个编码器和解码器对脑肿瘤进行分割。每个编码器专门用于处理不同的模态。值得注意的是,我们的框架包括一个基于 3D 离散小波变换的特征融合模块,旨在从编码器中提取互补特征。此外,引入了一个 3D 全局上下文感知模块,以在高级特征中捕获肿瘤体素的长程依赖性。解码器将融合和全局特征相结合,以提高网络的分割性能。
我们的模型在公开的 BraTS2018 和 BraTS2021 数据集上进行了实验。实验结果表明,该模型与最先进的方法具有竞争力。
结果表明,我们的方法在深度神经网络中应用了一种新的多模态融合概念,实现了更准确和有前途的脑肿瘤分割,有望辅助医生进行诊断。