Zhou Haoyang, Li Haojiang, Chen Shuchao, Yang Shixin, Ruan Guangying, Liu Lizhi, Chen Hongbo
School of Life & Environmental Science, Guangxi Colleges and Universities Key Laboratory of Biomedical Sensors and Intelligent Instruments, Guilin University of Electronic Technology, Guilin, Guangxi, China.
School of Electronic Information and Automation, Guilin University of Aerospace Technology, Guilin, Guangxi, China.
Front Hum Neurosci. 2023 Jan 10;16:1068713. doi: 10.3389/fnhum.2022.1068713. eCollection 2022.
Automatically and accurately delineating the primary nasopharyngeal carcinoma (NPC) tumors in head magnetic resonance imaging (MRI) images is crucial for patient staging and radiotherapy. Inspired by the bilateral symmetry of head and complementary information of different modalities, a multi-modal neural network named BSMM-Net is proposed for NPC segmentation.
First, a bilaterally symmetrical patch block (BSP) is used to crop the image and the bilaterally flipped image into patches. BSP can improve the precision of locating NPC lesions and is a simulation of radiologist locating the tumors with the bilateral difference of head in clinical practice. Second, modality-specific and multi-modal fusion features (MSMFFs) are extracted by the proposed MSMFF encoder to fully utilize the complementary information of T1- and T2-weighted MRI. The MSMFFs are then fed into the base decoder to aggregate representative features and precisely delineate the NPC. MSMFF is the output of MSMFF encoder blocks, which consist of six modality-specific networks and one multi-modal fusion network. Except T1 and T2, the other four modalities are generated from T1 and T2 by the BSP and DT modal generate block. Third, the MSMFF decoder with similar structure to the MSMFF encoder is deployed to supervise the encoder during training and assure the validity of the MSMFF from the encoder. Finally, experiments are conducted on the dataset of 7633 samples collected from 745 patients.
The global DICE, precision, recall and IoU of the testing set are 0.82, 0.82, 0.86, and 0.72, respectively. The results show that the proposed model is better than the other state-of-the-art methods for NPC segmentation. In clinical diagnosis, the BSMM-Net can give precise delineation of NPC, which can be used to schedule the radiotherapy.
在头部磁共振成像(MRI)图像中自动准确地勾勒出原发性鼻咽癌(NPC)肿瘤对于患者分期和放射治疗至关重要。受头部双侧对称性和不同模态互补信息的启发,提出了一种名为BSMM-Net的多模态神经网络用于NPC分割。
首先,使用双侧对称补丁块(BSP)将图像和双侧翻转图像裁剪成补丁。BSP可以提高定位NPC病变的精度,是模拟放射科医生在临床实践中利用头部双侧差异定位肿瘤的方法。其次,通过提出的MSMFF编码器提取特定模态和多模态融合特征(MSMFFs),以充分利用T1加权和T2加权MRI的互补信息。然后将MSMFFs输入到基础解码器中,以聚合代表性特征并精确勾勒出NPC。MSMFF是MSMFF编码器块的输出,该编码器块由六个特定模态网络和一个多模态融合网络组成。除了T1和T2之外,其他四个模态由BSP和DT模态生成块从T1和T2生成。第三,部署结构与MSMFF编码器相似的MSMFF解码器,在训练期间监督编码器,并确保来自编码器的MSMFF的有效性。最后,在从745名患者收集的7633个样本的数据集上进行实验。
测试集的全局DICE、精度、召回率和IoU分别为0.82、0.82、0.86和0.72。结果表明,所提出的模型在NPC分割方面优于其他现有方法。在临床诊断中,BSMM-Net可以对NPC进行精确勾勒,可用于安排放射治疗。