School of Computer Science and Engineering, North Minzu University, Yinchuan, 750021, China; Key Laboratory of Image and Graphics Intelligent Processing of State Ethnic Affairs Commission, North Minzu University, Yinchuan, 750021, China.
School of Computer Science and Engineering, North Minzu University, Yinchuan, 750021, China; Key Laboratory of Image and Graphics Intelligent Processing of State Ethnic Affairs Commission, North Minzu University, Yinchuan, 750021, China.
Comput Biol Med. 2024 May;173:108291. doi: 10.1016/j.compbiomed.2024.108291. Epub 2024 Mar 20.
It is very important to detect mandibular fracture region. However, the size of mandibular fracture region is different due to different anatomical positions, different sites and different degrees of force. It is difficult to locate and recognize fracture region accurately.
To solve these problems, MYOLOv5 model is proposed in this paper. Three feature enhancement strategies are designed, which improve the ability of model to locate and recognize mandibular fracture region. Firstly, Global-Local Feature Extraction Module (GLFEM) is designed. By effectively combining Convolutional Neural Network (CNN) and Transformer, the problem of insufficient global information extraction ability of CNN is complemented, and the positioning ability of the model to the fracture region is improved. Secondly, in order to improve the interaction ability of context information, Deep-Shallow Feature Interaction Module (DSFIM) is designed. In this module, the spatial information in the shallow feature layer is embedded to the deep feature layer by the spatial attention mechanism, and the semantic information in the deep feature layer is embedded to the shallow feature layer by the channel attention mechanism. The fracture region recognition ability of the model is improved. Finally, Multi-scale Multi receptive-field Feature Mixing Module (MMFMM) is designed. Deep separate convolution chains are used in this modal, which is composed by multiple layers of different scales and different dilation coefficients. This method provides richer receptive field for the model, and the ability to detect fracture region of different scales is improved.
The precision rate, mAP value, recall rate and F1 value of MYOLOv5 model on mandibular fracture CT data set are 97.18%, 96.86%, 94.42% and 95.58% respectively. The experimental results show that there is better performance about MYOLOv5 model than the mainstream detection models.
The MYOLOv5 model can effectively recognize and locate the mandibular fracture region, which is of great significance for doctors' clinical diagnosis.
准确检测下颌骨骨折区域非常重要。然而,由于解剖位置、骨折部位和受力程度的不同,下颌骨骨折区域的大小也不同,难以准确定位和识别骨折区域。
为了解决这些问题,本文提出了 MYOLOv5 模型。设计了三种特征增强策略,提高了模型对下颌骨骨折区域的定位和识别能力。首先,设计了全局-局部特征提取模块(GLFEM)。通过有效结合卷积神经网络(CNN)和 Transformer,弥补了 CNN 全局信息提取能力不足的问题,提高了模型对骨折区域的定位能力。其次,为了提高上下文信息的交互能力,设计了深度-浅层特征交互模块(DSFIM)。在该模块中,通过空间注意力机制将浅层特征层中的空间信息嵌入到深层特征层中,通过通道注意力机制将深层特征层中的语义信息嵌入到浅层特征层中,提高了模型对骨折区域的识别能力。最后,设计了多尺度多感受野特征混合模块(MMFMM)。该模型使用了多个深度分离卷积链,由多个不同尺度和不同扩张系数的层组成。这种方法为模型提供了更丰富的感受野,提高了检测不同尺度骨折区域的能力。
在下颌骨骨折 CT 数据集上,MYOLOv5 模型的精度、mAP 值、召回率和 F1 值分别为 97.18%、96.86%、94.42%和 95.58%。实验结果表明,与主流检测模型相比,MYOLOv5 模型具有更好的性能。
MYOLOv5 模型能够有效识别和定位下颌骨骨折区域,对医生的临床诊断具有重要意义。