Zhong Y, Che W, Gao S
Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China.
Yunnan Key Laboratory of Computer Technology Applications, Kunming University of Science and Technology, Kunming 650500, China.
Nan Fang Yi Ke Da Xue Xue Bao. 2022 Nov 20;42(11):1662-1671. doi: 10.12122/j.issn.1673-4254.2022.11.10.
To propose a deep learning target detection model AM- YOLO that integrates coordinate attention and efficient attention mechanism.
Mosaic image enhancement and MixUp mixed-class enhancement were used for image preprocessing. In the target detection model YOLOv5s with One-Stage structure and modified backbone network and neck network, the maximum pooling layer of the spatial pyramid of the backbone network was replaced with a two-dimensional maximum pooling layer, and the coordinate attention mechanism and the efficient channel attention mechanism were integrated into the C3 module and the backbone network of the model, respectively. The improved model was compared with the unmodified YOLOv5s model, YOLOv3 model, YOLOv3-SPP model, and YOLOv3-tiny model for relevant algorithmic indicators in comparative experiments.
The AM-YOLO model incorporating coordinate attention and efficient channel attention mechanism effectively improved the accuracy of melanoma recognition with also a reduced size of the model weight. This model showed significantly better performance than other models in terms of precision, recall rate and mean average precision, and its mean average precision for benign and malignant melanoma reached 92.8% and 87.1%, respectively.
The deep learning-based target object detection algorithm model can be applied in recognition of melanoma targets.
提出一种融合坐标注意力和高效注意力机制的深度学习目标检测模型AM - YOLO。
采用马赛克图像增强和MixUp混合类增强进行图像预处理。在具有单阶段结构且修改了主干网络和颈部网络的目标检测模型YOLOv5s中,将主干网络空间金字塔的最大池化层替换为二维最大池化层,并将坐标注意力机制和高效通道注意力机制分别集成到模型的C3模块和主干网络中。在对比实验中,将改进后的模型与未修改的YOLOv5s模型、YOLOv3模型、YOLOv3 - SPP模型和YOLOv3 - tiny模型进行相关算法指标的比较。
融合坐标注意力和高效通道注意力机制的AM - YOLO模型有效提高了黑色素瘤识别的准确率,同时模型权重大小减小。该模型在精确率、召回率和平均精度均值方面表现明显优于其他模型,其对良性和恶性黑色素瘤的平均精度均值分别达到92.8%和87.1%。
基于深度学习的目标物体检测算法模型可应用于黑色素瘤目标的识别。