Zeng Yue, Meng Cai
Image Processing Center, School of Astronautics, Beihang University, Beijing 100191, China.
Sensors (Basel). 2024 Sep 24;24(19):6183. doi: 10.3390/s24196183.
Automatic detection and recognition of wheel hub text, which can boost the efficiency and accuracy of product information recording, are undermined by the obscurity and orientation variability of text on wheel hubs. To address these issues, this paper constructs a wheel hub text dataset and proposes a wheel hub text detection and recognition model called HubNet. The dataset captured images on real industrial production line scenes, including 446 images, 934 word instances, and 2947 character instances. HubNet is an end-to-end text detection and recognition model, not only comprising conventional detection and recognition heads but also incorporating a feature cross-fusion module, which improves the accuracy of recognizing wheel hub texts by utilizing both global and local features. Experimental results show that on the wheel hub text dataset, the HubNet achieves an accuracy of 86.5%, a recall of 79.4%, and an F1-score of 0.828, and the feature cross-fusion module increases the accuracy by 2% to 4%. The wheel hub dataset and the HubNet offer a significant reference for automatic detection and recognition of wheel hub text.
轮毂文本的自动检测与识别能够提高产品信息记录的效率和准确性,但轮毂上文本的模糊性和方向多变性对其产生了不利影响。为了解决这些问题,本文构建了一个轮毂文本数据集,并提出了一种名为HubNet的轮毂文本检测与识别模型。该数据集采集了真实工业生产线场景下的图像,包括446张图像、934个单词实例和2947个字符实例。HubNet是一个端到端的文本检测与识别模型,不仅包含传统的检测和识别头,还集成了一个特征交叉融合模块,该模块通过利用全局和局部特征提高了轮毂文本识别的准确性。实验结果表明,在轮毂文本数据集上,HubNet的准确率达到86.5%,召回率为79.4%,F1分数为0.828,并且特征交叉融合模块使准确率提高了2%至4%。轮毂数据集和HubNet为轮毂文本的自动检测与识别提供了重要参考。