School of Information Science and Technology, Northeast Normal University, Changchun 130117, China.
Sensors (Basel). 2021 Mar 9;21(5):1919. doi: 10.3390/s21051919.
With the aim to solve issues of robot object recognition in complex scenes, this paper proposes an object recognition method based on scene text reading. The proposed method simulates human-like behavior and accurately identifies objects with texts through careful reading. First, deep learning models with high accuracy are adopted to detect and recognize text in multi-view. Second, datasets including 102,000 Chinese and English scene text images and their inverse are generated. The F-measure of text detection is improved by 0.4% and the recognition accuracy is improved by 1.26% because the model is trained by these two datasets. Finally, a robot object recognition method is proposed based on the scene text reading. The robot detects and recognizes texts in the image and then stores the recognition results in a text file. When the user gives the robot a fetching instruction, the robot searches for corresponding keywords from the text files and achieves the confidence of multiple objects in the scene image. Then, the object with the maximum confidence is selected as the target. The results show that the robot can accurately distinguish objects with arbitrary shape and category, and it can effectively solve the problem of object recognition in home environments.
为了解决机器人在复杂场景下的物体识别问题,本文提出了一种基于场景文本阅读的物体识别方法。该方法通过仔细阅读,模拟人类行为,准确识别带有文本的物体。首先,采用高精度的深度学习模型,从多视角检测和识别文本。其次,生成包含 102000 个中英文场景文本图像及其反转的数据集。通过这两个数据集进行训练,文本检测的 F 值提高了 0.4%,识别准确率提高了 1.26%。最后,提出了一种基于场景文本阅读的机器人物体识别方法。机器人在图像中检测和识别文本,然后将识别结果存储在文本文件中。当用户给机器人一个取物指令时,机器人从文本文件中搜索相应的关键字,实现对场景图像中多个物体的置信度。然后,选择置信度最高的物体作为目标。实验结果表明,机器人可以准确区分任意形状和类别的物体,有效解决了家庭环境中的物体识别问题。