Dang Thai-Viet, Tran Dinh-Manh-Cuong, Tan Phan Xuan
Department of Mechatronics, School of Mechanical Engineering, Hanoi University of Science and Technology, Hanoi 10000, Vietnam.
Graduate School of Engineering and Science, Shibaura Institute of Technology, Toyosu, Koto-ku, Tokyo 135-8548, Japan.
Sensors (Basel). 2023 Aug 3;23(15):6907. doi: 10.3390/s23156907.
Computer vision plays a significant role in mobile robot navigation due to the wealth of information extracted from digital images. Mobile robots localize and move to the intended destination based on the captured images. Due to the complexity of the environment, obstacle avoidance still requires a complex sensor system with a high computational efficiency requirement. This study offers a real-time solution to the problem of extracting corridor scenes from a single image using a lightweight semantic segmentation model integrating with the quantization technique to reduce the numerous training parameters and computational costs. The proposed model consists of an FCN as the decoder and MobilenetV2 as the decoder (with multi-scale fusion). This combination allows us to significantly minimize computation time while achieving high precision. Moreover, in this study, we also propose to use the Balance Cross-Entropy loss function to handle diverse datasets, especially those with class imbalances and to integrate a number of techniques, for example, the Adam optimizer and Gaussian filters, to enhance segmentation performance. The results demonstrate that our model can outperform baselines across different datasets. Moreover, when being applied to practical experiments with a real mobile robot, the proposed model's performance is still consistent, supporting the optimal path planning, allowing the mobile robot to efficiently and effectively avoid the obstacles.
由于从数字图像中提取的信息丰富,计算机视觉在移动机器人导航中发挥着重要作用。移动机器人根据捕获的图像进行定位并移动到预定目的地。由于环境的复杂性,避障仍然需要一个具有高计算效率要求的复杂传感器系统。本研究提供了一种实时解决方案,即使用轻量级语义分割模型与量化技术相结合,从单幅图像中提取走廊场景,以减少大量训练参数和计算成本。所提出的模型由一个全卷积网络(FCN)作为编码器和MobileNetV2作为解码器(具有多尺度融合)组成。这种组合使我们能够在实现高精度的同时显著减少计算时间。此外,在本研究中,我们还建议使用平衡交叉熵损失函数来处理不同的数据集,特别是那些具有类别不平衡的数据集,并集成一些技术,例如Adam优化器和高斯滤波器,以提高分割性能。结果表明,我们的模型在不同数据集上的表现优于基线。此外,当应用于实际移动机器人的实验时,所提出模型的性能仍然稳定,支持最优路径规划,使移动机器人能够高效地避开障碍物。