Hachaj Tomasz
Institute of Computer Science, Pedagogical University of Krakow, 2 Podchorazych Ave, 30-084 Krakow, Poland.
Sensors (Basel). 2022 Sep 5;22(17):6703. doi: 10.3390/s22176703.
In this work, a new method is proposed that allows the use of a single RGB camera for the real-time detection of objects that could be potential collision sources for Unmanned Aerial Vehicles. For this purpose, a new network with an encoder-decoder architecture has been developed, which allows rapid distance estimation from a single image by performing RGB to depth mapping. Based on a comparison with other existing RGB to depth mapping methods, the proposed network achieved a satisfactory trade-off between complexity and accuracy. With only 6.3 million parameters, it achieved efficiency close to models with more than five times the number of parameters. This allows the proposed network to operate in real time. A special algorithm makes use of the distance predictions made by the network, compensating for measurement inaccuracies. The entire solution has been implemented and tested in practice in an indoor environment using a micro-drone equipped with a front-facing RGB camera. All data and source codes and pretrained network weights are available to download. Thus, one can easily reproduce the results, and the resulting solution can be tested and quickly deployed in practice.
在这项工作中,提出了一种新方法,该方法允许使用单个RGB相机实时检测可能成为无人机潜在碰撞源的物体。为此,开发了一种具有编码器-解码器架构的新网络,该网络通过执行RGB到深度映射,能够从单张图像快速估计距离。与其他现有的RGB到深度映射方法相比,所提出的网络在复杂度和准确性之间实现了令人满意的权衡。它只有630万个参数,却实现了接近参数数量为其五倍多的模型的效率。这使得所提出的网络能够实时运行。一种特殊算法利用网络做出的距离预测,补偿测量误差。整个解决方案已在室内环境中使用配备前置RGB相机的微型无人机进行了实际实施和测试。所有数据、源代码和预训练网络权重均可下载。因此,人们可以轻松重现结果,并且所得解决方案可以在实际中进行测试并快速部署。