Department of Structural Engineering, Tongji University, 1239 Siping Road, Shanghai 200092, China.
Sensors (Basel). 2023 May 17;23(10):4851. doi: 10.3390/s23104851.
Tower cranes can cover most of the area of a construction site, which brings significant safety risks, including potential collisions with other entities. To address these issues, it is necessary to obtain accurate and real-time information on the orientation and location of tower cranes and hooks. As a non-invasive sensing method, computer vision-based (CVB) technology is widely applied on construction sites for object detection and three-dimensional (3D) localization. However, most existing methods mainly address the localization on the construction ground plane or rely on specific viewpoints and positions. To address these issues, this study proposes a framework for the real-time recognition and localization of tower cranes and hooks using monocular far-field cameras. The framework consists of four steps: far-field camera autocalibration using feature matching and horizon-line detection, deep learning-based segmentation of tower cranes, geometric feature reconstruction of tower cranes, and 3D localization estimation. The pose estimation of tower cranes using monocular far-field cameras with arbitrary views is the main contribution of this paper. To evaluate the proposed framework, a series of comprehensive experiments were conducted on construction sites in different scenarios and compared with ground-truth data obtained by sensors. The experimental results show that the proposed framework achieves high precision in both crane jib orientation estimation and hook position estimation, thereby contributing to the development of safety management and productivity analysis.
塔吊可以覆盖建筑工地的大部分区域,这带来了重大的安全风险,包括与其他实体潜在的碰撞。为了解决这些问题,有必要获取塔吊和吊钩的方位和位置的准确和实时信息。作为一种非侵入式传感方法,基于计算机视觉的(CVB)技术在建筑工地中被广泛应用于目标检测和三维(3D)定位。然而,大多数现有的方法主要解决施工地面上的定位问题,或者依赖于特定的视角和位置。为了解决这些问题,本研究提出了一种使用单目远距摄像机实时识别和定位塔吊和吊钩的框架。该框架由四个步骤组成:使用特征匹配和地平线检测的远距摄像机自标定、基于深度学习的塔吊分割、塔吊几何特征重建和 3D 定位估计。本文的主要贡献是使用任意视角的单目远距摄像机进行塔吊的位姿估计。为了评估所提出的框架,在不同场景的建筑工地进行了一系列全面的实验,并与传感器获得的地面真实数据进行了比较。实验结果表明,该框架在塔吊臂架方向估计和吊钩位置估计方面都具有高精度,从而有助于安全管理和生产力分析的发展。