The First Construction Engineering Limited Company of China Construction Third Engineering Bureau, Wuhan, P.R. China.
Sichuan University of Science & Engineering, Yibin, P.R. China.
PLoS One. 2023 Jan 26;18(1):e0271051. doi: 10.1371/journal.pone.0271051. eCollection 2023.
As a dense instance segmentation problem, rebar counting in a complex environment such as rebar yard and rebar transpotation has received significant attention in both academic and industrial contexts. Traditional counting approaches, such as manual counting and machine vision-based algorithms, are often inefficient or inaccurate since rebars with varied sizes and shapes are stacked overlapping, rebar image is not clear for complex light condition such as dawn, night and strong light, and other environmental noises exist in rebar image; thus, they no longer fulfil the requirements of modern automation. This paper proposes MaskID, an innovative counting method based on deep learning and heuristic strategies. First, an improved version of the Mask region-based convolutional neural network (Mask R-CNN) was designed to obtain the segmentation results through splitting and rescaling so as to capture more detail in a large-scale rebar image. Then, a series of intelligent denoising strategies corresponding to aspect ratio of recognized box, overlapping recognized objects, object distribution and environmental noise, were applied to improve the segmentation results. The performance of the proposed method was evaluated on open-competition and test-platform datasets. The F1-score was found to be over 0.99 on all datasets. The experimental results demonstrate that the proposed method is effective for dense rebar counting and significantly outperforms existing state-of-the-art methods.
作为一个密集的实例分割问题,在钢筋堆场和钢筋运输等复杂环境中进行钢筋计数在学术界和工业界都受到了广泛关注。传统的计数方法,如人工计数和基于机器视觉的算法,通常效率低下或不准确,因为不同尺寸和形状的钢筋是重叠堆放的,钢筋图像在黎明、夜晚和强光等复杂光线条件下不清楚,并且钢筋图像中存在其他环境噪声;因此,它们不再满足现代自动化的要求。本文提出了一种基于深度学习和启发式策略的创新计数方法 MaskID。首先,设计了一种改进的 Mask 区域卷积神经网络(Mask R-CNN),通过分割和重缩放来获得分割结果,以便在大规模钢筋图像中捕捉更多细节。然后,应用了一系列与识别框的纵横比、重叠识别对象、对象分布和环境噪声对应的智能去噪策略来改进分割结果。在公开竞赛数据集和测试平台数据集上评估了所提出方法的性能。在所有数据集上,F1 分数均超过 0.99。实验结果表明,所提出的方法对于密集的钢筋计数是有效的,并且明显优于现有的最先进方法。