Chen Chuanjun, Liu Junjie, Yin Haonan, Huang Biqing
Department of Automation, Tsinghua University, Beijing 100084, China.
BZS (Beijing) Technology Development Co., Ltd., No.1 Jiaochangkou, Deshengmenwai, Beijing 100120, China.
Sensors (Basel). 2025 Apr 21;25(8):2623. doi: 10.3390/s25082623.
Automated storage and retrieval systems (AS/RS) play a crucial role in modern logistics, yet effectively monitoring cargo stacking patterns remains challenging. While computer vision and deep learning offer promising solutions, existing methods struggle to balance detection accuracy, computational efficiency, and environmental adaptability. This paper proposes a novel machine vision-based detection algorithm that integrates a pallet surface object detection network (STEGNet) with a box edge detection algorithm. STEGNet's core innovation is the Efficient Gated Pyramid Feature Network (EG-FPN), which integrates a Gated Feature Fusion module and a Lightweight Attention Mechanism to optimize feature extraction and fusion. In addition, we introduce a geometric constraint method for box edge detection and employ a Perspective-n-Point (PnP)-based 2D-to-3D transformation approach for precise pose estimation. Experimental results show that STEGNet achieves 93.49% mAP on our proposed GY Warehouse Box View 4-Dimension (GY-WSBW-4D) dataset and 83.2% mAP on the WSGID-B dataset, surpassing existing benchmarks. The lightweight variant maintains competitive accuracy while reducing the model size by 34% and increasing the inference speed by 68%. In practical applications, the system achieves pose estimation with a Mean Absolute Error within 4 cm and a Rotation Angle Error below 2°, demonstrating robust performance in complex warehouse environments. This research provides a reliable solution for automated cargo stack monitoring in modern logistics systems.
自动存储和检索系统(AS/RS)在现代物流中起着至关重要的作用,但有效地监测货物堆叠模式仍然具有挑战性。虽然计算机视觉和深度学习提供了有前景的解决方案,但现有方法难以在检测精度、计算效率和环境适应性之间取得平衡。本文提出了一种基于机器视觉的新型检测算法,该算法将托盘表面物体检测网络(STEGNet)与箱边检测算法相结合。STEGNet的核心创新在于高效门控金字塔特征网络(EG-FPN),它集成了门控特征融合模块和轻量级注意力机制,以优化特征提取和融合。此外,我们引入了一种用于箱边检测的几何约束方法,并采用基于透视n点(PnP)的二维到三维变换方法进行精确的姿态估计。实验结果表明,STEGNet在我们提出的GY仓库箱视图四维(GY-WSBW-4D)数据集上实现了93.49%的平均精度均值(mAP),在WSGID-B数据集上实现了83.2%的mAP,超过了现有基准。轻量级变体在保持竞争力的精度的同时,将模型大小减少了34%,推理速度提高了68%。在实际应用中,该系统实现了平均绝对误差在4厘米以内、旋转角度误差低于2°的姿态估计,在复杂的仓库环境中表现出强大的性能。这项研究为现代物流系统中的自动货物堆叠监测提供了可靠的解决方案。