Zhang Qiliang, Hua Kaiwen, Zhang Zi, Zhao Yiwei, Chen Pengpeng
School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China.
Mine Digitization Engineering Research Center of the Ministry of Education, Xuzhou 221116, China.
Sensors (Basel). 2025 Aug 3;25(15):4776. doi: 10.3390/s25154776.
In intelligent vehicular networks, the accuracy of semantic segmentation in road scenes is crucial for vehicle-mounted artificial intelligence to achieve environmental perception, decision support, and safety control. Although deep learning methods have made significant progress, two main challenges remain: first, the difficulty in balancing global and local features leads to blurred object boundaries and misclassification; second, conventional convolutions have limited ability to perceive irregular objects, causing information loss and affecting segmentation accuracy. To address these issues, this paper proposes a global-local collaborative attention module and a spider web convolution module. The former enhances feature representation through bidirectional feature interaction and dynamic weight allocation, reducing false positives and missed detections. The latter introduces an asymmetric sampling topology and six-directional receptive field paths to effectively improve the recognition of irregular objects. Experiments on the Cityscapes, CamVid, and BDD100K datasets, collected using vehicle-mounted cameras, demonstrate that the proposed method performs excellently across multiple evaluation metrics, including mIoU, mRecall, mPrecision, and mAccuracy. Comparative experiments with classical segmentation networks, attention mechanisms, and convolution modules validate the effectiveness of the proposed approach. The proposed method demonstrates outstanding performance in sensor-based semantic segmentation tasks and is well-suited for environmental perception systems in autonomous driving.
在智能车载网络中,道路场景语义分割的准确性对于车载人工智能实现环境感知、决策支持和安全控制至关重要。尽管深度学习方法取得了显著进展,但仍存在两个主要挑战:第一,平衡全局和局部特征的困难导致物体边界模糊和分类错误;第二,传统卷积感知不规则物体的能力有限,导致信息丢失并影响分割精度。为了解决这些问题,本文提出了一种全局-局部协作注意力模块和一种蛛网卷积模块。前者通过双向特征交互和动态权重分配增强特征表示,减少误报和漏检。后者引入了非对称采样拓扑和六向感受野路径,有效地提高了对不规则物体的识别能力。使用车载摄像头收集的Cityscapes、CamVid和BDD100K数据集上的实验表明,该方法在包括mIoU、mRecall、mPrecision和mAccuracy在内的多个评估指标上表现出色。与经典分割网络、注意力机制和卷积模块的对比实验验证了所提方法的有效性。所提方法在基于传感器的语义分割任务中表现出色,非常适合自动驾驶中的环境感知系统。