利用MaskCNN和航空影像在OpenStreetMap中进行自动化路面分类。

Automated road surface classification in OpenStreetMap using MaskCNN and aerial imagery.

作者信息

Parvathi R, Pattabiraman V, Saxena Nancy, Mishra Aakarsh, Mishra Utkarsh, Pandey Ansh

机构信息

School of Computer Science and Engineering, Vellore Institute of Technology - Chennai Campus, Chennai, Tamil Nadu, India.

出版信息

Front Big Data. 2025 Aug 13;8:1657320. doi: 10.3389/fdata.2025.1657320. eCollection 2025.

DOI:10.3389/fdata.2025.1657320

PMID:40881822

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12382388/

Abstract

INTRODUCTION

OpenStreetMap (OSM) road surface data is critical for navigation, infrastructure monitoring, and urban planning but is often incomplete or inconsistent. This study addresses the need for automated validation and classification of road surfaces by leveraging high-resolution aerial imagery and deep learning techniques.

METHODS

We propose a MaskCNN-based deep learning model enhanced with attention mechanisms and a hierarchical loss function to classify road surfaces into four types: asphalt, concrete, gravel, and dirt. The model uses NAIP (National Agriculture Imagery Program) aerial imagery aligned with OSM labels. Preprocessing includes georeferencing, data augmentation, label cleaning, and class balancing. The architecture comprises a ResNet-50 encoder with squeeze-and-excitation blocks and a U-Net-style decoder with spatial attention. Evaluation metrics include accuracy, mIoU, precision, recall, and F1-score.

RESULTS

The proposed model achieved an overall accuracy of 92.3% and a mean Intersection over Union (mIoU) of 83.7%, outperforming baseline models such as SVM (81.2% accuracy), Random Forest (83.7%), and standard U-Net (89.6%). Class-wise performance showed high precision and recall even for challenging surface types like gravel and dirt. Comparative evaluations against state-of-the-art models (COANet, SA-UNet, MMFFNet) also confirmed superior performance.

DISCUSSION

The results demonstrate that combining NAIP imagery with attention-guided CNN architectures and hierarchical loss functions significantly improves road surface classification. The model is robust across varied terrains and visual conditions and shows potential for real-world applications such as OSM data enhancement, infrastructure analysis, and autonomous navigation. Limitations include label noise in OSM and class imbalance, which can be addressed through future work involving semi-supervised learning and multimodal data integration.

摘要

引言

开放街道地图（OpenStreetMap，OSM）的路面数据对于导航、基础设施监测和城市规划至关重要，但往往不完整或不一致。本研究通过利用高分辨率航空影像和深度学习技术，满足了对路面进行自动验证和分类的需求。

方法

我们提出了一种基于MaskCNN的深度学习模型，该模型通过注意力机制和分层损失函数进行增强，以将路面分为四种类型：沥青、混凝土、砾石和土路。该模型使用与OSM标签对齐的国家农业影像计划（NAIP）航空影像。预处理包括地理配准、数据增强、标签清理和类别平衡。该架构包括一个带有挤压与激励模块的ResNet-50编码器和一个带有空间注意力的U-Net风格解码器。评估指标包括准确率、平均交并比（mIoU）、精确率、召回率和F1分数。

结果

所提出的模型实现了92.3%的总体准确率和83.7%的平均交并比，优于支持向量机（SVM，准确率81.2%）、随机森林（83.7%）和标准U-Net（89.6%）等基线模型。即使对于砾石和土路等具有挑战性的路面类型，按类别划分的性能也显示出高精度和召回率。与现有最先进模型（COANet、SA-UNet、MMFFNet）的比较评估也证实了其卓越性能。

讨论

结果表明，将NAIP影像与注意力引导的CNN架构和分层损失函数相结合，可显著提高路面分类效果。该模型在各种地形和视觉条件下都具有鲁棒性，并显示出在诸如OSM数据增强、基础设施分析和自主导航等实际应用中的潜力。局限性包括OSM中的标签噪声和类别不平衡，可通过未来涉及半监督学习和多模态数据集成的工作来解决。