Gao Ai, Yang Guang
School of Information Engineering, Institute of Disaster Prevention, Sanhe, China.
PeerJ Comput Sci. 2024 May 2;10:e2006. doi: 10.7717/peerj-cs.2006. eCollection 2024.
Automatic building extraction from very high-resolution remote sensing images is of great significance in several application domains, such as emergency information analysis and intelligent city construction. In recent years, with the development of deep learning technology, convolutional neural networks (CNNs) have made considerable progress in improving the accuracy of building extraction from remote sensing imagery. However, most existing methods require numerous parameters and large amounts of computing and storage resources. This affects their efficiency and limits their practical application. In this study, to balance the accuracy and amount of computation required for building extraction, a novel efficient lightweight residual network (ELRNet) with an encoder-decoder structure is proposed for building extraction. ELRNet consists of a series of downsampling blocks and lightweight feature extraction modules (LFEMs) for the encoder and an appropriate combination of LFEMs and upsampling blocks for the decoder. The key to the proposed ELRNet is the LFEM which has depthwise-factorised convolution incorporated in its design. In addition, the effective channel attention (ECA) added to LFEM, performs local cross-channel interactions, thereby fully extracting the relevant information between channels. The performance of ELRNet was evaluated on the public WHU Building dataset, achieving 88.24% IoU with 2.92 GFLOPs and 0.23 million parameters. The proposed ELRNet was compared with six state-of-the-art baseline networks (SegNet, U-Net, ENet, EDANet, ESFNet, and ERFNet). The results show that ELRNet offers a better tradeoff between accuracy and efficiency in the automatic extraction of buildings in very highresolution remote sensing images. This code is publicly available on GitHub (https://github.com/GaoAi/ELRNet).
从超高分辨率遥感影像中自动提取建筑物在多个应用领域具有重要意义,如应急信息分析和智慧城市建设。近年来,随着深度学习技术的发展,卷积神经网络(CNN)在提高从遥感影像中提取建筑物的准确性方面取得了显著进展。然而,大多数现有方法需要大量参数以及大量计算和存储资源。这影响了它们的效率并限制了其实际应用。在本研究中,为了平衡建筑物提取所需的准确性和计算量,提出了一种具有编码器 - 解码器结构的新型高效轻量级残差网络(ELRNet)用于建筑物提取。ELRNet由一系列用于编码器的下采样块和轻量级特征提取模块(LFEM)以及用于解码器的LFEM和上采样块的适当组合组成。所提出的ELRNet的关键在于其设计中融入了深度可分离卷积的LFEM。此外,添加到LFEM的有效通道注意力(ECA)执行局部跨通道交互,从而充分提取通道之间的相关信息。ELRNet在公共WHU建筑物数据集上进行了评估,以2.92 GFLOPs和23万个参数实现了88.24%的交并比(IoU)。将所提出的ELRNet与六个最先进的基线网络(SegNet、U-Net、ENet、EDANet、ESFNet和ERFNet)进行了比较。结果表明,在超高分辨率遥感影像中建筑物的自动提取方面,ELRNet在准确性和效率之间提供了更好的权衡。此代码可在GitHub(https://github.com/GaoAi/ELRNet)上公开获取。