• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Mask R-CNN在无人机遥感影像建筑物检测中的应用。

Application of mask R-CNN for building detection in UAV remote sensing images.

作者信息

Hou Tao, Li Jing

机构信息

Wuhan University of Science and Technology, Wuhan, Hubei, 430000, China.

出版信息

Heliyon. 2024 Sep 19;10(19):e38141. doi: 10.1016/j.heliyon.2024.e38141. eCollection 2024 Oct 15.

DOI:10.1016/j.heliyon.2024.e38141
PMID:39397997
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11467628/
Abstract

This study aims to tackle the challenges of low accuracy in building feature extraction and insufficient details in three-dimensional (3D) modeling faced by traditional methods, particularly in complex backgrounds. To address these issues, a method for building feature extraction based on Mask Region-Convolutional Neural Network (Mask R-CNN) is proposed. This approach combines deep learning techniques with aerial images to ensure precise and efficient automatic detection and feature extraction. Urban building images are captured through aerial photography, and building outlines are annotated to create a comprehensive dataset of building features. The Mask R-CNN-based method efficiently processes and classifies the features of the dataset, generating candidate regions for further analysis. Additionally, this method demonstrates significant advantages in building feature extraction by employing the Mask R-CNN model to generate adaptive features. Comparative analysis with models such as Convolutional Neural Network (CNN), Region-based Convolutional Neural Network (R-CNN), Fast Region-based Convolutional Neural Network (Fast R-CNN), Faster Region-based Convolutional Neural Network (Faster R-CNN), and Generative Adversarial Network (GAN) indicates that Mask R-CNN exhibits superior performance in building feature extraction. The Mask R-CNN-based approach achieved approximately 95 % classification accuracy, while also showcasing strong stability and generalization capabilities. This study provides new methodologies and insights for enhancing feature extraction in aerial building imagery, offering significant reference value for the fields of architectural design and urban planning.

摘要

本研究旨在应对传统方法在建筑物特征提取精度低以及三维(3D)建模细节不足方面所面临的挑战,尤其是在复杂背景下。为解决这些问题,提出了一种基于掩码区域卷积神经网络(Mask R-CNN)的建筑物特征提取方法。该方法将深度学习技术与航空影像相结合,以确保精确且高效的自动检测和特征提取。通过航空摄影获取城市建筑物图像,并对建筑物轮廓进行标注,以创建一个全面的建筑物特征数据集。基于Mask R-CNN的方法有效地处理和分类数据集的特征,生成候选区域以供进一步分析。此外,该方法通过使用Mask R-CNN模型生成自适应特征,在建筑物特征提取方面展现出显著优势。与卷积神经网络(CNN)、基于区域的卷积神经网络(R-CNN)、快速基于区域的卷积神经网络(Fast R-CNN)、更快基于区域的卷积神经网络(Faster R-CNN)和生成对抗网络(GAN)等模型的对比分析表明,Mask R-CNN在建筑物特征提取方面表现出卓越性能。基于Mask R-CNN的方法实现了约95%的分类准确率,同时还展现出强大的稳定性和泛化能力。本研究为增强航空建筑图像中的特征提取提供了新的方法和见解,为建筑设计和城市规划领域提供了重要的参考价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/414fa894f133/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/73b4f03330ee/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/abc9af12f396/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/8819d09e4e8f/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/b883c9610d40/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/e403a56687e2/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/89e3f023d4ac/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/bd2a0776734f/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/804bb1ebc876/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/43e19b637fef/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/414fa894f133/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/73b4f03330ee/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/abc9af12f396/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/8819d09e4e8f/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/b883c9610d40/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/e403a56687e2/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/89e3f023d4ac/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/bd2a0776734f/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/804bb1ebc876/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/43e19b637fef/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/245c/11467628/414fa894f133/gr10.jpg

相似文献

1
Application of mask R-CNN for building detection in UAV remote sensing images.Mask R-CNN在无人机遥感影像建筑物检测中的应用。
Heliyon. 2024 Sep 19;10(19):e38141. doi: 10.1016/j.heliyon.2024.e38141. eCollection 2024 Oct 15.
2
An Efficient Building Extraction Method from High Spatial Resolution Remote Sensing Images Based on Improved Mask R-CNN.基于改进的 Mask R-CNN 的高空间分辨率遥感图像高效建筑物提取方法。
Sensors (Basel). 2020 Mar 6;20(5):1465. doi: 10.3390/s20051465.
3
Enhance the Accuracy of Landslide Detection in UAV Images Using an Improved Mask R-CNN Model: A Case Study of Sanming, China.利用改进的 Mask R-CNN 模型提高无人机图像中滑坡检测的精度:以中国三明市为例。
Sensors (Basel). 2023 Apr 26;23(9):4287. doi: 10.3390/s23094287.
4
Improved brain metastases segmentation using generative adversarial network and conditional random field optimization mask R-CNN.利用生成对抗网络和条件随机场优化掩模 R-CNN 提高脑转移瘤分割效果。
Med Phys. 2024 Sep;51(9):5990-6001. doi: 10.1002/mp.17176. Epub 2024 May 22.
5
Mask-Refined R-CNN: A Network for Refining Object Details in Instance Segmentation.Mask-Refined R-CNN:用于实例分割中细化对象细节的网络。
Sensors (Basel). 2020 Feb 13;20(4):1010. doi: 10.3390/s20041010.
6
Automatic Building Extraction from Google Earth Images under Complex Backgrounds Based on Deep Instance Segmentation Network.基于深度实例分割网络的复杂背景下谷歌地球影像建筑物自动提取
Sensors (Basel). 2019 Jan 15;19(2):333. doi: 10.3390/s19020333.
7
EEG-Based Feature Classification Combining 3D-Convolutional Neural Networks with Generative Adversarial Networks for Motor Imagery.基于脑电的特征分类,结合三维卷积神经网络和生成对抗网络,用于运动想象。
J Integr Neurosci. 2024 Aug 20;23(8):153. doi: 10.31083/j.jin2308153.
8
Image segmentation for pest detection of crop leaves by improvement of regional convolutional neural network.基于区域卷积神经网络改进的作物叶片虫害检测图像分割
Sci Rep. 2024 Oct 15;14(1):24160. doi: 10.1038/s41598-024-75391-4.
9
Using Deep Learning and Low-Cost RGB and Thermal Cameras to Detect Pedestrians in Aerial Images Captured by Multirotor UAV.利用深度学习以及低成本的 RGB 和热成像摄像机,检测多旋翼无人机航拍图像中的行人。
Sensors (Basel). 2018 Jul 12;18(7):2244. doi: 10.3390/s18072244.
10
Deep Learning Approaches on Defect Detection in High Resolution Aerial Images of Insulators.深度学习方法在绝缘子高分辨率航拍图像缺陷检测中的应用。
Sensors (Basel). 2021 Feb 3;21(4):1033. doi: 10.3390/s21041033.

引用本文的文献

1
A study on the non-contact measurement of sunflower disk inclination and its application to accurate phenotypic analysis.向日葵花盘倾斜度的非接触测量及其在精确表型分析中的应用研究
Front Plant Sci. 2025 Aug 6;16:1614898. doi: 10.3389/fpls.2025.1614898. eCollection 2025.
2
A Transfer Learning-Based VGG-16 Model for COD Detection in UV-Vis Spectroscopy.一种基于迁移学习的用于紫外-可见光谱中化学需氧量检测的VGG-16模型。
J Imaging. 2025 May 17;11(5):159. doi: 10.3390/jimaging11050159.

本文引用的文献

1
A deep learning method for optimizing semantic segmentation accuracy of remote sensing images based on improved UNet.基于改进型 UNet 的遥感图像语义分割精度优化的深度学习方法。
Sci Rep. 2023 May 10;13(1):7600. doi: 10.1038/s41598-023-34379-2.
2
A survey on deep learning applied to medical images: from simple artificial neural networks to generative models.关于深度学习应用于医学图像的综述:从简单人工神经网络到生成模型
Neural Comput Appl. 2023;35(3):2291-2323. doi: 10.1007/s00521-022-07953-4. Epub 2022 Nov 4.
3
Tools, techniques, datasets and application areas for object detection in an image: a review.
图像中目标检测的工具、技术、数据集及应用领域综述
Multimed Tools Appl. 2022;81(27):38297-38351. doi: 10.1007/s11042-022-13153-y. Epub 2022 Apr 23.
4
Tuning of data augmentation hyperparameters in deep learning to building construction image classification with small datasets.深度学习中数据增强超参数的调整用于小数据集的建筑施工图像分类。
Int J Mach Learn Cybern. 2023;14(1):171-186. doi: 10.1007/s13042-022-01555-1. Epub 2022 Apr 13.