• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于改进的StyleGAN2-ADA构建与增强农村道路实例分割数据集

Construction and Enhancement of a Rural Road Instance Segmentation Dataset Based on an Improved StyleGAN2-ADA.

作者信息

Yao Zhixin, Xi Renna, Zhang Taihong, Zhao Yunjie, Tian Yongqiang, Hou Wenjing

机构信息

College of Computer and Information Engineering, Xinjiang Agricultural University, Urumqi 830052, China.

Research Center for Intelligent Agriculture, Ministry of Education Engineering, Urumqi 830052, China.

出版信息

Sensors (Basel). 2025 Apr 15;25(8):2477. doi: 10.3390/s25082477.

DOI:10.3390/s25082477
PMID:40285169
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12030941/
Abstract

With the advancement of agricultural automation, the demand for road recognition and understanding in agricultural machinery autonomous driving systems has significantly increased. To address the scarcity of instance segmentation data for rural roads and rural unstructured scenes, particularly the lack of support for high-resolution and fine-grained classification, a 20-class instance segmentation dataset was constructed, comprising 10,062 independently annotated instances. An improved StyleGAN2-ADA data augmentation method was proposed to generate higher-quality image data. This method incorporates a decoupled mapping network (DMN) to reduce the coupling degree of latent codes in W-space and integrates the advantages of convolutional networks and transformers by designing a convolutional coupling transfer block (CCTB). The core cross-shaped window self-attention mechanism in the CCTB enhances the network's ability to capture complex contextual information and spatial layouts. Ablation experiments comparing the improved and original StyleGAN2-ADA networks demonstrate significant improvements, with the inception score (IS) increasing from 42.38 to 77.31 and the Fréchet inception distance (FID) decreasing from 25.09 to 12.42, indicating a notable enhancement in data generation quality and authenticity. In order to verify the effect of data enhancement on the model performance, the algorithms Mask R-CNN, SOLOv2, YOLOv8n, and OneFormer were tested to compare the performance difference between the original dataset and the enhanced dataset, which further confirms the effectiveness of the improved module.

摘要

随着农业自动化的发展,农业机械自动驾驶系统对道路识别与理解的需求显著增加。为解决农村道路和农村非结构化场景实例分割数据稀缺的问题,特别是缺乏对高分辨率和细粒度分类的支持,构建了一个包含20个类别的实例分割数据集,其中包含10062个独立标注的实例。提出了一种改进的StyleGAN2-ADA数据增强方法来生成更高质量的图像数据。该方法引入了解耦映射网络(DMN)以降低W空间中潜在代码的耦合度,并通过设计卷积耦合传输块(CCTB)融合了卷积网络和Transformer的优点。CCTB中的核心十字形窗口自注意力机制增强了网络捕捉复杂上下文信息和空间布局的能力。对比改进后的StyleGAN2-ADA网络与原始网络的消融实验表明有显著改进,其中初始得分(IS)从42.38提高到77.31,弗雷歇初始距离(FID)从25.09降低到12.42,表明数据生成质量和真实性有显著提高。为验证数据增强对模型性能的影响,对Mask R-CNN、SOLOv2、YOLOv8n和OneFormer算法进行了测试,以比较原始数据集和增强数据集之间的性能差异,这进一步证实了改进模块的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/2f9186327249/sensors-25-02477-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/1ca5f5d261b2/sensors-25-02477-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/f0716c68b8c7/sensors-25-02477-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/c3761c3c1a34/sensors-25-02477-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/8641e19168f7/sensors-25-02477-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/62635ad3d2fc/sensors-25-02477-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/be2f3545d274/sensors-25-02477-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/6e2c9c1ce35e/sensors-25-02477-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/66248eacf846/sensors-25-02477-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/ed51de001d82/sensors-25-02477-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/2f9186327249/sensors-25-02477-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/1ca5f5d261b2/sensors-25-02477-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/f0716c68b8c7/sensors-25-02477-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/c3761c3c1a34/sensors-25-02477-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/8641e19168f7/sensors-25-02477-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/62635ad3d2fc/sensors-25-02477-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/be2f3545d274/sensors-25-02477-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/6e2c9c1ce35e/sensors-25-02477-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/66248eacf846/sensors-25-02477-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/ed51de001d82/sensors-25-02477-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff5/12030941/2f9186327249/sensors-25-02477-g010.jpg

相似文献

1
Construction and Enhancement of a Rural Road Instance Segmentation Dataset Based on an Improved StyleGAN2-ADA.基于改进的StyleGAN2-ADA构建与增强农村道路实例分割数据集
Sensors (Basel). 2025 Apr 15;25(8):2477. doi: 10.3390/s25082477.
2
Road Traffic Sign Detection Method Based on RTS R-CNN Instance Segmentation Network.基于RTS R-CNN实例分割网络的道路交通标志检测方法
Sensors (Basel). 2023 Jul 20;23(14):6543. doi: 10.3390/s23146543.
3
IDA-MIL: Classification of Glomerular with Spike-like Projections via Multiple Instance Learning with Instance-level Data Augmentation.IDA-MIL:基于实例级数据增强的多实例学习的具有刺状突起的肾小球分类。
Comput Methods Programs Biomed. 2022 Oct;225:107106. doi: 10.1016/j.cmpb.2022.107106. Epub 2022 Sep 2.
4
Fusing attention mechanism with Mask R-CNN for instance segmentation of grape cluster in the field.将注意力机制与Mask R-CNN融合用于田间葡萄串的实例分割。
Front Plant Sci. 2022 Jul 22;13:934450. doi: 10.3389/fpls.2022.934450. eCollection 2022.
5
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
6
Improved Mask R-CNN Multi-Target Detection and Segmentation for Autonomous Driving in Complex Scenes.改进的 Mask R-CNN 多目标检测与分割在复杂场景下的自动驾驶。
Sensors (Basel). 2023 Apr 10;23(8):3853. doi: 10.3390/s23083853.
7
Mask-Refined R-CNN: A Network for Refining Object Details in Instance Segmentation.Mask-Refined R-CNN:用于实例分割中细化对象细节的网络。
Sensors (Basel). 2020 Feb 13;20(4):1010. doi: 10.3390/s20041010.
8
Fine Segmentation of Chinese Character Strokes Based on Coordinate Awareness and Enhanced BiFPN.基于坐标感知与增强型双向特征金字塔网络的汉字笔画精细分割
Sensors (Basel). 2024 May 28;24(11):3480. doi: 10.3390/s24113480.
9
An Improved Instance Segmentation Method for Complex Elements of Farm UAV Aerial Survey Images.一种用于农场无人机航测图像复杂元素的改进实例分割方法。
Sensors (Basel). 2024 Sep 15;24(18):5990. doi: 10.3390/s24185990.
10
Extraction of Roads Using the Archimedes Tuning Process with the Quantum Dilated Convolutional Neural Network.使用量子扩张卷积神经网络的阿基米德调谐过程提取道路
Sensors (Basel). 2023 Oct 28;23(21):8783. doi: 10.3390/s23218783.

本文引用的文献

1
LM-CycleGAN: Improving Underwater Image Quality Through Learned Perceptual Image Patch Similarity and Multi-Scale Adaptive Fusion Attention.LM-CycleGAN:通过学习感知图像块相似性和多尺度自适应融合注意力提高水下图像质量
Sensors (Basel). 2024 Nov 21;24(23):7425. doi: 10.3390/s24237425.
2
Local inconsistency detection using the Kullback-Leibler divergence measure.利用 KL 散度测度进行局部不一致性检测。
Syst Rev. 2024 Oct 17;13(1):261. doi: 10.1186/s13643-024-02680-4.
3
A Review on Recent Deep Learning-Based Semantic Segmentation for Urban Greenness Measurement.
基于深度学习的城市绿化度测量语义分割研究综述
Sensors (Basel). 2024 Mar 31;24(7):2245. doi: 10.3390/s24072245.
4
Agricultural machinery automatic navigation technology.农业机械自动导航技术
iScience. 2023 Dec 14;27(2):108714. doi: 10.1016/j.isci.2023.108714. eCollection 2024 Feb 16.
5
Differentiable Image Data Augmentation and Its Applications: A Survey.可微图像数据增强及其应用:综述
IEEE Trans Pattern Anal Mach Intell. 2024 Feb;46(2):1148-1164. doi: 10.1109/TPAMI.2023.3330862. Epub 2024 Jan 8.
6
Approaches and Limitations of Machine Learning for Synthetic Ultrasound Generation: A Scoping Review.用于合成超声生成的机器学习方法与局限性:一项范围综述
J Ultrasound Med. 2023 Dec;42(12):2695-2706. doi: 10.1002/jum.16332. Epub 2023 Sep 29.
7
CCT-Unet: A U-Shaped Network Based on Convolution Coupled Transformer for Segmentation of Peripheral and Transition Zones in Prostate MRI.CCT-Unet:一种基于卷积耦合Transformer 的 U 型网络,用于前列腺 MRI 中周边区和移行区的分割。
IEEE J Biomed Health Inform. 2023 Sep;27(9):4341-4351. doi: 10.1109/JBHI.2023.3289913. Epub 2023 Sep 6.
8
GAN Inversion: A Survey.GAN 反转:综述。
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3121-3138. doi: 10.1109/TPAMI.2022.3181070.
9
A Survey on Vision Transformer.视觉Transformer综述
IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):87-110. doi: 10.1109/TPAMI.2022.3152247. Epub 2022 Dec 5.
10
Unsupervised MR-to-CT Synthesis Using Structure-Constrained CycleGAN.基于结构约束循环生成对抗网络的无监督磁共振-计算机断层合成。
IEEE Trans Med Imaging. 2020 Dec;39(12):4249-4261. doi: 10.1109/TMI.2020.3015379. Epub 2020 Nov 30.