• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

双注意力网络:一种集成空间和通道注意力机制的双分支自动驾驶图像分割网络。

BiAttentionNet: a dual-branch automatic driving image segmentation network integrating spatial and channel attention mechanisms.

作者信息

Liu Ruijun, Zhang Yijun, Chen Jieying, Wu Zhigang, Zhu Yaohui, Liu Jun, Chen Min

机构信息

Jiangxi Dongrui Intelligent Equipment Technology Co., LTD., Nanchang, 330028, China.

Gansu Power Transmission Engineering Co., Ltd., Lanzhou, 730050, China.

出版信息

Sci Rep. 2025 Apr 16;15(1):13193. doi: 10.1038/s41598-025-95470-4.

DOI:10.1038/s41598-025-95470-4
PMID:40240778
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12003632/
Abstract

Real-time semantic segmentation is one of the most researched areas in the field of computer vision, and research on dual-branch networks has gradually become a popular direction in network architecture research. In this paper, a dual-branch automatic driving image segmentation network integrating spatial and channel attention mechanisms is proposed with named as "BiAttentionNet". The network aims to balance network accuracy and real-time performance by processing high-level semantic information and low-level detail information separately. BiAttentionNet consists of three main parts: the detail branch, the semantic branch, and the proposed attention-guided fusion layer. The detail branch extracts local and surrounding context features using the designed PCSD convolution module to process wide-channel low-level feature information. The semantic branch utilizes an improved lightweight Unet network to extract semantic information from deep narrow channels. Finally, the proposed attention-guided fusion layer fuses the features of the dual branches using detail attention and channel attention mechanisms to achieve image segmentation tasks in road scenes. Comparative experiments with recent mainstream networks such as BiseNet v2, Fast-SCNN, ConvNeXt, SegNeXt, Segformer, CGNet, etc., on the Cityscapes dataset show that BiAttentionNet achieves a highest accuracy of 65.89% in the mIoU metric for the backbone network. This validates the effectiveness of the proposed BiAttentionNet.

摘要

实时语义分割是计算机视觉领域研究最多的领域之一,对双分支网络的研究逐渐成为网络架构研究中的一个热门方向。本文提出了一种集成空间和通道注意力机制的双分支自动驾驶图像分割网络,命名为“BiAttentionNet”。该网络旨在通过分别处理高级语义信息和低级细节信息来平衡网络准确性和实时性能。BiAttentionNet由三个主要部分组成:细节分支、语义分支和提出的注意力引导融合层。细节分支使用设计的PCSD卷积模块处理宽通道低级特征信息,以提取局部和周围上下文特征。语义分支利用改进的轻量级Unet网络从深窄通道中提取语义信息。最后,提出的注意力引导融合层使用细节注意力和通道注意力机制融合双分支的特征,以实现道路场景中的图像分割任务。在Cityscapes数据集上与最近的主流网络如BiseNet v2、Fast-SCNN、ConvNeXt、SegNeXt、Segformer、CGNet等进行的对比实验表明,BiAttentionNet在骨干网络的mIoU指标中达到了65.89%的最高准确率。这验证了所提出的BiAttentionNet的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/2543d70447a0/41598_2025_95470_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/eb4a773396a3/41598_2025_95470_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/6e9b7dce7f94/41598_2025_95470_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/89f875e06fd8/41598_2025_95470_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/f9ac60a089ba/41598_2025_95470_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/57e284567629/41598_2025_95470_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/4c68e5496988/41598_2025_95470_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/2543d70447a0/41598_2025_95470_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/eb4a773396a3/41598_2025_95470_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/6e9b7dce7f94/41598_2025_95470_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/89f875e06fd8/41598_2025_95470_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/f9ac60a089ba/41598_2025_95470_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/57e284567629/41598_2025_95470_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/4c68e5496988/41598_2025_95470_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b833/12003632/2543d70447a0/41598_2025_95470_Fig7_HTML.jpg

相似文献

1
BiAttentionNet: a dual-branch automatic driving image segmentation network integrating spatial and channel attention mechanisms.双注意力网络:一种集成空间和通道注意力机制的双分支自动驾驶图像分割网络。
Sci Rep. 2025 Apr 16;15(1):13193. doi: 10.1038/s41598-025-95470-4.
2
TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation.TGDAUNet:基于 Transformer 和 GCNN 的双分支注意力 U-Net 用于医学图像分割。
Comput Biol Med. 2023 Dec;167:107583. doi: 10.1016/j.compbiomed.2023.107583. Epub 2023 Oct 21.
3
Rethinking 1D convolution for lightweight semantic segmentation.重新思考用于轻量级语义分割的一维卷积
Front Neurorobot. 2023 Feb 9;17:1119231. doi: 10.3389/fnbot.2023.1119231. eCollection 2023.
4
Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation.双边注意解码器:用于实时语义分割的轻量级解码器。
Neural Netw. 2021 May;137:188-199. doi: 10.1016/j.neunet.2021.01.021. Epub 2021 Jan 30.
5
A lightweight multi-dimension dynamic convolutional network for real-time semantic segmentation.一种用于实时语义分割的轻量级多维动态卷积网络。
Front Neurorobot. 2022 Dec 15;16:1075520. doi: 10.3389/fnbot.2022.1075520. eCollection 2022.
6
Implementation of a Lightweight Semantic Segmentation Algorithm in Road Obstacle Detection.在道路障碍物检测中实现轻量级语义分割算法。
Sensors (Basel). 2020 Dec 10;20(24):7089. doi: 10.3390/s20247089.
7
Multi-Level and Multi-Scale Feature Aggregation Network for Semantic Segmentation in Vehicle-Mounted Scenes.车载场景语义分割的多层次多尺度特征聚合网络。
Sensors (Basel). 2021 May 9;21(9):3270. doi: 10.3390/s21093270.
8
Contour-aware semantic segmentation network with spatial attention mechanism for medical image.用于医学图像的具有空间注意力机制的轮廓感知语义分割网络。
Vis Comput. 2022;38(3):749-762. doi: 10.1007/s00371-021-02075-9. Epub 2021 Feb 22.
9
UAV Imagery Real-Time Semantic Segmentation with Global-Local Information Attention.具有全局-局部信息注意力的无人机图像实时语义分割
Sensors (Basel). 2025 Mar 13;25(6):1786. doi: 10.3390/s25061786.
10
Based on cross-scale fusion attention mechanism network for semantic segmentation for street scenes.基于跨尺度融合注意力机制网络的街景语义分割
Front Neurorobot. 2023 Aug 31;17:1204418. doi: 10.3389/fnbot.2023.1204418. eCollection 2023.

本文引用的文献

1
CGNet: A Light-Weight Context Guided Network for Semantic Segmentation.CGNet:用于语义分割的轻量级上下文引导网络
IEEE Trans Image Process. 2021;30:1169-1179. doi: 10.1109/TIP.2020.3042065. Epub 2020 Dec 17.
2
CA-Net: Comprehensive Attention Convolutional Neural Networks for Explainable Medical Image Segmentation.CA-Net:用于可解释医学图像分割的综合注意力卷积神经网络。
IEEE Trans Med Imaging. 2021 Feb;40(2):699-711. doi: 10.1109/TMI.2020.3035253. Epub 2021 Feb 2.
3
Deep High-Resolution Representation Learning for Visual Recognition.
用于视觉识别的深度高分辨率表征学习
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3349-3364. doi: 10.1109/TPAMI.2020.2983686. Epub 2021 Sep 2.