• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AESeg:使用特征类映射知识蒸馏的亲和力增强分割器,用于室内场景的高效RGB-D语义分割。

AESeg: Affinity-enhanced segmenter using feature class mapping knowledge distillation for efficient RGB-D semantic segmentation of indoor scenes.

作者信息

Zhou Wujie, Xiao Yuxiang, Qiang Fangfang, Dong Xiena, Xu Caie, Yu Lu

机构信息

School of Information & Electronic Engineering, Zhejiang University of Science & Technology, Hangzhou 310023, China; School of Computer Science and Engineering, Nanyang Technological University, Singapore 308232, Singapore.

School of Information & Electronic Engineering, Zhejiang University of Science & Technology, Hangzhou 310023, China.

出版信息

Neural Netw. 2025 Aug;188:107438. doi: 10.1016/j.neunet.2025.107438. Epub 2025 Mar 25.

DOI:10.1016/j.neunet.2025.107438
PMID:40184869
Abstract

Recent advances in deep learning for semantic segmentation models have introduced dynamic segmentation methods as opposed to static segmentation methods represented by full convolutional networks. Dynamic prediction methods replace static classifiers with learnable class embeddings to achieve global semantic awareness. Although dynamic methods excel in accuracy, the learning and inference of class embeddings is usually accompanied by a tedious computational burden. To address this challenge, we propose an affinity-enhanced semantic segmentation framework that synergistically combines the strengths of static and dynamic methodologies. Specifically, our approach leverages semantic features to obtain preliminary static segmentation results and constructs a binary affinity matrix that explicitly encodes pixel-wise category relationships. This affinity matrix serves as a dynamic classification kernel, effectively integrating global context awareness with static features, achieving comparable performance to purely dynamic approaches but with a substantially reduced computational overhead. Furthermore, we introduce a novel feature-to-category mapping refinement technique. This technique performs feature knowledge migration by learning a linear transformation between the semantic feature space and the segmentation probability space, resulting in improved accuracy without increasing model complexity. Numerous experiments demonstrated that the proposed method achieves the best performance on the widely used NYUv2 and SUN-RGBD datasets. And the effectiveness of our method in different scenes is verified on the outdoor scene dataset CamVid.

摘要

深度学习语义分割模型的最新进展引入了动态分割方法,以区别于全卷积网络所代表的静态分割方法。动态预测方法用可学习的类别嵌入取代静态分类器,以实现全局语义感知。尽管动态方法在准确性方面表现出色,但类别嵌入的学习和推理通常伴随着繁重的计算负担。为应对这一挑战,我们提出了一种亲和力增强的语义分割框架,该框架协同结合了静态和动态方法的优势。具体而言,我们的方法利用语义特征来获得初步的静态分割结果,并构建一个二元亲和力矩阵,该矩阵明确编码逐像素的类别关系。这个亲和力矩阵充当动态分类内核,有效地将全局上下文感知与静态特征整合在一起,在性能上与纯动态方法相当,但计算开销大幅降低。此外,我们引入了一种新颖的特征到类别映射细化技术。该技术通过学习语义特征空间和分割概率空间之间的线性变换来执行特征知识迁移,在不增加模型复杂度的情况下提高了准确性。大量实验表明,所提出的方法在广泛使用的NYUv2和SUN-RGBD数据集上取得了最佳性能。并且我们的方法在室外场景数据集CamVid上验证了其在不同场景中的有效性。

相似文献

1
AESeg: Affinity-enhanced segmenter using feature class mapping knowledge distillation for efficient RGB-D semantic segmentation of indoor scenes.AESeg:使用特征类映射知识蒸馏的亲和力增强分割器,用于室内场景的高效RGB-D语义分割。
Neural Netw. 2025 Aug;188:107438. doi: 10.1016/j.neunet.2025.107438. Epub 2025 Mar 25.
2
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
3
Semantic Segmentation Using Pixel-Wise Adaptive Label Smoothing via Self-Knowledge Distillation for Limited Labeling Data.基于自知识蒸馏的像素级自适应标签平滑的有限标注数据语义分割。
Sensors (Basel). 2022 Mar 29;22(7):2623. doi: 10.3390/s22072623.
4
Multi-scale full spike pattern for semantic segmentation.多尺度全尖峰模式的语义分割。
Neural Netw. 2024 Aug;176:106330. doi: 10.1016/j.neunet.2024.106330. Epub 2024 Apr 20.
5
MSKD: Structured knowledge distillation for efficient medical image segmentation.MSKD:用于高效医学图像分割的结构化知识蒸馏。
Comput Biol Med. 2023 Sep;164:107284. doi: 10.1016/j.compbiomed.2023.107284. Epub 2023 Aug 2.
6
Multibranch semantic image segmentation model based on edge optimization and category perception.基于边缘优化和类别感知的多分支语义图像分割模型
PLoS One. 2024 Dec 19;19(12):e0315621. doi: 10.1371/journal.pone.0315621. eCollection 2024.
7
A Lightweight Semantic Segmentation Algorithm Based on Deep Convolutional Neural Networks.基于深度卷积神经网络的轻量级语义分割算法。
Comput Intell Neurosci. 2022 Sep 6;2022:5339664. doi: 10.1155/2022/5339664. eCollection 2022.
8
Euclidean-Distance-Preserved Feature Reduction for efficient person re-identification.基于欧几里得距离保特征降维的高效行人再识别
Neural Netw. 2024 Dec;180:106572. doi: 10.1016/j.neunet.2024.106572. Epub 2024 Aug 8.
9
Adversarial class-wise self-knowledge distillation for medical image segmentation.用于医学图像分割的对抗性类别自知识蒸馏
Sci Rep. 2025 Apr 17;15(1):13231. doi: 10.1038/s41598-025-98116-7.
10
Bagging Improves the Performance of Deep Learning-Based Semantic Segmentation with Limited Labeled Images: A Case Study of Crop Segmentation for High-Throughput Plant Phenotyping.基于有限标注图像的深度学习语义分割的 Bagging 改进:以高通量植物表型作物分割为例。
Sensors (Basel). 2024 May 26;24(11):3420. doi: 10.3390/s24113420.

引用本文的文献

1
CFANet: The Cross-Modal Fusion Attention Network for Indoor RGB-D Semantic Segmentation.CFANet:用于室内RGB-D语义分割的跨模态融合注意力网络
J Imaging. 2025 May 27;11(6):177. doi: 10.3390/jimaging11060177.