• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

迈向高效 U-Nets:一种耦合和量化方法。

Towards Efficient U-Nets: A Coupled and Quantized Approach.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2020 Aug;42(8):2038-2050. doi: 10.1109/TPAMI.2019.2907634. Epub 2019 Mar 26.

DOI:10.1109/TPAMI.2019.2907634
PMID:30932829
Abstract

In this paper, we propose to couple stacked U-Nets for efficient visual landmark localization. The key idea is to globally reuse features of the same semantic meanings across the stacked U-Nets. The feature reuse makes each U-Net light-weighted. Specially, we propose an order- K coupling design to trim off long-distance shortcuts, together with an iterative refinement and memory sharing mechanism. To further improve the efficiency, we quantize the parameters, intermediate features, and gradients of the coupled U-Nets to low bit-width numbers. We validate our approach in two tasks: human pose estimation and facial landmark localization. The results show that our approach achieves state-of-the-art localization accuracy but using  ∼ 70% fewer parameters,  ∼ 30% less inference time,  ∼ 98% less model size, and saving  ∼ 75% training memory compared with benchmark localizers.

摘要

在本文中,我们提出了一种堆叠 U-Net 来进行高效的视觉地标定位。其关键思想是在堆叠的 U-Net 之间全局重复使用相同语义的特征。这种特征复用使每个 U-Net 轻量化。具体来说,我们提出了一种 K 阶耦合设计来修剪掉长距离的捷径,同时结合了迭代细化和内存共享机制。为了进一步提高效率,我们对耦合 U-Net 的参数、中间特征和梯度进行量化,以达到低比特宽度的数字。我们在两个任务中验证了我们的方法:人体姿态估计和面部地标定位。结果表明,与基准定位器相比,我们的方法在实现最先进的定位精度的同时,使用的参数减少了约 70%,推理时间减少了约 30%,模型大小减少了约 98%,训练内存节省了约 75%。

相似文献

1
Towards Efficient U-Nets: A Coupled and Quantized Approach.迈向高效 U-Nets:一种耦合和量化方法。
IEEE Trans Pattern Anal Mach Intell. 2020 Aug;42(8):2038-2050. doi: 10.1109/TPAMI.2019.2907634. Epub 2019 Mar 26.
2
Training high-performance and large-scale deep neural networks with full 8-bit integers.用全 8 位整数训练高性能和大规模深度神经网络。
Neural Netw. 2020 May;125:70-82. doi: 10.1016/j.neunet.2019.12.027. Epub 2020 Jan 15.
3
EAC-Net: Deep Nets with Enhancing and Cropping for Facial Action Unit Detection.EAC-Net:用于面部动作单元检测的增强和裁剪的深度网络。
IEEE Trans Pattern Anal Mach Intell. 2018 Nov;40(11):2583-2596. doi: 10.1109/TPAMI.2018.2791608. Epub 2018 Jan 10.
4
Lazily Aggregated Quantized Gradient Innovation for Communication-Efficient Federated Learning.用于通信高效联邦学习的懒惰聚合量化梯度创新
IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):2031-2044. doi: 10.1109/TPAMI.2020.3033286. Epub 2022 Mar 4.
5
An investigation of the effect of fat suppression and dimensionality on the accuracy of breast MRI segmentation using U-nets.利用 U-Nets 研究脂肪抑制和维度对乳腺 MRI 分割准确性的影响。
Med Phys. 2019 Mar;46(3):1230-1244. doi: 10.1002/mp.13375. Epub 2019 Feb 4.
6
Visual Semantic Landmark-Based Robust Mapping and Localization for Autonomous Indoor Parking.基于视觉语义地标物的自主室内停车稳健映射与定位。
Sensors (Basel). 2019 Jan 4;19(1):161. doi: 10.3390/s19010161.
7
PicassoNet: Searching Adaptive Architecture for Efficient Facial Landmark Localization.毕加索网络:为高效面部地标定位搜索自适应架构
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10516-10527. doi: 10.1109/TNNLS.2022.3167743. Epub 2023 Nov 30.
8
Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization.用于地标定位的结构感知全卷积网络的对抗学习
IEEE Trans Pattern Anal Mach Intell. 2020 Jul;42(7):1654-1669. doi: 10.1109/TPAMI.2019.2901875. Epub 2019 Feb 26.
9
Pose and Expression Independent Facial Landmark Localization Using Dense-SURF and the Hausdorff Distance.基于密集 SURF 和 Hausdorff 距离的姿态和表情无关的人脸地标定位
IEEE Trans Pattern Anal Mach Intell. 2013 Mar;35(3):624-38. doi: 10.1109/TPAMI.2012.87. Epub 2012 Apr 10.
10
CGNet: A Light-Weight Context Guided Network for Semantic Segmentation.CGNet:用于语义分割的轻量级上下文引导网络
IEEE Trans Image Process. 2021;30:1169-1179. doi: 10.1109/TIP.2020.3042065. Epub 2020 Dec 17.

引用本文的文献

1
SinusC-Net for automatic classification of surgical plans for maxillary sinus augmentation using a 3D distance-guided network.基于 3D 距离引导网络的鼻窦 C-Net 用于上颌窦提升术手术方案的自动分类。
Sci Rep. 2023 Jul 19;13(1):11653. doi: 10.1038/s41598-023-38273-9.
2
A Novel Approach for Efficient Solar Panel Fault Classification Using Coupled UDenseNet.基于耦合 UDenseNet 的高效太阳能电池板故障分类新方法
Sensors (Basel). 2023 May 19;23(10):4918. doi: 10.3390/s23104918.
3
GradFreeBits: Gradient-Free Bit Allocation for Mixed-Precision Neural Networks.
无梯度比特分配在混合精度神经网络中的应用。
Sensors (Basel). 2022 Dec 13;22(24):9772. doi: 10.3390/s22249772.
4
Deep learning with noise-to-noise training for denoising in SPECT myocardial perfusion imaging.基于噪声对噪声训练的深度学习用于 SPECT 心肌灌注成像去噪。
Med Phys. 2021 Jan;48(1):156-168. doi: 10.1002/mp.14577. Epub 2020 Nov 23.