LiteGaze: Neural architecture search for efficient gaze estimation.

Affiliations

School of Mechanical Engineering, University of Science and Technology Beijing, Beijing, China.

College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China.

Publication information

PLoS One. 2023 May 1;18(5):e0284814. doi: 10.1371/journal.pone.0284814. eCollection 2023.

DOI: 10.1371/journal.pone.0284814
PMID: 37126491
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10150965/
Abstract

Gaze estimation plays a critical role in human-centered vision applications such as human-computer interaction and virtual reality. Although deep convolutional neural networks have brought significant progress in automatic gaze estimation, it is still difficult to deploy deep-learning-based gaze estimation models directly across different edge devices, because of their high computational cost and the devices' varied resource constraints. This work proposes LiteGaze, a deep learning framework that learns architectures for efficient gaze estimation via neural architecture search (NAS). Inspired by the once-for-all model (Cai et al., 2020), this work decouples model training and architecture search into two separate stages. First, a supernet is trained to support diverse architectural settings. Then specialized sub-networks are selected from the trained supernet under different efficiency constraints. Extensive experiments on two gaze estimation datasets demonstrate the superiority of the proposed method over previous works, advancing real-time gaze estimation on edge devices.
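
The second stage described in the abstract, selecting a sub-network from a trained supernet under an efficiency constraint, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the search space (`DEPTHS`, `WIDTHS`, `KERNELS`), the toy cost model `estimated_cost`, the toy quality score `proxy_error`, and the function `select_subnet` are all hypothetical. Exhaustive enumeration is used only because the toy space is small; the abstract does not state which search procedure LiteGaze uses.

```python
import itertools
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SubnetConfig:
    """One candidate sub-network, described by hypothetical architectural knobs."""
    depth: int    # number of blocks kept from the supernet
    width: float  # channel-width multiplier
    kernel: int   # convolution kernel size


# Hypothetical search space (not from the paper).
DEPTHS = (2, 3, 4)
WIDTHS = (0.5, 0.75, 1.0)
KERNELS = (3, 5, 7)


def estimated_cost(cfg: SubnetConfig) -> float:
    """Toy cost proxy in arbitrary units; grows with depth, width^2 and kernel^2."""
    return cfg.depth * cfg.width ** 2 * cfg.kernel ** 2


def proxy_error(cfg: SubnetConfig) -> float:
    """Toy stand-in for the validation error of a sub-network (lower is better).

    In a real pipeline this would come from evaluating the sub-network with
    weights inherited from the trained supernet.
    """
    return 10.0 / (1.0 + 0.5 * cfg.depth + 2.0 * cfg.width + 0.1 * cfg.kernel)


def select_subnet(budget: float) -> Optional[SubnetConfig]:
    """Return the lowest-error candidate whose estimated cost fits the budget."""
    candidates = [
        SubnetConfig(d, w, k)
        for d, w, k in itertools.product(DEPTHS, WIDTHS, KERNELS)
    ]
    feasible = [c for c in candidates if estimated_cost(c) <= budget]
    if not feasible:
        return None  # no sub-network satisfies this constraint
    return min(feasible, key=proxy_error)


if __name__ == "__main__":
    # Different budgets stand in for different edge devices.
    for budget in (10.0, 50.0, 200.0):
        print(budget, select_subnet(budget))
```

Because training happens once in the first stage, a selection step like this can be rerun for each new budget without retraining, which is the point of decoupling the two stages.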

Similar articles

1
LiteGaze: Neural architecture search for efficient gaze estimation.
PLoS One. 2023 May 1;18(5):e0284814. doi: 10.1371/journal.pone.0284814. eCollection 2023.
2
Person-Specific Gaze Estimation from Low-Quality Webcam Images.
Sensors (Basel). 2023 Apr 20;23(8):4138. doi: 10.3390/s23084138.
3
One-Shot Neural Architecture Search by Dynamically Pruning Supernet in Hierarchical Order.
Int J Neural Syst. 2021 Jul;31(7):2150029. doi: 10.1142/S0129065721500295. Epub 2021 Jun 14.
4
Multiview Multitask Gaze Estimation With Deep Convolutional Neural Networks.
IEEE Trans Neural Netw Learn Syst. 2019 Oct;30(10):3010-3023. doi: 10.1109/TNNLS.2018.2865525. Epub 2018 Sep 3.
5
PSE-Net: Channel pruning for Convolutional Neural Networks with parallel-subnets estimator.
Neural Netw. 2024 Jun;174:106263. doi: 10.1016/j.neunet.2024.106263. Epub 2024 Mar 20.
6
Automatic Gaze Analysis: A Survey of Deep Learning Based Approaches.
IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):61-84. doi: 10.1109/TPAMI.2023.3321337. Epub 2023 Dec 5.
7
Disturbance-immune weight sharing for neural architecture search.
Neural Netw. 2021 Dec;144:553-564. doi: 10.1016/j.neunet.2021.09.002. Epub 2021 Sep 23.
8
One-Shot Neural Architecture Search: Maximising Diversity to Overcome Catastrophic Forgetting.
IEEE Trans Pattern Anal Mach Intell. 2021 Sep;43(9):2921-2935. doi: 10.1109/TPAMI.2020.3035351. Epub 2021 Aug 4.
9
ATNAS: Automatic Termination for Neural Architecture Search.
Neural Netw. 2023 Sep;166:446-458. doi: 10.1016/j.neunet.2023.07.011. Epub 2023 Jul 26.
10
EMONAS-Net: Efficient multiobjective neural architecture search using surrogate-assisted evolutionary algorithm for 3D medical image segmentation.
Artif Intell Med. 2021 Sep;119:102154. doi: 10.1016/j.artmed.2021.102154. Epub 2021 Aug 24.
