• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 3D-密集注意 GAN 网络的虚拟视图生成。

Virtual View Generation Based on 3D-Dense-Attentive GAN Networks.

机构信息

State Key Lab of Industrial Control Technology, Zhejiang University, Hangzhou 310027, China.

出版信息

Sensors (Basel). 2019 Jan 16;19(2):344. doi: 10.3390/s19020344.

DOI:10.3390/s19020344
PMID:30654544
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6358985/
Abstract

A binocular vision system is a common perception component of an intelligent vehicle. Benefiting from the biomimetic structure, the system is simple and effective. Which are extremely snesitive on external factors, especially missing vision signals. In this paper, a virtual view-generation algorithm based on generative adversarial networks (GAN) is proposed to enhance the robustness of binocular vision systems. The proposed model consists of two parts: generative network and discriminator network. To improve the quality of a virtual view, a generative network structure based on 3D convolutional neural networks (3D-CNN) and attentive mechanisms is introduced to extract the time-series features from image sequences. To avoid gradient vanish during training, the dense block structure is utilized to improve the discriminator network. Meanwhile, three kinds of image features, including image edge, depth map and optical flow are extracted to constrain the supervised training of model. The final results on KITTI and Cityscapes datasets demonstrate that our algorithm outperforms conventional methods, and the missing vision signal can be replaced by a generated virtual view.

摘要

双目视觉系统是智能车辆常见的感知组件。得益于仿生结构,该系统简单有效,但对外界因素极其敏感,特别是缺失视觉信号。本文提出了一种基于生成对抗网络(GAN)的虚拟视图生成算法,以增强双目视觉系统的鲁棒性。所提出的模型由生成网络和判别网络两部分组成。为了提高虚拟视图的质量,引入了一种基于三维卷积神经网络(3D-CNN)和注意机制的生成网络结构,从图像序列中提取时间序列特征。为了避免训练过程中的梯度消失,利用密集块结构来改进判别网络。同时,提取三种图像特征,包括图像边缘、深度图和光流,以约束模型的监督训练。在 KITTI 和 Cityscapes 数据集上的最终结果表明,我们的算法优于传统方法,可以用生成的虚拟视图来替代缺失的视觉信号。

相似文献

1
Virtual View Generation Based on 3D-Dense-Attentive GAN Networks.基于 3D-密集注意 GAN 网络的虚拟视图生成。
Sensors (Basel). 2019 Jan 16;19(2):344. doi: 10.3390/s19020344.
2
Parallel Connected Generative Adversarial Network with Quadratic Operation for SAR Image Generation and Application for Classification.并行连接生成对抗网络与二次运算在 SAR 图像生成中的应用及分类。
Sensors (Basel). 2019 Feb 19;19(4):871. doi: 10.3390/s19040871.
3
A Hyperspectral Image Classification Method Based on Multi-Discriminator Generative Adversarial Networks.一种基于多判别器生成对抗网络的高光谱图像分类方法。
Sensors (Basel). 2019 Jul 25;19(15):3269. doi: 10.3390/s19153269.
4
Ea-GANs: Edge-Aware Generative Adversarial Networks for Cross-Modality MR Image Synthesis.Ea-GANs:用于跨模态磁共振图像合成的边缘感知生成对抗网络。
IEEE Trans Med Imaging. 2019 Jul;38(7):1750-1762. doi: 10.1109/TMI.2019.2895894. Epub 2019 Jan 29.
5
SDOF-GAN: Symmetric Dense Optical Flow Estimation With Generative Adversarial Networks.SDOF-GAN:基于生成对抗网络的对称密集光流估计
IEEE Trans Image Process. 2021;30:6036-6049. doi: 10.1109/TIP.2021.3084073. Epub 2021 Jul 5.
6
Structurally constrained and pathology-aware convolutional transformer generative adversarial network for virtual histology staining of human coronary optical coherence tomography images.结构约束和病理学感知卷积变压器生成对抗网络,用于人类冠状动脉光学相干断层扫描图像的虚拟组织染色。
J Biomed Opt. 2024 Mar;29(3):036004. doi: 10.1117/1.JBO.29.3.036004. Epub 2024 Mar 25.
7
An end-to-end stereo matching algorithm based on improved convolutional neural network.基于改进卷积神经网络的端到端立体匹配算法。
Math Biosci Eng. 2020 Nov 6;17(6):7787-7803. doi: 10.3934/mbe.2020396.
8
AMD-GAN: Attention encoder and multi-branch structure based generative adversarial networks for fundus disease detection from scanning laser ophthalmoscopy images.AMD-GAN:基于注意力编码器和多分支结构的生成式对抗网络,用于从扫描激光眼底图像中检测眼底疾病。
Neural Netw. 2020 Dec;132:477-490. doi: 10.1016/j.neunet.2020.09.005. Epub 2020 Sep 17.
9
High quality and fast compressed sensing MRI reconstruction via edge-enhanced dual discriminator generative adversarial network.基于边缘增强双鉴别器生成对抗网络的高质量快速压缩感知 MRI 重建。
Magn Reson Imaging. 2021 Apr;77:124-136. doi: 10.1016/j.mri.2020.12.011. Epub 2021 Jan 7.
10
Limited-view CT Reconstruction Based on Autoencoder-like Generative Adversarial Networks with Joint Loss.基于具有联合损失的类自动编码器生成对抗网络的有限视角CT重建
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:5570-5574. doi: 10.1109/EMBC.2018.8513659.

本文引用的文献

1
Deep Learning for Computer Vision: A Brief Review.深度学习在计算机视觉中的应用综述
Comput Intell Neurosci. 2018 Feb 1;2018:7068349. doi: 10.1155/2018/7068349. eCollection 2018.
2
Deep learning in neural networks: an overview.神经网络中的深度学习:综述。
Neural Netw. 2015 Jan;61:85-117. doi: 10.1016/j.neunet.2014.09.003. Epub 2014 Oct 13.
3
Stereo camera based virtual cane system with identifiable distance tactile feedback for the blind.基于立体摄像头的盲人虚拟手杖系统,具备可识别距离的触觉反馈。
Sensors (Basel). 2014 Jun 13;14(6):10412-31. doi: 10.3390/s140610412.
4
3D convolutional neural networks for human action recognition.三维卷积神经网络的人体动作识别。
IEEE Trans Pattern Anal Mach Intell. 2013 Jan;35(1):221-31. doi: 10.1109/TPAMI.2012.59.
5
FSIM: a feature similarity index for image quality assessment.FSIM:一种用于图像质量评估的特征相似性指数。
IEEE Trans Image Process. 2011 Aug;20(8):2378-86. doi: 10.1109/TIP.2011.2109730. Epub 2011 Jan 31.
6
Optical symbolic substitution: edge detection using Prewitt, Sobel, and Roberts operators.光学符号替换:使用普雷维特、索贝尔和罗伯茨算子进行边缘检测。
Appl Opt. 1989 Nov 1;28(21):4644-8. doi: 10.1364/AO.28.004644.
7
Framewise phoneme classification with bidirectional LSTM and other neural network architectures.使用双向长短期记忆网络和其他神经网络架构进行逐帧音素分类。
Neural Netw. 2005 Jun-Jul;18(5-6):602-10. doi: 10.1016/j.neunet.2005.06.042.
8
Image quality assessment: from error visibility to structural similarity.图像质量评估:从误差可见性到结构相似性。
IEEE Trans Image Process. 2004 Apr;13(4):600-12. doi: 10.1109/tip.2003.819861.