• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

学习动态纹理,用于人类演员的神经渲染。

Learning Dynamic Textures for Neural Rendering of Human Actors.

出版信息

IEEE Trans Vis Comput Graph. 2021 Oct;27(10):4009-4022. doi: 10.1109/TVCG.2020.2996594. Epub 2021 Sep 1.

DOI:10.1109/TVCG.2020.2996594
PMID:32746256
Abstract

Synthesizing realistic videos of humans using neural networks has been a popular alternative to the conventional graphics-based rendering pipeline due to its high efficiency. Existing works typically formulate this as an image-to-image translation problem in 2D screen space, which leads to artifacts such as over-smoothing, missing body parts, and temporal instability of fine-scale detail, such as pose-dependent wrinkles in the clothing. In this article, we propose a novel human video synthesis method that approaches these limiting factors by explicitly disentangling the learning of time-coherent fine-scale details from the embedding of the human in 2D screen space. More specifically, our method relies on the combination of two convolutional neural networks (CNNs). Given the pose information, the first CNN predicts a dynamic texture map that contains time-coherent high-frequency details, and the second CNN conditions the generation of the final video on the temporally coherent output of the first CNN. We demonstrate several applications of our approach, such as human reenactment and novel view synthesis from monocular video, where we show significant improvement over the state of the art both qualitatively and quantitatively.

摘要

使用神经网络合成逼真的人类视频,由于其高效性,已成为传统基于图形的渲染管道的热门替代方案。现有作品通常将其在 2D 屏幕空间中表述为图像到图像的转换问题,这会导致过度平滑、缺少身体部位以及精细细节的时间不稳定等伪影,例如服装上依赖姿势的皱纹。在本文中,我们提出了一种新颖的人类视频合成方法,通过明确将时间一致的精细细节的学习与人体在 2D 屏幕空间中的嵌入分开,来解决这些限制因素。具体来说,我们的方法依赖于两个卷积神经网络 (CNN) 的组合。给定姿势信息,第一个 CNN 预测包含时间一致的高频细节的动态纹理图,第二个 CNN 根据第一个 CNN 的时间一致输出条件生成最终视频。我们展示了我们的方法的几个应用,例如人类重现和从单目视频进行新视图合成,在定性和定量方面都明显优于现有技术。

相似文献

1
Learning Dynamic Textures for Neural Rendering of Human Actors.学习动态纹理,用于人类演员的神经渲染。
IEEE Trans Vis Comput Graph. 2021 Oct;27(10):4009-4022. doi: 10.1109/TVCG.2020.2996594. Epub 2021 Sep 1.
2
Reconstruction of Compressed-sensing MR Imaging Using Deep Residual Learning in the Image Domain.基于图像域的深度残差学习的压缩感知磁共振成像重建。
Magn Reson Med Sci. 2021 Jun 1;20(2):190-203. doi: 10.2463/mrms.mp.2019-0139. Epub 2020 Jul 2.
3
Video-based crowd synthesis.基于视频的人群合成。
IEEE Trans Vis Comput Graph. 2013 Nov;19(11):1935-47. doi: 10.1109/TVCG.2012.317.
4
Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques.使用机器学习和深度学习技术评估白内障手术视频中的相位自动识别。
JAMA Netw Open. 2019 Apr 5;2(4):e191860. doi: 10.1001/jamanetworkopen.2019.1860.
5
Spatio-Temporal Manifold Learning for Human Motions via Long-Horizon Modeling.通过长时建模实现人体运动的时空流形学习
IEEE Trans Vis Comput Graph. 2021 Jan;27(1):216-227. doi: 10.1109/TVCG.2019.2936810. Epub 2020 Nov 24.
6
Disentangled Human Body Embedding Based on Deep Hierarchical Neural Network.基于深度层次神经网络的解缠人体嵌入。
IEEE Trans Vis Comput Graph. 2020 Aug;26(8):2560-2575. doi: 10.1109/TVCG.2020.2988476. Epub 2020 Apr 20.
7
Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks.使用增强卷积和递归神经网络监测手术视频中的工具使用情况。
Med Image Anal. 2018 Jul;47:203-218. doi: 10.1016/j.media.2018.05.001. Epub 2018 May 9.
8
Single patient convolutional neural networks for real-time MR reconstruction: coherent low-resolution versus incoherent undersampling.单病例卷积神经网络在实时磁共振重建中的应用:相干低分辨率与非相干欠采样。
Phys Med Biol. 2020 Apr 23;65(8):08NT03. doi: 10.1088/1361-6560/ab7d13.
9
Deep Convolutional Neural Network for Ulcer Recognition in Wireless Capsule Endoscopy: Experimental Feasibility and Optimization.无线胶囊内窥镜中溃疡识别的深度卷积神经网络:实验可行性与优化。
Comput Math Methods Med. 2019 Sep 18;2019:7546215. doi: 10.1155/2019/7546215. eCollection 2019.
10
Detection, segmentation, and 3D pose estimation of surgical tools using convolutional neural networks and algebraic geometry.使用卷积神经网络和代数几何进行手术工具的检测、分割和三维姿态估计。
Med Image Anal. 2021 May;70:101994. doi: 10.1016/j.media.2021.101994. Epub 2021 Feb 7.