即时现实：用于 3D 虚拟现实流的注视相关感知优化。

Instant Reality: Gaze-Contingent Perceptual Optimization for 3D Virtual Reality Streaming.

出版信息

IEEE Trans Vis Comput Graph. 2022 May;28(5):2157-2167. doi: 10.1109/TVCG.2022.3150522. Epub 2022 Apr 8.

DOI:10.1109/TVCG.2022.3150522

Abstract

Media streaming, with an edge-cloud setting, has been adopted for a variety of applications such as entertainment, visualization, and design. Unlike video/audio streaming where the content is usually consumed passively, virtual reality applications require 3D assets stored on the edge to facilitate frequent edge-side interactions such as object manipulation and viewpoint movement. Compared to audio and video streaming, 3D asset streaming often requires larger data sizes and yet lower latency to ensure sufficient rendering quality, resolution, and latency for perceptual comfort. Thus, streaming 3D assets faces remarkably additional than streaming audios/videos, and existing solutions often suffer from long loading time or limited quality. To address this challenge, we propose a perceptually-optimized progressive 3D streaming method for spatial quality and temporal consistency in immersive interactions. On the cloud-side, our main idea is to estimate perceptual importance in 2D image space based on user gaze behaviors, including where they are looking and how their eyes move. The estimated importance is then mapped to 3D object space for scheduling the streaming priorities for edge-side rendering. Since this computational pipeline could be heavy, we also develop a simple neural network to accelerate the cloud-side scheduling process. We evaluate our method via subjective studies and objective analysis under varying network conditions (from 3G to 5G) and edge devices (HMD and traditional displays), and demonstrate better visual quality and temporal consistency than alternative solutions.

摘要

边缘云环境下的媒体流传输已广泛应用于娱乐、可视化和设计等多种领域。与通常被动消费内容的视频/音频流不同，虚拟现实应用需要将存储在边缘上的 3D 资产用于频繁的边缘交互，如对象操作和视点移动。与音频和视频流相比，3D 资产流通常需要更大的数据量，但延迟要更低，以确保足够的渲染质量、分辨率和感知舒适度的延迟。因此，3D 资产流比音频/视频流面临更大的挑战，而现有的解决方案通常存在加载时间长或质量有限的问题。为了解决这个挑战，我们提出了一种基于感知的优化渐进式 3D 流传输方法，用于沉浸式交互中的空间质量和时间一致性。在云侧，我们的主要思想是根据用户的注视行为，包括他们正在看哪里以及眼睛如何移动，在 2D 图像空间中估计感知重要性。然后，将估计的重要性映射到 3D 对象空间，以安排边缘渲染的流优先级。由于这个计算过程可能很繁重，我们还开发了一个简单的神经网络来加速云侧的调度过程。我们通过主观研究和客观分析，在不同的网络条件（从 3G 到 5G）和边缘设备（头戴式显示器和传统显示器）下评估了我们的方法，并展示了比其他解决方案更好的视觉质量和时间一致性。

相似文献

Instant Reality: Gaze-Contingent Perceptual Optimization for 3D Virtual Reality Streaming.即时现实：用于 3D 虚拟现实流的注视相关感知优化。

IEEE Trans Vis Comput Graph. 2022 May;28(5):2157-2167. doi: 10.1109/TVCG.2022.3150522. Epub 2022 Apr 8.

FoV-NeRF: Foveated Neural Radiance Fields for Virtual Reality.FoV-NeRF：用于虚拟现实的注视点神经辐射场。

IEEE Trans Vis Comput Graph. 2022 Nov;28(11):3854-3864. doi: 10.1109/TVCG.2022.3203102. Epub 2022 Oct 21.

Virtual Reality Telepresence: 360-Degree Video Streaming with Edge-Compute Assisted Static Foveated Compression.虚拟现实远程呈现：基于边缘计算辅助静态中心凹压缩的360度视频流

IEEE Trans Vis Comput Graph. 2023 Nov;29(11):4525-4534. doi: 10.1109/TVCG.2023.3320255. Epub 2023 Nov 2.

LiveObj: Object Semantics-based Viewport Prediction for Live Mobile Virtual Reality Streaming.基于对象语义的移动实时虚拟现实直播视口预测

IEEE Trans Vis Comput Graph. 2021 May;27(5):2736-2745. doi: 10.1109/TVCG.2021.3067686. Epub 2021 Apr 15.

Live Semantic 3D Perception for Immersive Augmented Reality.沉浸式增强现实的实时语义三维感知。

IEEE Trans Vis Comput Graph. 2020 May;26(5):2012-2022. doi: 10.1109/TVCG.2020.2973477. Epub 2020 Feb 13.

Leveraging Human Visual Perception for an Optimized Virtual Reality Experience.利用人类视觉感知优化虚拟现实体验。

IEEE Comput Graph Appl. 2021 Nov-Dec;41(6):164-170. doi: 10.1109/MCG.2021.3113392.

Learning Dynamic Textures for Neural Rendering of Human Actors.学习动态纹理，用于人类演员的神经渲染。

IEEE Trans Vis Comput Graph. 2021 Oct;27(10):4009-4022. doi: 10.1109/TVCG.2020.2996594. Epub 2021 Sep 1.

SGaze: A Data-Driven Eye-Head Coordination Model for Realtime Gaze Prediction.SGaze：用于实时眼-头协调预测的基于数据的眼-头协调模型。

IEEE Trans Vis Comput Graph. 2019 May;25(5):2002-2010. doi: 10.1109/TVCG.2019.2899187. Epub 2019 Feb 18.

Construction and 3D Simulation of Virtual Animation Instant Network Communication System Based on Convolution Neural Networks.基于卷积神经网络的虚拟动画即时网络通信系统的构建与三维仿真。

Comput Intell Neurosci. 2021 Aug 28;2021:7277733. doi: 10.1155/2021/7277733. eCollection 2021.

Viewport-Adaptive Scalable Multi-User Virtual Reality Mobile-Edge Streaming.视口自适应可扩展多用户虚拟现实移动边缘流

IEEE Trans Image Process. 2020 May 5. doi: 10.1109/TIP.2020.2986547.

引用本文的文献

Spatiotemporal image quality of virtual reality head mounted displays.虚拟现实头戴式显示器的时空图像质量。

Sci Rep. 2022 Nov 24;12(1):20235. doi: 10.1038/s41598-022-24345-9.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

即时现实：用于 3D 虚拟现实流的注视相关感知优化。

Instant Reality: Gaze-Contingent Perceptual Optimization for 3D Virtual Reality Streaming.

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献