Suppr超能文献

利用深度引导神经辐射场和改进的深度补全增强视图合成

Enhancing View Synthesis with Depth-Guided Neural Radiance Fields and Improved Depth Completion.

作者信息

Wang Bojun, Zhang Danhong, Su Yixin, Zhang Huajun

机构信息

School of Automation, Wuhan University of Technology, Wuhan 430070, China.

出版信息

Sensors (Basel). 2024 Mar 16;24(6):1919. doi: 10.3390/s24061919.

Abstract

Neural radiance fields (NeRFs) leverage a neural representation to encode scenes, obtaining photorealistic rendering of novel views. However, NeRF has notable limitations. A significant drawback is that it does not capture surface geometry and only renders the object surface colors. Furthermore, the training of NeRF is exceedingly time-consuming. We propose Depth-NeRF as a solution to these issues. Specifically, our approach employs a fast depth completion algorithm to denoise and complete the depth maps generated by RGB-D cameras. These improved depth maps guide the sampling points of NeRF to be distributed closer to the scene's surface, benefiting from dense depth information. Furthermore, we have optimized the network structure of NeRF and integrated depth information to constrain the optimization process, ensuring that the termination distribution of the ray is consistent with the scene's geometry. Compared to NeRF, our method accelerates the training speed by 18%, and the rendered images achieve a higher PSNR than those obtained by mainstream methods. Additionally, there is a significant reduction in RMSE between the rendered scene depth and the ground truth depth, which indicates that our method can better capture the geometric information of the scene. With these improvements, we can train the NeRF model more efficiently and achieve more accurate rendering results.

摘要

神经辐射场(NeRFs)利用神经表示来编码场景,从而获得逼真的新视角渲染效果。然而,NeRF存在显著局限性。一个主要缺点是它无法捕捉表面几何信息,只能渲染物体表面颜色。此外,NeRF的训练极其耗时。我们提出深度NeRF作为解决这些问题的方案。具体而言,我们的方法采用一种快速深度补全算法对RGB-D相机生成的深度图进行去噪和补全。这些改进后的深度图引导NeRF的采样点更靠近场景表面分布,受益于密集的深度信息。此外,我们优化了NeRF的网络结构并整合深度信息来约束优化过程,确保光线的终止分布与场景几何形状一致。与NeRF相比,我们的方法将训练速度提高了18%,渲染图像的PSNR比主流方法获得的图像更高。此外,渲染场景深度与地面真值深度之间的均方根误差(RMSE)显著降低,这表明我们的方法能够更好地捕捉场景的几何信息。通过这些改进,我们可以更高效地训练NeRF模型并获得更准确的渲染结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b14e/10974786/d1bf88c2d24a/sensors-24-01919-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验