Zhang Yongshuo, Zhang Guangyuan, Li Kefeng, Zhu Zhenfang, Wang Peng, Wang Zhenfei, Fu Chen, Li Xiaotong, Fan Zhiming, Zhao Yongpeng
School of Information Science and Electrical Engineering, Shandong Jiaotong University, Jinan, Shandong, China.
Shandong Zhengyuan Yeda Environmental Technology Co., Ltd, Jinan, Shandong, China.
PLoS One. 2025 May 12;20(5):e0321878. doi: 10.1371/journal.pone.0321878. eCollection 2025.
To address the challenges of significant detail loss in Neural Radiance Fields (NeRF) under sparse-view input conditions, this paper proposes the DASNeRF framework. DASNeRF aims to generate high-detail novel views from a limited number of input viewpoints. To address the limitations of few-shot NeRF, including insufficient depth information and detail loss, DASNeRF introduces accurate depth priors and employs a depth constraint strategy combining relative depth ordering fidelity regularization and depth structural consistency regularization. These methods ensure reconstruction accuracy even with sparse input views. The depth priors provide high-quality depth data through a more accurate monocular depth estimation model, enhancing the reconstruction capability and stability of the model. The depth ordering fidelity regularization guides the network to learn relative relationships using local depth ranking priors, reducing blurring caused by inaccurate depth estimation. Depth structural consistency regularization maintains global depth consistency by enforcing continuity across neighboring depth pixels. These depth constraint strategies enhance DASNeRF's performance in complex scenes, making 3D reconstruction under sparse views more accurate and natural. In addition, we utilize a three-layer optimal sampling strategy, consisting of coarse sampling, optimized sampling, and fine sampling during the three-layer sampling process to better capture details in key regions. In the optimized sampling phase, the sampling point density in key regions is adaptively increased while reducing sampling in low-priority regions, enhancing detail capture accuracy. To alleviate overfitting, we proposed an MLP structure with per-layer input fusion. This design preserves the model's detail perception ability while effectively avoids overfitting. Specifically, each layer's input includes the output features from the previous layer and incorporates processed five-dimensional information, further enhancing fine detail reconstruction. Experimental results show that DASNeRF outperforms state-of-the-art methods on the LLFF and DTU dataset, achieving better performance in metrics such as PSNR, SSIM, and LPIPS. The reconstructed details and visual quality are significantly improved, demonstrating DASNeRF's potential in 3D reconstruction under sparse-view conditions.
为了解决神经辐射场(NeRF)在稀疏视图输入条件下显著细节丢失的挑战,本文提出了DASNeRF框架。DASNeRF旨在从有限数量的输入视角生成高细节的新视图。为了解决少样本NeRF的局限性,包括深度信息不足和细节丢失,DASNeRF引入了精确的深度先验,并采用了一种深度约束策略,该策略结合了相对深度排序保真度正则化和深度结构一致性正则化。即使在输入视图稀疏的情况下,这些方法也能确保重建精度。深度先验通过更精确的单目深度估计模型提供高质量的深度数据,增强了模型的重建能力和稳定性。深度排序保真度正则化引导网络使用局部深度排序先验来学习相对关系,减少因深度估计不准确而导致的模糊。深度结构一致性正则化通过强制相邻深度像素之间的连续性来保持全局深度一致性。这些深度约束策略提高了DASNeRF在复杂场景中的性能,使稀疏视图下的3D重建更加准确和自然。此外,我们在三层采样过程中采用了由粗采样、优化采样和精细采样组成的三层最优采样策略,以更好地捕捉关键区域的细节。在优化采样阶段,自适应增加关键区域的采样点密度,同时减少低优先级区域的采样,提高细节捕捉精度。为了缓解过拟合问题,我们提出了一种具有每层输入融合的MLP结构。这种设计在有效避免过拟合的同时保留了模型的细节感知能力。具体来说,每层的输入包括前一层的输出特征,并融入经过处理的五维信息,进一步增强精细细节重建。实验结果表明,DASNeRF在LLFF和DTU数据集上优于现有方法,在PSNR、SSIM和LPIPS等指标上取得了更好的性能。重建的细节和视觉质量得到了显著改善,证明了DASNeRF在稀疏视图条件下3D重建中的潜力。