

BSI-MVS: multi-view stereo network with bidirectional semantic information.

Authors

Jia Ruiming, Yu Jun, Hu Zhenghui, Yuan Fei

Affiliations

School of Information Science and Technology, North China University of Technology, Beijing, 100144, China.

Hangzhou Innovation Institute, Beihang University, Hangzhou, 310051, China.

Publication information

Sci Rep. 2024 Mar 21;14(1):6766. doi: 10.1038/s41598-024-55612-6.

Abstract

The basic principle of multi-view stereo (MVS) is to perform 3D reconstruction by extracting depth information from multiple views. Most current state-of-the-art (SOTA) MVS networks are based on the Vision Transformer, which usually entails high computational cost. To reduce computational complexity and improve depth map accuracy, we propose an MVS network with Bidirectional Semantic Information (BSI-MVS). First, we design a Multi-Level Spatial Pyramid module that generates feature maps at multiple levels to extract multi-scale information. Then, we propose a 2D Bidirectional-LSTM module that captures bidirectional semantic information at different time steps in the horizontal and vertical directions; this information contains abundant depth information. Finally, cost volumes are built from the feature maps at the various levels to optimize the final depth map. We evaluate the network on the DTU and BlendedMVS datasets. The results show that, in terms of overall metrics, our network surpasses TransMVSNet, CasMVSNet, CVP-MVSNet, and AACVP-MVSNet by 17.84%, 36.42%, 14.96%, and 4.86%, respectively, with noticeable improvements in both objective metrics and visualizations.
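
The abstract does not say how the 2D Bidirectional-LSTM module is wired, so the following is only a minimal sketch of one plausible reading: each row and each column of a feature map is treated as a sequence, an LSTM scans it in both directions, and the four hidden states are concatenated per pixel. The names LSTMCell, bidirectional, bilstm_2d, and make_cells, the random weights, and the concatenation scheme are all illustrative assumptions, not the authors' implementation.

```python
import math
import random


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


class LSTMCell:
    """Standard LSTM cell on plain Python lists (hypothetical helper)."""
    GATES = ("input", "forget", "output", "candidate")

    def __init__(self, in_dim, hid_dim, rng):
        self.hid_dim = hid_dim
        n = in_dim + hid_dim
        # One weight matrix and one bias vector per gate (random, untrained).
        self.W = {g: [[rng.uniform(-0.1, 0.1) for _ in range(n)]
                      for _ in range(hid_dim)] for g in self.GATES}
        self.b = {g: [0.0] * hid_dim for g in self.GATES}

    def _affine(self, gate, z):
        return [sum(w * x for w, x in zip(row, z)) + b
                for row, b in zip(self.W[gate], self.b[gate])]

    def step(self, x, h, c):
        z = list(x) + list(h)  # concatenate input and previous hidden state
        i = [sigmoid(v) for v in self._affine("input", z)]
        f = [sigmoid(v) for v in self._affine("forget", z)]
        o = [sigmoid(v) for v in self._affine("output", z)]
        g = [math.tanh(v) for v in self._affine("candidate", z)]
        c_new = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
        h_new = [ok * math.tanh(ck) for ok, ck in zip(o, c_new)]
        return h_new, c_new


def run_sequence(cell, seq):
    """Run the cell over a sequence and return the hidden state at each step."""
    h, c = [0.0] * cell.hid_dim, [0.0] * cell.hid_dim
    out = []
    for x in seq:
        h, c = cell.step(x, h, c)
        out.append(h)
    return out


def bidirectional(cell_fwd, cell_bwd, seq):
    """Concatenate forward and backward hidden states at each position."""
    fwd = run_sequence(cell_fwd, seq)
    bwd = run_sequence(cell_bwd, seq[::-1])[::-1]
    return [a + b for a, b in zip(fwd, bwd)]


def bilstm_2d(feat, cells):
    """feat[y][x] is a list of C floats; returns out[y][x] with 4 * hid_dim floats."""
    height, width = len(feat), len(feat[0])
    # Horizontal pass: every row is a sequence of pixels.
    rows = [bidirectional(cells["left_to_right"], cells["right_to_left"], feat[y])
            for y in range(height)]
    # Vertical pass: every column is a sequence of pixels.
    cols = [bidirectional(cells["top_to_bottom"], cells["bottom_to_top"],
                          [feat[y][x] for y in range(height)])
            for x in range(width)]
    return [[rows[y][x] + cols[x][y] for x in range(width)] for y in range(height)]


def make_cells(in_dim, hid_dim, seed=0):
    """Create the four directional cells with reproducible random weights."""
    rng = random.Random(seed)
    names = ("left_to_right", "right_to_left", "top_to_bottom", "bottom_to_top")
    return {name: LSTMCell(in_dim, hid_dim, rng) for name in names}


if __name__ == "__main__":
    rng = random.Random(1)
    # Toy feature map: height 3, width 4, 2 channels per pixel.
    feat = [[[rng.random() for _ in range(2)] for _ in range(4)] for _ in range(3)]
    out = bilstm_2d(feat, make_cells(in_dim=2, hid_dim=3))
    print(len(out), len(out[0]), len(out[0][0]))  # 3 4 12
```

In this sketch, each output pixel combines context from everything to its left, right, above, and below, which is one way a recurrent scan can gather image-wide context without attention. A practical version would use a tensor library and trained weights rather than nested lists.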

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a2b/10958035/17b2f69172ef/41598_2024_55612_Fig1_HTML.jpg
