MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field.

Authors

Yang Zijiang, Qiu Zhongwei, Xu Chang, Fu Dongmei

Publication information

IEEE Trans Vis Comput Graph. 2025 Sep;31(9):5842-5853. doi: 10.1109/TVCG.2024.3476331.

DOI:10.1109/TVCG.2024.3476331
PMID:39378248
Abstract

3D style transfer aims to generate stylized views of a 3D scene in a specified style, which requires both high-quality generation and multi-view consistency. Existing methods still struggle with high-quality stylization that preserves texture details and with stylization under multimodal guidance. In this paper, we reveal that the common training method for stylization with NeRF, which generates stylized multi-view supervision using 2D style transfer models, causes the same object in the supervision to appear in different states (color tone, details, etc.) across views. This leads NeRF to smooth out texture details and results in low-quality rendering for 3D multi-style transfer. To tackle these problems, we propose a novel Multimodal-guided 3D Multi-style transfer of NeRF, termed MM-NeRF. First, MM-NeRF projects multimodal guidance into a unified space to keep multimodal styles consistent and extracts multimodal features to guide the 3D stylization. Second, a novel multi-head learning scheme is proposed to ease the difficulty of learning multi-style transfer, and a multi-view style consistent loss is proposed to track the inconsistency of multi-view supervision data. Finally, a novel incremental learning mechanism is proposed to generalize MM-NeRF to any new style at small cost. Extensive experiments on several real-world datasets show that MM-NeRF achieves high-quality 3D multi-style stylization with multimodal guidance and maintains both multi-view consistency and style consistency across multimodal guidance.
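The abstract's claim that view-inconsistent supervision pushes NeRF toward smoothed textures can be illustrated with a toy example. The sketch below is not the MM-NeRF method and uses no NeRF at all: it assumes a 1D stripe pattern as a stand-in for a stylized texture, assumes each view's 2D stylization shifts the stripes by a different phase, and relies only on the fact that the prediction minimizing squared error over several targets is their mean. All names (`stripe_texture`, `l2_optimal_prediction`, `contrast`) are hypothetical helpers introduced for illustration, and the phase shifts are chosen to be deliberately extreme.

```python
import math


def stripe_texture(n_pixels, shift, period=8):
    """Toy 1D 'stylized texture': sine stripes whose phase depends on the view."""
    return [math.sin(2 * math.pi * (i + shift) / period) for i in range(n_pixels)]


def l2_optimal_prediction(views):
    """Per-pixel minimizer of summed squared error over all views (the mean)."""
    n_views = len(views)
    n_pixels = len(views[0])
    return [sum(view[i] for view in views) / n_views for i in range(n_pixels)]


def contrast(signal):
    """Peak-to-peak amplitude, a crude proxy for how much texture detail survives."""
    return max(signal) - min(signal)


# Consistent supervision: all 8 views agree on where the stripes are.
consistent_views = [stripe_texture(64, shift=0) for _ in range(8)]

# Inconsistent supervision (assumed, extreme case): every view places the
# stripes at a different phase, as if each view had been stylized independently.
inconsistent_views = [stripe_texture(64, shift=s) for s in range(8)]

consistent_fit = l2_optimal_prediction(consistent_views)
inconsistent_fit = l2_optimal_prediction(inconsistent_views)

print("contrast with consistent supervision:  ", contrast(consistent_fit))
print("contrast with inconsistent supervision:", contrast(inconsistent_fit))
```

With consistent targets the fitted signal keeps the full stripe contrast, while the phase-shifted targets average out to an almost flat signal. Real stylized views disagree far less systematically than this, so the effect in practice would be a partial loss of detail rather than complete cancellation.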

Similar articles

1
MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field.
IEEE Trans Vis Comput Graph. 2025 Sep;31(9):5842-5853. doi: 10.1109/TVCG.2024.3476331.
2
Super-NeRF: View-Consistent Detail Generation for NeRF Super-Resolution.
IEEE Trans Vis Comput Graph. 2025 Sep;31(9):6053-6066. doi: 10.1109/TVCG.2024.3490840.
3
UC-NeRF: Uncertainty-Aware Conditional Neural Radiance Fields From Endoscopic Sparse Views.
IEEE Trans Med Imaging. 2025 Mar;44(3):1284-1296. doi: 10.1109/TMI.2024.3496558. Epub 2025 Mar 17.
4
MIS-NeRF: neural radiance fields in minimally-invasive surgery.
Int J Comput Assist Radiol Surg. 2025 Jul;20(7):1481-1490. doi: 10.1007/s11548-025-03429-7. Epub 2025 May 25.
5
Surgical neural radiance fields from one image.
Int J Comput Assist Radiol Surg. 2025 Jun 19. doi: 10.1007/s11548-025-03447-5.
6
NeRF-Art: Text-Driven Neural Radiance Fields Stylization.
IEEE Trans Vis Comput Graph. 2024 Aug;30(8):4983-4996. doi: 10.1109/TVCG.2023.3283400. Epub 2024 Jul 1.
7
MPS-NeRF: Generalizable 3D Human Rendering From Multiview Images.
IEEE Trans Pattern Anal Mach Intell. 2025 Aug;47(8):6110-6121. doi: 10.1109/TPAMI.2022.3205910.
8
UPST-NeRF: Universal Photorealistic Style Transfer of Neural Radiance Fields for 3D Scene.
IEEE Trans Vis Comput Graph. 2025 Apr;31(4):2045-2057. doi: 10.1109/TVCG.2024.3378692. Epub 2025 Feb 27.
9
Non-orthogonal kV imaging guided patient position verification in non-coplanar radiation therapy with dataset-free implicit neural representation.
Med Phys. 2025 May 19. doi: 10.1002/mp.17885.
10
Oral morphine for cancer pain.
Cochrane Database Syst Rev. 2016 Apr 22;4(4):CD003868. doi: 10.1002/14651858.CD003868.pub4.