通过深度声学分析实现场景感知音频渲染

Scene-Aware Audio Rendering via Deep Acoustic Analysis.

作者信息

Tang Zhenyu, Bryan Nicholas J, Li Dingzeyu, Langlois Timothy R, Manocha Dinesh

出版信息

IEEE Trans Vis Comput Graph. 2020 May;26(5):1991-2001. doi: 10.1109/TVCG.2020.2973058. Epub 2020 Feb 13.

DOI:10.1109/TVCG.2020.2973058

PMID:32070967

Abstract

We present a new method to capture the acoustic characteristics of real-world rooms using commodity devices, and use the captured characteristics to generate similar sounding sources with virtual models. Given the captured audio and an approximate geometric model of a real-world room, we present a novel learning-based method to estimate its acoustic material properties. Our approach is based on deep neural networks that estimate the reverberation time and equalization of the room from recorded audio. These estimates are used to compute material properties related to room reverberation using a novel material optimization objective. We use the estimated acoustic material characteristics for audio rendering using interactive geometric sound propagation and highlight the performance on many real-world scenarios. We also perform a user study to evaluate the perceptual similarity between the recorded sounds and our rendered audio.

摘要

我们提出了一种使用商用设备捕捉真实世界房间声学特性的新方法，并利用捕捉到的特性通过虚拟模型生成具有相似音效的声源。给定捕捉到的音频和真实世界房间的近似几何模型，我们提出了一种基于学习的新颖方法来估计其声学材料属性。我们的方法基于深度神经网络，该网络从录制的音频中估计房间的混响时间和均衡。这些估计值用于通过一种新颖的材料优化目标来计算与房间混响相关的材料属性。我们将估计出的声学材料特性用于使用交互式几何声音传播的音频渲染，并突出了在许多真实世界场景中的性能。我们还进行了一项用户研究，以评估录制声音与我们渲染音频之间的感知相似度。

相似文献

Scene-Aware Audio Rendering via Deep Acoustic Analysis.通过深度声学分析实现场景感知音频渲染

IEEE Trans Vis Comput Graph. 2020 May;26(5):1991-2001. doi: 10.1109/TVCG.2020.2973058. Epub 2020 Feb 13.

Acoustic Classification and Optimization for Multi-Modal Rendering of Real-World Scenes.真实场景的多模态渲染的声学分类与优化。

IEEE Trans Vis Comput Graph. 2018 Mar;24(3):1246-1259. doi: 10.1109/TVCG.2017.2666150. Epub 2017 Feb 9.

Aural proxies and directionally-varying reverberation for interactive sound propagation in virtual environments.虚拟环境中交互式声音传播的听觉代理和方向变化混响。

IEEE Trans Vis Comput Graph. 2013 Apr;19(4):567-75. doi: 10.1109/TVCG.2013.27.

SynCoPation: Interactive Synthesis-Coupled Sound Propagation.同步拍频：交互式合成耦合声音传播

IEEE Trans Vis Comput Graph. 2016 Apr;22(4):1346-55. doi: 10.1109/TVCG.2016.2518421. Epub 2016 Jan 18.

Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation.基于深度学习的音频数据增强混响环境估计。

Sensors (Basel). 2022 Jan 13;22(2):592. doi: 10.3390/s22020592.

Statistics of natural reverberation enable perceptual separation of sound and space.自然混响的统计特性有助于实现声音与空间的感知分离。

Proc Natl Acad Sci U S A. 2016 Nov 29;113(48):E7856-E7865. doi: 10.1073/pnas.1612524113. Epub 2016 Nov 10.

Co-Immersion in Audio Augmented Virtuality: The Case Study of a Static and Approximated Late Reverberation Algorithm.音频增强虚拟现实中的共同沉浸：一种静态近似后期混响算法的案例研究

IEEE Trans Vis Comput Graph. 2023 Nov;29(11):4472-4482. doi: 10.1109/TVCG.2023.3320213. Epub 2023 Nov 2.

WAVE: Interactive Wave-based Sound Propagation for Virtual Environments.WAVE：用于虚拟环境的基于波的交互式声音传播

IEEE Trans Vis Comput Graph. 2015 Apr;21(4):434-42. doi: 10.1109/TVCG.2015.2391858.

Sound propagation in realistic interactive 3D scenes with parameterized sources using deep neural operators.使用深度神经算子在具有参数化声源的逼真交互式3D场景中的声音传播。

Proc Natl Acad Sci U S A. 2024 Jan 9;121(2):e2312159120. doi: 10.1073/pnas.2312159120. Epub 2024 Jan 4.

Towards End-to-End Acoustic Localization Using Deep Learning: From Audio Signals to Source Position Coordinates.端到端声学定位的深度学习方法：从音频信号到声源坐标。

Sensors (Basel). 2018 Oct 12;18(10):3418. doi: 10.3390/s18103418.

引用本文的文献

Deep Learning-Based Estimation of Reverberant Environment for Audio Data Augmentation.基于深度学习的音频数据增强混响环境估计。

Sensors (Basel). 2022 Jan 13;22(2):592. doi: 10.3390/s22020592.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过深度声学分析实现场景感知音频渲染

Scene-Aware Audio Rendering via Deep Acoustic Analysis.

作者信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献