Zhang Dashan, Guo Jie, Lei Xiujun, Zhu Chang'an
Department of Precision Machinery and Precision Instrumentation, University of Science and Technology of China, Hefei, Anhui 230026, People's Republic of China.
Rev Sci Instrum. 2016 Aug;87(8):086111. doi: 10.1063/1.4961979.
This note reports an efficient singular value decomposition (SVD)-based vibration extraction approach that recovers sound information in silent high-speed video. A high-speed camera of which frame rates are in the range of 2 kHz-10 kHz is applied to film the vibrating objects. Sub-images cut from video frames are transformed into column vectors and then reconstructed to a new matrix. The SVD of the new matrix produces orthonormal image bases (OIBs) and image projections onto specific OIB can be recovered as understandable acoustical signals. Standard frequencies of 256 Hz and 512 Hz tuning forks are extracted offline from their vibrating surfaces and a 3.35 s speech signal is recovered online from a piece of paper that is stimulated by sound waves within 1 min.
本笔记报告了一种基于奇异值分解(SVD)的高效振动提取方法,该方法可在无声高速视频中恢复声音信息。使用帧率在2 kHz - 10 kHz范围内的高速摄像机拍摄振动物体。从视频帧中裁剪出的子图像被转换为列向量,然后重建为一个新矩阵。新矩阵的奇异值分解产生正交归一化图像基(OIBs),并且在特定OIB上的图像投影可以恢复为可理解的声学信号。256 Hz和512 Hz音叉的标准频率从其振动表面离线提取,并且在1分钟内从一张受声波刺激的纸张上在线恢复了3.35秒的语音信号。