Lannan Nate, Zhou L E, Fan Guoliang
School of Electrical and Computer Engineering, Oklahoma State University, Stillwater, OK 74078, USA.
IEEE Access. 2022;10:29233-29251. doi: 10.1109/access.2022.3157605. Epub 2022 Mar 8.
We present a novel approach to enhance the quality of human motion data collected by low-cost depth sensors, namely D-Mocap, which suffers from low accuracy and poor stability due to occlusion, interference, and algorithmic limitations. Our approach takes advantage of a large set of high-quality and diverse Mocap data by learning a general motion manifold via the convolutional autoencoder. In addition, the Tobit Kalman filter (TKF) is used to capture the kinematics of each body joint and handle censored measurement distribution. The TKF is incorporated with the autoencoder via latent space optimization, maintaining adherence to the motion manifold while preserving the kinematic nature of the original motion data. Furthermore, due to the lack of an open source benchmark dataset for this research, we have developed an extension of the Berkeley Multimodal Human Action Database (MHAD) by generating D-Mocap data from RGB-D images. The newly extended MHAD dataset is skeleton-matched and time-synced to the corresponding Mocap data and is publicly available. Along with simulated D-Mocap data generated from the CMU Mocap dataset and our self-collected D-Mocap dataset, the proposed algorithm is thoroughly evaluated and compared with different settings. Experimental results show that our approach can improve the accuracy of joint positions and angles as well as skeletal bone lengths by over 50%.
我们提出了一种新颖的方法来提高由低成本深度传感器收集的人体运动数据的质量,即D-Mocap,由于遮挡、干扰和算法限制,该传感器存在精度低和稳定性差的问题。我们的方法通过卷积自动编码器学习通用运动流形,利用大量高质量且多样的运动捕捉(Mocap)数据。此外,托比特卡尔曼滤波器(TKF)用于捕捉每个身体关节的运动学并处理删失测量分布。TKF通过潜在空间优化与自动编码器相结合,在保持原始运动数据运动学特性的同时,保持对运动流形的遵循。此外,由于本研究缺乏开源基准数据集,我们通过从RGB-D图像生成D-Mocap数据,开发了伯克利多模态人类动作数据库(MHAD)的扩展版本。新扩展的MHAD数据集与相应的Mocap数据进行了骨骼匹配和时间同步,并且是公开可用的。连同从CMU Mocap数据集生成的模拟D-Mocap数据以及我们自己收集的D-Mocap数据集,对所提出的算法进行了全面评估,并在不同设置下进行了比较。实验结果表明,我们的方法可以将关节位置、角度以及骨骼长度的精度提高50%以上。