Gotardo Paulo F U, Martinez Aleix M
Department of Electrical and Computer Engineering, The Ohio State University, Columbus (OH), USA.
Proc IEEE Int Conf Comput Vis. 2011:802-809. doi: 10.1109/ICCV.2011.6126319.
Non-rigid structure from motion (NRSFM) is a difficult, underconstrained problem in computer vision. The standard approach in NRSFM constrains 3D shape deformation using a linear combination of K basis shapes; the solution is then obtained as the low-rank factorization of an input observation matrix. An important but overlooked problem with this approach is that non-linear deformations are often observed; these deformations lead to a weakened low-rank constraint due to the need to use additional basis shapes to linearly model points that move along curves. Here, we demonstrate how the kernel trick can be applied in standard NRSFM. As a result, we model complex, deformable 3D shapes as the outputs of a non-linear mapping whose inputs are points within a low-dimensional shape space. This approach is flexible and can use different kernels to build different non-linear models. Using the kernel trick, our model complements the low-rank constraint by capturing non-linear relationships in the shape coefficients of the linear model. The net effect can be seen as using non-linear dimensionality reduction to further compress the (shape) space of possible solutions.
非刚性运动结构(NRSFM)是计算机视觉中一个困难且约束不足的问题。NRSFM的标准方法使用K个基形状的线性组合来约束3D形状变形;然后通过对输入观测矩阵进行低秩分解来获得解决方案。这种方法存在一个重要但被忽视的问题,即经常会观察到非线性变形;由于需要使用额外的基形状来线性建模沿曲线移动的点,这些变形会导致低秩约束减弱。在此,我们展示了核技巧如何应用于标准的NRSFM。结果,我们将复杂的、可变形的3D形状建模为一个非线性映射的输出,其输入是低维形状空间内的点。这种方法很灵活,可以使用不同的核来构建不同的非线性模型。通过使用核技巧,我们的模型通过捕捉线性模型形状系数中的非线性关系来补充低秩约束。最终效果可以看作是使用非线性降维来进一步压缩可能解的(形状)空间。