Zhang Hong, Li Lu, Jia Wenyan, Fernstrom John D, Sclabassi Robert J, Mao Zhi-Hong, Sun Mingui
Image Processing Center, Beihang University, Beijing 100191, China.
Neurocomputing (Amst). 2011 Jun 1;74(12-13):2184-2192. doi: 10.1016/j.neucom.2011.02.014.
A new technique to extract and evaluate physical activity patterns from image sequences captured by a wearable camera is presented in this paper. Unlike standard activity recognition schemes, the video data captured by our device do not include the wearer him/herself. The physical activity of the wearer, such as walking or exercising, is analyzed indirectly through the camera motion extracted from the acquired video frames. Two key tasks, pixel correspondence identification and motion feature extraction, are studied to recognize activity patterns. We utilize a multiscale approach to identify pixel correspondences. When compared with the existing methods such as the Good Features detector and the Speed-up Robust Feature (SURF) detector, our technique is more accurate and computationally efficient. Once the pixel correspondences are determined which define representative motion vectors, we build a set of activity pattern features based on motion statistics in each frame. Finally, the physical activity of the person wearing a camera is determined according to the global motion distribution in the video. Our algorithms are tested using different machine learning techniques such as the K-Nearest Neighbor (KNN), Naive Bayesian and Support Vector Machine (SVM). The results show that many types of physical activities can be recognized from field acquired real-world video. Our results also indicate that, with a design of specific motion features in the input vectors, different classifiers can be used successfully with similar performances.
本文提出了一种从可穿戴相机捕获的图像序列中提取和评估身体活动模式的新技术。与标准活动识别方案不同,我们设备捕获的视频数据不包括佩戴者本人。通过从采集的视频帧中提取的相机运动来间接分析佩戴者的身体活动,如行走或锻炼。研究了两项关键任务,即像素对应识别和运动特征提取,以识别活动模式。我们采用多尺度方法来识别像素对应。与现有方法如良好特征检测器和加速鲁棒特征(SURF)检测器相比,我们的技术更准确且计算效率更高。一旦确定了定义代表性运动向量的像素对应,我们就基于每一帧中的运动统计构建一组活动模式特征。最后,根据视频中的全局运动分布确定佩戴相机者的身体活动。我们的算法使用不同的机器学习技术进行测试,如K近邻(KNN)、朴素贝叶斯和支持向量机(SVM)。结果表明,可以从现场采集的真实世界视频中识别出多种类型的身体活动。我们的结果还表明,通过在输入向量中设计特定的运动特征,不同的分类器可以成功使用且性能相似。