IEEE J Biomed Health Inform. 2019 Jan;23(1):314-323. doi: 10.1109/JBHI.2018.2808281. Epub 2018 Feb 20.
Fall detection is an important public healthcare problem. Timely detection could enable instant delivery of medical service to the injured. A popular nonintrusive solution for fall detection is based on videos obtained through ambient camera, and the corresponding methods usually require a large dataset to train a classifier and are inclined to be influenced by the image quality. However, it is hard to collect fall data and instead simulated falls are recorded to construct the training dataset, which is restricted to limited quantity. To address these problems, a three-dimensional convolutional neural network (3-D CNN) based method for fall detection is developed, which only uses video kinematic data to train an automatic feature extractor and could circumvent the requirement for large fall dataset of deep learning solution. 2-D CNN could only encode spatial information, and the employed 3-D convolution could extract motion feature from temporal sequence, which is important for fall detection. To further locate the region of interest in each frame, a long short-term memory (LSTM) based spatial visual attention scheme is incorporated. Sports dataset Sports-1 M with no fall examples is employed to train the 3-D CNN, which is then combined with LSTM to train a classifier with fall dataset. Experiments have verified the proposed scheme on fall detection benchmark with high accuracy as 100%. Superior performance has also been obtained on other activity databases.
跌倒检测是一个重要的公共卫生保健问题。及时检测可以使受伤者立即得到医疗服务。一种流行的非侵入性跌倒检测解决方案是基于通过环境摄像机获得的视频,并且相应的方法通常需要一个大型数据集来训练分类器,并且容易受到图像质量的影响。然而,收集跌倒数据很困难,而是记录模拟跌倒以构建训练数据集,这受到限制。为了解决这些问题,开发了一种基于三维卷积神经网络(3-D CNN)的跌倒检测方法,该方法仅使用视频运动数据来训练自动特征提取器,可以避免深度学习解决方案对大量跌倒数据的需求。2-D CNN 只能编码空间信息,而所采用的 3-D 卷积可以从时间序列中提取运动特征,这对于跌倒检测很重要。为了进一步定位每帧中的感兴趣区域,采用了基于长短期记忆(LSTM)的空间视觉注意力方案。利用没有跌倒示例的 Sports-1 M 运动数据集来训练 3-D CNN,然后将其与 LSTM 结合,使用跌倒数据集训练分类器。实验在具有 100%高精度的跌倒检测基准上验证了所提出的方案。在其他活动数据库上也取得了优异的性能。