IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):2801-2815. doi: 10.1109/TPAMI.2022.3183288. Epub 2023 Feb 3.
Gait recognition plays a special role in visual surveillance due to its unique advantage, e.g., long-distance, cross-view and non-cooperative recognition. However, it has not yet been widely applied. One reason for this awkwardness is the lack of a truly big dataset captured in practical outdoor scenarios. Here, the "big" at least means: (1) huge amount of gait videos; (2) sufficient subjects; (3) rich attributes; and (4) spatial and temporal variations. Moreover, most existing large-scale gait datasets are collected indoors, which have few challenges from real scenes, such as the dynamic and complex background clutters, illumination variations, vertical view variations, etc. In this article, we introduce a newly built big outdoor gait dataset, called CASIA-E. It contains more than one thousand people distributed over near one million videos. Each person involves 26 view angles and varied appearances caused by changes of bag carrying, dressing and walking styles. The videos are captured across five months and across three kinds of outdoor scenes. Soft biometric features are also recorded for all subjects including age, gender, height, weight, and nationality. Besides, we report an experimental benchmark and examine some meaningful problems that have not been well studied previously, e.g., the influence of million-level training videos, vertical view angles, walking styles, and the thermal infrared modality. We believe that such a big outdoor dataset and the experimental benchmark will promote the development of gait recognition in both academic research and industrial applications.
步态识别由于其独特的优势,例如远距离、跨视角和非合作识别,在视觉监控中起着特殊的作用。然而,它尚未得到广泛应用。造成这种尴尬局面的一个原因是缺乏真正在实际户外场景中捕获的大型数据集。这里的“大”至少意味着:(1)大量的步态视频;(2)足够多的对象;(3)丰富的属性;以及(4)空间和时间变化。此外,大多数现有的大规模步态数据集都是在室内收集的,这些数据集很少受到真实场景的挑战,例如动态和复杂的背景杂波、光照变化、垂直视角变化等。在本文中,我们介绍了一个新建立的大型户外步态数据集,称为 CASIA-E。它包含超过一千人分布在近百万个视频中。每个人涉及 26 个视角和由于携带包、穿着和行走方式的变化而引起的不同外观。这些视频是在五个月内,在三种不同的户外场景下拍摄的。还为所有受试者记录了软生物特征,包括年龄、性别、身高、体重和国籍。此外,我们报告了一个实验基准,并研究了一些以前没有得到很好研究的有意义的问题,例如百万级训练视频、垂直视角、行走方式和热红外模态的影响。我们相信,这样一个大型的户外数据集和实验基准将促进步态识别在学术研究和工业应用中的发展。