Zago Matteo, Luzzago Matteo, Marangoni Tommaso, De Cecco Mariolino, Tarabini Marco, Galli Manuela
Department of Electronics, Information and Bioengineering, Polytechnic of Milan, Milan, Italy.
Department of Mechanical Engineering, Polytechnic of Milan, Milan, Italy.
Front Bioeng Biotechnol. 2020 Mar 5;8:181. doi: 10.3389/fbioe.2020.00181. eCollection 2020.
The design of markerless systems to reconstruct human motion in a timely, unobtrusive and externally valid manner is still an open challenge. Artificial intelligence algorithms based on automatic landmarks identification on video images opened to a new approach, potentially e-viable with low-cost hardware. OpenPose is a library that t using a two-branch convolutional neural network allows for the recognition of skeletons in the scene. Although OpenPose-based solutions are spreading, their metrological performances relative to video setup are still largely unexplored. This paper aimed at validating a two-cameras OpenPose-based markerless system for gait analysis, considering its accuracy relative to three factors: cameras' relative distance, gait direction and video resolution. Two volunteers performed a walking test within a gait analysis laboratory. A marker-based optical motion capture system was taken as a reference. Procedures involved: calibration of the stereoscopic system; acquisition of video recordings, simultaneously with the reference marker-based system; video processing within OpenPose to extract the subject's skeleton; videos synchronization; triangulation of the skeletons in the two videos to obtain the 3D coordinates of the joints. Two set of parameters were considered for the accuracy assessment: errors in trajectory reconstruction and error in selected gait space-temporal parameters (step length, swing and stance time). The lowest error in trajectories (20 mm) was obtained with cameras 1.8 m apart, highest resolution and straight gait, and the highest (60 mm) with the 1.0 m, low resolution and diagonal gait configuration. The OpenPose-based system tended to underestimate step length of about 1.5 cm, while no systematic biases were found for swing/stance time. Step length significantly changed according to gait direction ( = 0.008), camera distance ( = 0.020), and resolution ( < 0.001). Among stance and swing times, the lowest errors (0.02 and 0.05 s for stance and swing, respectively) were obtained with the 1 m, highest resolution and straight gait configuration. These findings confirm the feasibility of tracking kinematics and gait parameters of a single subject in a 3D space using two low-cost webcams and the OpenPose engine. In particular, the maximization of cameras distance and video resolution enabled to achieve the highest metrological performances.
以及时、不引人注意且外部有效的方式重建人体运动的无标记系统设计仍然是一个悬而未决的挑战。基于视频图像自动地标识别的人工智能算法开启了一种新方法,使用低成本硬件可能具有电子可行性。OpenPose是一个库,它使用双分支卷积神经网络来识别场景中的骨骼。尽管基于OpenPose的解决方案正在普及,但其相对于视频设置的计量性能仍在很大程度上未被探索。本文旨在验证一种基于双摄像头OpenPose的无标记系统用于步态分析,考虑其相对于三个因素的准确性:摄像头的相对距离、步态方向和视频分辨率。两名志愿者在步态分析实验室进行了步行测试。以基于标记的光学运动捕捉系统作为参考。程序包括:立体系统的校准;与基于参考标记的系统同时采集视频记录;在OpenPose内进行视频处理以提取受试者的骨骼;视频同步;对两个视频中的骨骼进行三角测量以获得关节的三维坐标。考虑了两组参数进行准确性评估:轨迹重建误差和选定步态时空参数(步长、摆动和站立时间)的误差。在摄像头相距1.8米、最高分辨率和直线步态的情况下,轨迹误差最低(约20毫米),而在1.0米、低分辨率和对角步态配置下误差最高(约60毫米)。基于OpenPose的系统往往会低估步长约1.5厘米,而在摆动/站立时间方面未发现系统性偏差。步长根据步态方向( = 0.008)、摄像头距离( = 0.020)和分辨率( < 0.001)有显著变化。在站立和摆动时间中,在1米、最高分辨率和直线步态配置下获得的误差最低(站立和摆动分别为0.02和0.05秒)。这些发现证实了使用两个低成本网络摄像头和OpenPose引擎在三维空间中跟踪单个受试者运动学和步态参数的可行性。特别是,摄像头距离和视频分辨率的最大化能够实现最高的计量性能。