Department of Rehabilitation Medicine, Jiangxi Provincial People's Hospital, Nanchang, 330013, China.
Department of Rehabilitation Medicine, Suichuan County People's Hospital, Jian, 343900, China; Epilepsy Center, Suichuan County People's Hospital, Jian, 343900, China.
Artif Intell Med. 2023 Jan;135:102474. doi: 10.1016/j.artmed.2022.102474. Epub 2022 Dec 16.
Many biomedical applications require fine motor skill assessments; however, real-time and contactless fine motor skill assessments are not typically implemented. In this study, we followed the 2D-to-3D pipeline principle and proposed a transformer-based spatial-temporal network to accurately regress 3D hand joint locations by inputting infrared thermal video for eliminating need of multiple cameras or RGB-D devices. We also developed a dataset composed of infrared thermal videos and ground truth annotations for training. The label represents a set of 3D joint locations from infrared optical trackers, which is considered the gold standard for clinical applications. To demonstrate their potential, the proposed method was used to measure the finger motion angle, and we investigated its accuracy by comparing the proposal with the Azure Kinect system and Leap Motion system. On the proposed dataset, the proposed method achieved a 3D hand pose mean error of less than 14 mm and outperforms the other deep learning methods. When the error thresholds were larger than approximately 35 mm, our method first to achieved excellent performance (>80%) in terms of the fraction of good frames. For the finger motion angle calculation task, the proposed and commercial systems had comparable inter-system reliability (ICC ranging from 0.81 to 0.83) and excellent validity (Pearson's r-values ranging from 0.82 to 0.86). We believe that the proposed approaches can capture hand motion and measure finger motion angles and can be used in different biomedicine scenarios as an effective evaluation tool for fine motor skills.
许多生物医学应用都需要精细运动技能评估;然而,实时和非接触式精细运动技能评估通常无法实现。在本研究中,我们遵循 2D 到 3D 的流水线原理,提出了一种基于变压器的时空网络,通过输入红外热视频来准确回归 3D 手关节位置,从而无需使用多个摄像机或 RGB-D 设备。我们还开发了一个包含红外热视频和地面实况注释的数据集用于训练。标签代表一组来自红外光学跟踪器的 3D 关节位置,这被认为是临床应用的黄金标准。为了展示其潜力,我们使用该方法来测量手指运动角度,并通过将该方法与 Azure Kinect 系统和 Leap Motion 系统进行比较来研究其准确性。在我们提出的数据集上,该方法的 3D 手姿势平均误差小于 14mm,优于其他深度学习方法。当误差阈值大于约 35mm 时,我们的方法首先在良好帧数的比例方面表现出优异的性能(>80%)。对于手指运动角度计算任务,所提出的方法和商业系统具有可比的系统间可靠性(ICC 范围从 0.81 到 0.83)和极好的有效性(Pearson r 值范围从 0.82 到 0.86)。我们相信,所提出的方法可以捕捉手部运动并测量手指运动角度,并可在不同的生物医学场景中用作精细运动技能的有效评估工具。