Discipline of Information and Communication Technology, Wicking Dementia Research and Education Centre, University of Tasmania, Australia.
Sensorimotor Neuroscience and Aging Group, School of Psychological Sciences, University of Tasmania, Australia.
Comput Biol Med. 2022 Aug;147:105776. doi: 10.1016/j.compbiomed.2022.105776. Epub 2022 Jun 21.
Telemedicine video consultations are rapidly increasing globally, accelerated by the COVID-19 pandemic. This presents opportunities to use computer vision technologies to augment clinician visual judgement because video cameras are so ubiquitous in personal devices and new techniques, such as DeepLabCut (DLC) can precisely measure human movement from smartphone videos. However, the accuracy of DLC to track human movements in videos obtained from laptop cameras, which have a much lower FPS, has never been investigated; this is a critical gap because patients use laptops for most telemedicine consultations.
To determine the validity and reliability of DLC applied to laptop videos to measure finger tapping, a validated test of human movement.
Sixteen adults completed finger-tapping tests at 0.5 Hz, 1 Hz, 2 Hz, 3 Hz and at maximal speed. Hand movements were recorded simultaneously by a laptop camera at 30 frames per second (FPS) and by Optotrak, a 3D motion analysis system at 250 FPS. Eight DLC neural network architectures (ResNet50, ResNet101, ResNet152, MobileNetV1, MobileNetV2, EfficientNetB0, EfficientNetB3, EfficientNetB6) were applied to the laptop video and extracted movement features were compared to the ground truth Optotrak motion tracking.
Over 96% (529/552) of DLC measures were within +/-0.5 Hz of the Optotrak measures. At tapping frequencies >4 Hz, there was progressive decline in accuracy, attributed to motion blur associated with the laptop camera's low FPS. Computer vision methods hold potential for moving us towards intelligent telemedicine by providing human movement analysis during consultations. However, further developments are required to accurately measure the fastest movements.
远程医疗视频咨询在全球范围内迅速增加,这是由 COVID-19 大流行加速的。这为使用计算机视觉技术来增强临床医生的视觉判断提供了机会,因为视频摄像头在个人设备中如此普遍,而新技术,如 DeepLabCut(DLC),可以从智能手机视频中精确测量人类运动。然而,DLC 从未被用于跟踪笔记本电脑摄像头获取的视频中的人类运动,而笔记本电脑摄像头的帧率要低得多;这是一个关键的差距,因为患者在大多数远程医疗咨询中使用笔记本电脑。
确定 DLC 应用于笔记本电脑视频以测量手指敲击的有效性和可靠性,这是一种经过验证的人类运动测试。
16 名成年人以 0.5 Hz、1 Hz、2 Hz、3 Hz 和最大速度完成了手指敲击测试。手部运动同时由笔记本电脑摄像头以每秒 30 帧(fps)和 Optotrak(一种每秒 250 帧的 3D 运动分析系统)记录。将 8 种 DLC 神经网络架构(ResNet50、ResNet101、ResNet152、MobileNetV1、MobileNetV2、EfficientNetB0、EfficientNetB3、EfficientNetB6)应用于笔记本电脑视频,并将提取的运动特征与 Optotrak 运动跟踪的真实值进行比较。
超过 96%(529/552)的 DLC 测量值与 Optotrak 测量值相差在 0.5 Hz 以内。在高于 4 Hz 的敲击频率下,准确性逐渐下降,这归因于笔记本电脑摄像头的低帧率导致的运动模糊。计算机视觉方法有可能通过在咨询期间提供人类运动分析,使我们朝着智能远程医疗迈进。然而,还需要进一步的发展来准确测量最快的运动。