Chen Kaixu, Asada Tomoyuki, Ienaga Naoto, Miura Kousei, Sakashita Kotaro, Sunami Takahiro, Kadone Hideki, Yamazaki Masashi, Kuroda Yoshihiro
Degree Programs in Systems and Information Engineering, University of Tsukuba, Tsukuba, Japan.
Department of Orthopaedic Surgery, Institute of Medicine, University of Tsukuba, Tsukuba, Japan.
Front Neurosci. 2023 Dec 11;17:1278584. doi: 10.3389/fnins.2023.1278584. eCollection 2023.
INTRODUCTION: Assessment of human gait posture can be clinically effective in diagnosing human gait deformities early in life. Currently, two methods-static and dynamic-are used to diagnose adult spinal deformity (ASD) and other spinal disorders. Full-spine lateral standing radiographs are used in the standard static method. However, this is a static assessment of joints in the standing position and does not include information on joint changes when the patient walks. Careful observation of long-distance walking can provide a dynamic assessment that reveals an uncompensated posture; however, this increases the workload of medical practitioners. A three-dimensional (3D) motion system is proposed for the dynamic method. Although the motion system successfully detected dynamic posture changes, access to the facilities was limited. Therefore, a diagnostic approach that is facility-independent, has low practice flow, and does not involve patient contact is required. METHODS: We focused on a video-based method to classify patients with spinal disorders either as ASD, or other forms of ASD. To achieve this goal, we present a video-based two-stage machine-learning method. In the first stage, deep learning methods are used to locate the patient and extract the area where the patient is located. In the second stage, a 3D CNN (convolutional neural network) device is used to capture spatial and temporal information (dynamic motion) from the extracted frames. Disease classification is performed by discerning posture and gait from the extracted frames. Model performance was assessed using the mean accuracy, F1 score, and area under the receiver operating characteristic curve (AUROC), with five-fold cross-validation. We also compared the final results with professional observations. RESULTS: Our experiments were conducted using a gait video dataset comprising 81 patients. The experimental results indicated that our method is effective for classifying ASD and other spinal disorders. The proposed method achieved a mean accuracy of 0.7553, an F1 score of 0.7063, and an AUROC score of 0.7864. Additionally, ablation experiments indicated the importance of the first stage (detection stage) and transfer learning of our proposed method. DISCUSSION: The observations from the two doctors were compared using the proposed method. The mean accuracies observed by the two doctors were 0.4815 and 0.5247, with AUROC scores of 0.5185 and 0.5463, respectively. We proved that the proposed method can achieve accurate and reliable medical testing results compared with doctors' observations using videos of 1 s duration. All our code, models, and results are available at https://github.com/ChenKaiXuSan/Walk_Video_PyTorch. The proposed framework provides a potential video-based method for improving the clinical diagnosis for ASD and non-ASD. This framework might, in turn, benefit both patients and clinicians to treat the disease quickly and directly and further reduce facility dependency and data-driven systems.
引言:评估人类步态姿势在临床早期诊断人类步态畸形方面可能具有重要作用。目前,有静态和动态两种方法用于诊断成人脊柱畸形(ASD)和其他脊柱疾病。标准的静态方法使用全脊柱站立位侧位X线片。然而,这是对站立位关节的静态评估,不包括患者行走时关节变化的信息。仔细观察远距离行走可提供动态评估,揭示未代偿的姿势;然而,这增加了医务人员的工作量。对于动态方法,有人提出了一种三维(3D)运动系统。尽管该运动系统成功检测到了动态姿势变化,但使用这些设备的机会有限。因此,需要一种独立于设备、操作流程简单且不涉及患者接触的诊断方法。 方法:我们专注于一种基于视频的方法,用于将脊柱疾病患者分类为ASD或其他形式的ASD。为实现这一目标,我们提出了一种基于视频的两阶段机器学习方法。在第一阶段,使用深度学习方法定位患者并提取患者所在区域。在第二阶段,使用3D卷积神经网络(CNN)设备从提取的帧中捕获空间和时间信息(动态运动)。通过从提取的帧中辨别姿势和步态来进行疾病分类。使用平均准确率、F1分数和受试者工作特征曲线下面积(AUROC),通过五折交叉验证评估模型性能。我们还将最终结果与专业观察结果进行了比较。 结果:我们使用包含81名患者的步态视频数据集进行了实验。实验结果表明,我们的方法在对ASD和其他脊柱疾病进行分类方面是有效的。所提出的方法平均准确率达到0.7553,F1分数为0.7063,AUROC分数为0.7864。此外,消融实验表明了我们所提出方法的第一阶段(检测阶段)和迁移学习的重要性。 讨论:使用所提出的方法对两位医生的观察结果进行了比较。两位医生观察到的平均准确率分别为0.4815和0.5247,AUROC分数分别为0.5185和0.5463。我们证明,与医生使用1秒时长视频的观察结果相比,所提出的方法能够获得准确可靠的医学检测结果。我们所有的代码、模型和结果都可在https://github.com/ChenKaiXuSan/Walk_Video_PyTorch获取。所提出的框架为改善ASD和非ASD的临床诊断提供了一种潜在的基于视频的方法。反过来,这个框架可能会使患者和临床医生都受益,能够快速直接地治疗疾病,并进一步减少对设备的依赖和数据驱动系统。
Front Neurosci. 2023-12-11
Cochrane Database Syst Rev. 2022-2-1
BMC Med Imaging. 2023-10-19
Zhongguo Gu Shang. 2018-6-25
Quant Imaging Med Surg. 2025-5-1
N Am Spine Soc J. 2024-10-1
IEEE J Biomed Health Inform. 2023-1
IEEE Trans Pattern Anal Mach Intell. 2023-3
IEEE Trans Pattern Anal Mach Intell. 2021-1