Perrone Mattia, Mell Steven P, Martin John T, Nho Shane J, Simmons Scott, Malloy Philip
Section of Young Adult Hip Surgery, Division of Sports Medicine, Department of Orthopedic Surgery, Rush Medical College of Rush University, Rush University Medical Center, Chicago, IL, USA.
Department of Mathematics and Computer Science, Drury University, Springfield, MO, USA.
Proc Inst Mech Eng H. 2025 Feb;239(2):202-211. doi: 10.1177/09544119251315877. Epub 2025 Feb 4.
Generative deep learning has emerged as a promising data augmentation technique in recent years. This approach becomes particularly valuable in areas such as motion analysis, where it is challenging to collect substantial amounts of data. The objective of the current study is to introduce a data augmentation strategy that relies on a variational autoencoder to generate synthetic data of kinetic and kinematic variables. The kinematic and kinetic variables consist of hip and knee joint angles and moments, respectively, in both sagittal and frontal plane, and ground reaction forces. Statistical parametric mapping (SPM) did not detect significant differences between real and synthetic data for each of the biomechanical variables considered. To further evaluate the effectiveness of this approach, a long-short term model (LSTM) was trained both only on real data (R) and on the combination of real and synthetic data (R&S); the performance of each of these two trained models was then assessed on real test data unseen during training. The principal findings included achieving comparable results in terms of nRMSE when predicting knee joint moments in the frontal (R&S: 9.86% vs R: 10.72%) and sagittal plane (R&S: 9.21% vs R: 9.75%), and hip joint moments in the frontal (R&S: 16.93% vs R: 16.79%) and sagittal plane (R&S: 13.29% vs R: 14.60%). The main novelty of this study lies in introducing an effective data augmentation approach in motion analysis settings.
近年来,生成式深度学习已成为一种很有前景的数据增强技术。这种方法在运动分析等领域变得尤为有价值,因为在这些领域收集大量数据具有挑战性。本研究的目的是引入一种数据增强策略,该策略依赖变分自编码器来生成动力学和运动学变量的合成数据。运动学和动力学变量分别由矢状面和额状面的髋关节和膝关节角度及力矩,以及地面反作用力组成。对于所考虑的每个生物力学变量,统计参数映射(SPM)未检测到真实数据和合成数据之间的显著差异。为了进一步评估这种方法的有效性,训练了一个长短期记忆模型(LSTM),一种仅基于真实数据(R)训练,另一种基于真实数据与合成数据的组合(R&S)训练;然后在训练期间未见过的真实测试数据上评估这两个训练模型的性能。主要发现包括,在预测额状面(R&S:9.86% 对 R:10.72%)和矢状面(R&S:9.21% 对 R:9.75%)的膝关节力矩以及额状面(R&S:16.93% 对 R:16.79%)和矢状面(R&S:13.29% 对 R:14.60%)的髋关节力矩时,在归一化均方根误差方面取得了可比的结果。本研究的主要新颖之处在于在运动分析设置中引入了一种有效的数据增强方法。