Dutta Amrita, Breloff Scott P, Dai Fei, Sinsel Erik W, Carey Robert E, Warren Christopher M, Wu John Z
Department of Civil and Environmental Engineering, West Virginia University, P.O. Box 6103, Morgantown, WV 26506, United States of America.
National Institute for Occupational Safety and Health, 1095 Willowdale Road, Morgantown, WV 26505, United States of America.
Autom Constr. 2020 Nov;119. doi: 10.1016/j.autcon.2020.103322.
Field or laboratory data collected for work-related musculoskeletal disorder (WMSD) risk assessment in construction often becomes unreliable as a large amount of data go missing due to technology-induced errors, instrument failures or sometimes at random. Missing data can adversely affect the assessment conclusions. This study proposes a method that applies Canonical Polyadic Decomposition (CPD) tensor decomposition to fuse multiple sparse risk-related datasets and fill in missing data by leveraging the correlation among multiple risk indicators within those datasets. Two knee WMSD risk-related datasets-3D knee rotation (kinematics) and electromyography (EMG) of five knee postural muscles-collected from previous studies were used for the validation and demonstration of the proposed method. The analysis results revealed that for a large portion of missing values (40%), the proposed method can generate a fused dataset that provides reliable risk assessment results highly consistent (70%-87%) with those obtained from the original experimental datasets. This signified the usefulness of the proposed method for use in WMSD risk assessment studies when data collection is affected by a significant amount of missing data, which will facilitate reliable assessment of WMSD risks among construction workers. In the future, findings of this study will be implemented to explore whether, and to what extent, the fused dataset outperforms the datasets with missing values by comparing consistencies of the risk assessment results obtained from these datasets for further investigation of the fusion performance.
在建筑行业中,为工作相关肌肉骨骼疾病(WMSD)风险评估而收集的现场或实验室数据往往变得不可靠,因为大量数据会因技术错误、仪器故障或有时是随机原因而丢失。缺失数据会对评估结论产生不利影响。本研究提出了一种方法,该方法应用典范多向分解(CPD)张量分解来融合多个稀疏的风险相关数据集,并通过利用这些数据集中多个风险指标之间的相关性来填补缺失数据。从先前研究中收集的两个与膝关节WMSD风险相关的数据集——五个膝关节姿势肌肉的三维膝关节旋转(运动学)和肌电图(EMG)——被用于对所提出方法的验证和演示。分析结果表明,对于很大一部分缺失值(40%),所提出的方法可以生成一个融合数据集,该数据集能提供与从原始实验数据集获得的结果高度一致(70%-87%)的可靠风险评估结果。这表明当数据收集受到大量缺失数据影响时,所提出的方法在WMSD风险评估研究中的有用性,这将有助于对建筑工人的WMSD风险进行可靠评估。未来,本研究的结果将通过比较从这些数据集获得的风险评估结果的一致性,来探索融合数据集在多大程度上优于有缺失值的数据集,以进一步研究融合性能。