Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia.
Institute of Medical Science Technology, Universiti Kuala Lumpur, Selangor, Malaysia.
Front Public Health. 2022 Sep 15;10:984099. doi: 10.3389/fpubh.2022.984099. eCollection 2022.
Workplace accidents can cause a catastrophic loss to the company including human injuries and fatalities. Occupational injury reports may provide a detailed description of how the incidents occurred. Thus, the narrative is a useful information to extract, classify and analyze occupational injury. This study provides a systematic review of text mining and Natural Language Processing (NLP) applications to extract text narratives from occupational injury reports. A systematic search was conducted through multiple databases including Scopus, PubMed, and Science Direct. Only original studies that examined the application of machine and deep learning-based Natural Language Processing models for occupational injury analysis were incorporated in this study. A total of 27, out of 210 articles were reviewed in this study by adopting the Preferred Reporting Items for Systematic Review (PRISMA). This review highlighted that various machine and deep learning-based NLP models such as K-means, Naïve Bayes, Support Vector Machine, Decision Tree, and K-Nearest Neighbors were applied to predict occupational injury. On top of these models, deep neural networks are also included in classifying the type of accidents and identifying the causal factors. However, there is a paucity in using the deep learning models in extracting the occupational injury reports. This is due to these techniques are pretty much very recent and making inroads into decision-making in occupational safety and health as a whole. Despite that, this paper believed that there is a huge and promising potential to explore the application of NLP and text-based analytics in this occupational injury research field. Therefore, the improvement of data balancing techniques and the development of an automated decision-making support system for occupational injury by applying the deep learning-based NLP models are the recommendations given for future research.
工作场所事故可能会给公司造成灾难性的损失,包括人员受伤和死亡。职业伤害报告可能会详细描述事故是如何发生的。因此,叙述是提取、分类和分析职业伤害的有用信息。本研究对从职业伤害报告中提取文本叙述的文本挖掘和自然语言处理(NLP)应用进行了系统回顾。通过 Scopus、PubMed 和 Science Direct 等多个数据库进行了系统搜索。本研究仅纳入了检查基于机器和深度学习的自然语言处理模型在职业伤害分析中的应用的原始研究。通过采用系统评价的首选报告项目(PRISMA),本研究共回顾了 27 篇文章,其中 210 篇文章。本综述强调,各种基于机器和深度学习的 NLP 模型,如 K-means、朴素贝叶斯、支持向量机、决策树和 K-最近邻,已被应用于预测职业伤害。除了这些模型之外,深度学习网络还被用于对事故类型进行分类和识别因果因素。然而,在提取职业伤害报告方面,使用深度学习模型的情况很少。这是因为这些技术非常新,正在整体上涉足职业安全和健康的决策制定。尽管如此,本文认为,在职业伤害研究领域探索 NLP 和基于文本的分析的应用具有巨大的潜力。因此,建议改进数据平衡技术,并通过应用基于深度学习的 NLP 模型开发职业伤害的自动化决策支持系统,作为未来研究的建议。