a Department of Industrial and Systems Engineering , Indian Institute of Technology Kharagpur , West Bengal , India.
Int J Inj Contr Saf Promot. 2018 Dec;25(4):416-426. doi: 10.1080/17457300.2018.1456468. Epub 2018 Apr 7.
The purpose of this study is to develop a text clustering-based cause and effect analysis methodology for incident data to unfold the root causes behind the incidents. A cause-effect diagram is usually prepared by using experts' knowledge which may fail to capture all the causes present at a workplace. On the other hand, the description of incidents provided by the workers in the form of incident reports is typically a rich data source and can be utilized to explore the causes and sub-causes of incidents. In this study, data were collected from an integrated steel plant. The text data were analysed using singular value decomposition (SVD) and expectation-maximization (EM) algorithm. Results suggest that text-document clustering can be used as a feasible method for exploring the hidden factors and trends from the description of incidents occurred at workplaces. The study also helped in finding out the anomaly in incident reporting.
本研究旨在开发一种基于文本聚类的事故数据分析方法,以揭示事故背后的根本原因。因果图通常是利用专家知识绘制的,可能无法捕捉到工作场所存在的所有原因。另一方面,工人以事故报告的形式提供的事故描述通常是丰富的数据来源,可以用来探索事故的原因和子原因。在这项研究中,数据是从一家综合钢铁厂收集的。使用奇异值分解(SVD)和期望最大化(EM)算法对文本数据进行了分析。结果表明,文本文档聚类可用于从工作场所发生的事故描述中探索隐藏因素和趋势。该研究还有助于发现事故报告中的异常情况。