Chvirova Diana, Egger Andreas, Fehrer Tobias, Kratsch Wolfgang, Röglinger Maximilian, Wittmann Jakob, Wördehoff Niklas
Branch Business & Information Systems Engineering of the Fraunhofer FIT, Alter Postweg 101, D-86159 Augsburg, Germany.
Branch Business & Information Systems Engineering of the Fraunhofer FIT, Wittelsbacherring 10, D-95444 Bayreuth, Germany.
Data Brief. 2024 Jul 8;55:110716. doi: 10.1016/j.dib.2024.110716. eCollection 2024 Aug.
This manuscript introduces a multimedia business process dataset provided by a German research institute. The dataset was systematically collected in a laboratory environment that reflects the workspace of IT staff managing IT Asset Management (ITAM) processes. It encompasses data from 121 process instances across six basic processes, captured in 37 video recordings from two camera perspectives, together with motion tracking, environmental sensors, an ITAM system trace, and event log data from user interactions. The data is made available in both raw and processed form. Discrete business process events from system activities are provided in the object-centric event log (OCEL) format. Real-world event data is supplied as raw video files and environmental sensor logs. The video files were also manually labelled with identifiable business process activities and their associated entities. This multimedia dataset is designed as a resource for developing, training, and evaluating process mining techniques based on unstructured data. Consequently, the dataset design emphasizes the traceability of activities and entities across the multimedia data sources.