Suppr超能文献

数据类型的数据挖掘策略综述,重点关注施工过程和健康安全管理。

A Review of Data Mining Strategies by Data Type, with a Focus on Construction Processes and Health and Safety Management.

机构信息

Department of Technological Innovations and Safety of Plants, Products and Anthropic Settlements (DIT), Italian National Institute for Insurance against Accidents at Work, Inail, 00144 Rome, Italy.

Department of Chemical Engineering Materials Environment (DICMA), Sapienza-University of Rome, 00184 Rome, Italy.

出版信息

Int J Environ Res Public Health. 2024 Jun 26;21(7):831. doi: 10.3390/ijerph21070831.

Abstract

UNLABELLED

Increasingly, information technology facilitates the storage and management of data useful for risk analysis and event prediction. Studies on data extraction related to occupational health and safety are increasingly available; however, due to its variability, the construction sector warrants special attention. This review is conducted under the research programs of the National Institute for Occupational Accident Insurance (Inail).

OBJECTIVES

The research question focuses on identifying which data mining (DM) methods, among supervised, unsupervised, and others, are most appropriate for certain investigation objectives, types, and sources of data, as defined by the authors.

METHODS

Scopus and ProQuest were the main sources from which we extracted studies in the field of construction, published between 2014 and 2023. The eligibility criteria applied in the selection of studies were based on the Preferred Reporting Items for Systematic Review and Meta-Analyses (PRISMA). For exploratory purposes, we applied hierarchical clustering, while for in-depth analysis, we used principal component analysis (PCA) and meta-analysis.

RESULTS

The search strategy based on the PRISMA eligibility criteria provided us with 63 out of 2234 potential articles, 206 observations, 89 methodologies, 4 survey purposes, 3 data sources, 7 data types, and 3 resource types. Cluster analysis and PCA organized the information included in the paper dataset into two dimensions and labels: "supervised methods, institutional dataset, and predictive and classificatory purposes" (correlation 0.97-8.18 × 10; -value 7.67 × 10-1.28 × 10) and the second, Dim2 "not-supervised methods; project, simulation, literature, text data; monitoring, decision-making processes; machinery and environment" (corr. 0.84-0.47; -value 5.79 × 10--3.59 × 10). We answered the research question regarding which method, among supervised, unsupervised, or other, is most suitable for application to data in the construction industry.

CONCLUSIONS

The meta-analysis provided an overall estimate of the better effectiveness of supervised methods (Odds Ratio = 0.71, Confidence Interval 0.53-0.96) compared to not-supervised methods.

摘要

未加标签

信息技术的发展越来越有助于存储和管理对风险分析和事件预测有用的数据。关于职业健康和安全相关数据提取的研究越来越多;然而,由于其多变性,建筑行业需要特别关注。本综述是在国家职业事故保险研究所(Inail)的研究计划下进行的。

目的

研究问题的重点是确定哪种数据挖掘(DM)方法,无论是监督式、非监督式还是其他方法,最适合作者定义的某些调查目标、数据类型和来源。

方法

Scopus 和 ProQuest 是我们从 2014 年至 2023 年发表的建筑领域研究中提取的主要来源。在选择研究时应用的纳入标准是基于系统评价和荟萃分析的首选报告项目(PRISMA)。出于探索性目的,我们应用了层次聚类,而对于深入分析,我们使用了主成分分析(PCA)和荟萃分析。

结果

基于 PRISMA 纳入标准的搜索策略为我们提供了 63 篇文章中的 2234 篇潜在文章,206 个观察结果,89 种方法,4 种调查目的,3 种数据源,7 种数据类型和 3 种资源类型。聚类分析和 PCA 将论文数据集中包含的信息组织成两个维度和标签:“监督方法、机构数据集和预测与分类目的”(相关性 0.97-8.18×10;-值 7.67×10-1.28×10)和第二个,Dim2“非监督方法;项目、模拟、文献、文本数据;监测、决策过程;机械和环境”(相关性 0.84-0.47;-值 5.79×10--3.59×10)。我们回答了关于在建筑行业中应用哪种方法(监督式、非监督式或其他方法)最适合的研究问题。

结论

荟萃分析提供了监督式方法(优势比=0.71,置信区间 0.53-0.96)比非监督式方法更有效的总体估计。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ec4/11277231/531912167507/ijerph-21-00831-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验