The Department of Cognitive and Brain Sciences, The Zlotowski Center for Neuroscience, and The Data Science Research Center, Ben-Gurion University of the Negev, Beer-Sheva, Israel.
Gilasio Coding, Tel-Aviv, Israel.
Big Data. 2021 Dec;9(6):417-426. doi: 10.1089/big.2021.0191. Epub 2021 Oct 12.
The identification of extreme rare events is a challenge that appears in several real-world contexts, from screening for solo perpetrators to the prediction of failures in industrial production. In this article, we explain the challenge and present a new methodology for addressing it, a methodology that may be considered in terms of features engineering. This methodology, which is based on Jaynes inferential approach, is tested on a dataset dealing with failures in production in the pulp-and-paper industry. The results are discussed in the context of the benefits of using the approach for features engineering in practical contexts involving measurable risks.
极端罕见事件的识别是一个出现在多个现实背景中的挑战,从筛查单独作案者到预测工业生产中的故障。在本文中,我们解释了这一挑战,并提出了一种新的解决方法,从特征工程的角度来看,这种方法可以被认为是一种特征工程方法。这种基于杰恩斯推理方法的方法在一个涉及纸浆和造纸行业生产故障的数据集上进行了测试。结果在涉及可衡量风险的实际背景下使用该方法进行特征工程的好处方面进行了讨论。