Hira Swati, Deshpande P S
Department of Computer Science and Engineering, Visvesvaraya National Institute of Technology, 440010 Nagpur, India.
Springerplus. 2016 Sep 21;5(1):1625. doi: 10.1186/s40064-016-3292-0. eCollection 2016.
Discovery of cause-effect relationships, particularly in large databases of time-series is challenging because of continuous data of different characteristics and complex lagged relationships. In this paper, we have proposed a novel approach, to extract cause-effect relationships in large time series data set of socioeconomic indicators. The method enhances the scope of relationship discovery to cause-effect relationships by identifying multiple causal structures such as binary, transitive, many to one and cyclic. We use temporal association and temporal odds ratio to exclude noncausal association and to ensure the high reliability of discovered causal rules. We assess the method with both synthetic and real-world datasets. Our proposed method will help to build quantitative models to analyze socioeconomic processes by generating a precise cause-effect relationship between different economic indicators. The outcome shows that the proposed method can effectively discover existing causality structure in large time series databases.
发现因果关系,尤其是在大型时间序列数据库中,具有挑战性,因为存在具有不同特征的连续数据以及复杂的滞后关系。在本文中,我们提出了一种新颖的方法,用于在社会经济指标的大型时间序列数据集中提取因果关系。该方法通过识别多种因果结构,如二元、传递、多对一和循环因果结构,扩大了关系发现到因果关系的范围。我们使用时间关联和时间优势比来排除非因果关联,并确保所发现因果规则的高可靠性。我们使用合成数据集和真实世界数据集对该方法进行评估。我们提出的方法将有助于通过生成不同经济指标之间精确的因果关系来构建定量模型,以分析社会经济过程。结果表明,所提出的方法能够有效地在大型时间序列数据库中发现现有的因果结构。