Biza Konstantina, Tsamardinos Ioannis, Triantafillou Sofia
IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):4963-4973. doi: 10.1109/TNNLS.2022.3185842. Epub 2024 Apr 4.
Causal discovery is continually being enriched with new algorithms for learning causal graphical probabilistic models. Each one of them requires a set of hyperparameters, creating a great number of combinations. Given that the true graph is unknown and the learning task is unsupervised, the challenge to a practitioner is how to tune these choices. We propose out-of-sample causal tuning (OCT) that aims to select an optimal combination. The method treats a causal model as a set of predictive models and uses out-of-sample protocols for supervised methods. This approach can handle general settings like latent confounders and nonlinear relationships. The method uses an information-theoretic approach to be able to generalize to mixed data types and a penalty for dense graphs to penalize for complexity. To evaluate OCT, we introduce a causal-based simulation method to create datasets that mimic the properties of real-world problems. We evaluate OCT against two other tuning approaches, based on stability and in-sample fitting. We show that OCT performs well in many experimental settings and it is an effective tuning method for causal discovery.
因果发现不断地被用于学习因果图形概率模型的新算法所丰富。它们中的每一个都需要一组超参数,从而产生大量的组合。鉴于真实的图是未知的且学习任务是无监督的,从业者面临的挑战是如何调整这些选择。我们提出了样本外因果调整(OCT),旨在选择最优组合。该方法将因果模型视为一组预测模型,并对监督方法使用样本外协议。这种方法可以处理诸如潜在混杂因素和非线性关系等一般情况。该方法使用信息论方法以便能够推广到混合数据类型,并对密集图使用惩罚项以惩罚复杂性。为了评估OCT,我们引入一种基于因果的模拟方法来创建模拟现实世界问题属性的数据集。我们基于稳定性和样本内拟合,将OCT与其他两种调整方法进行比较评估。我们表明,OCT在许多实验设置中表现良好,并且是一种用于因果发现的有效调整方法。