Generalized Score Functions for Causal Discovery.

Authors

Huang Biwei, Zhang Kun, Lin Yizhu, Schölkopf Bernhard, Glymour Clark

Affiliations

Department of Philosophy, Carnegie Mellon University.

MPI for Intelligent Systems, Tübingen, Germany.

Publication information

KDD. 2018 Aug;2018:1551-1560. doi: 10.1145/3219819.3220104.

Abstract

Discovery of causal relationships from observational data is a fundamental problem. Roughly speaking, there are two types of methods for causal discovery, constraint-based ones and score-based ones. Score-based methods avoid the multiple testing problem and enjoy certain advantages compared to constraint-based ones. However, most of them need strong assumptions on the functional forms of causal mechanisms, as well as on data distributions, which limit their applicability. In practice the precise information of the underlying model class is usually unknown. If the above assumptions are violated, both spurious and missing edges may result. In this paper, we introduce generalized score functions for causal discovery based on the characterization of general (conditional) independence relationships between random variables, without assuming particular model classes. In particular, we exploit regression in RKHS to capture the dependence in a non-parametric way. The resulting causal discovery approach produces asymptotically correct results in rather general cases, which may have nonlinear causal mechanisms, a wide class of data distributions, mixed continuous and discrete data, and multidimensional variables. Experimental results on both synthetic and real-world data demonstrate the efficacy of our proposed approach.
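As a rough illustration of what regression in an RKHS can contribute to a score, the sketch below fits kernel ridge regression with a Gaussian kernel and uses the negative held-out squared error as a toy local score for a candidate parent set. This is not the score function defined in the paper. The function names (`rbf_kernel`, `cv_score`), the kernel width, the regularization constant, and the synthetic data are all assumptions made for the example.

```python
import numpy as np

def rbf_kernel(a, b, width=1.0):
    """Gaussian (RBF) kernel matrix between the rows of a and the rows of b."""
    sq = (np.sum(a**2, axis=1)[:, None]
          + np.sum(b**2, axis=1)[None, :]
          - 2.0 * a @ b.T)
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * width**2))

def cv_score(x, parents, folds=5, lam=1e-3, width=1.0):
    """Toy local score (hypothetical, for illustration only): negative mean
    held-out squared error of kernel ridge regression of x on `parents`.
    Higher is better. With parents=None the predictor is the training mean."""
    n = x.shape[0]
    idx = np.arange(n)
    errs = []
    for k in range(folds):
        test = idx[k::folds]               # every folds-th sample is held out
        train = np.setdiff1d(idx, test)
        if parents is None:
            pred = np.full(test.shape[0], x[train].mean())
        else:
            K = rbf_kernel(parents[train], parents[train], width)
            reg = lam * len(train) * np.eye(len(train))
            alpha = np.linalg.solve(K + reg, x[train])  # ridge solution in the RKHS
            pred = rbf_kernel(parents[test], parents[train], width) @ alpha
        errs.append(np.mean((x[test] - pred) ** 2))
    return -float(np.mean(errs))

# Synthetic data (assumed): a nonlinear mechanism with additive noise.
rng = np.random.default_rng(0)
n = 300
cause = rng.uniform(-2, 2, size=(n, 1))
effect = np.sin(2 * cause[:, 0]) + 0.1 * rng.normal(size=n)
unrelated = rng.normal(size=(n, 1))

score_with = cv_score(effect, cause)           # true parent included
score_without = cv_score(effect, None)         # empty parent set
score_unrelated = cv_score(effect, unrelated)  # independent candidate parent
print(score_with, score_without, score_unrelated)
```

In this toy setting the parent set containing the true cause should score clearly higher than the empty set or an independent variable, even though the mechanism is nonlinear and no functional form was specified. A score-based search would compare such local scores across candidate graphs.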
