Institute for Health Informatics, University of Minnesota, Minneapolis, MN, 55455, USA.
Mayo Clinic, Rochester, MN, 55905, USA.
Sci Rep. 2020 Feb 19;10(1):2975. doi: 10.1038/s41598-020-59669-x.
Causal Structure Discovery (CSD) is the problem of identifying causal relationships from large quantities of data through computational methods. With the limited ability of traditional association-based computational methods to discover causal relationships, CSD methodologies are gaining popularity. The goal of the study was to systematically examine whether (i) CSD methods can discover the known causal relationships from observational clinical data and (ii) to offer guidance to accurately discover known causal relationships. We used Alzheimer's disease (AD), a complex progressive disease, as a model because the well-established evidence provides a "gold-standard" causal graph for evaluation. We evaluated two CSD methods, Fast Causal Inference (FCI) and Fast Greedy Equivalence Search (FGES) in their ability to discover this structure from data collected by the Alzheimer's Disease Neuroimaging Initiative (ADNI). We used structural equation models (which is not designed for CSD) as control. We applied these methods under three scenarios defined by increasing amounts of background knowledge provided to the methods. The methods were evaluated by comparing the resulting causal relationships with the "gold standard" graph that was constructed from literature. Dedicated CSD methods managed to discover graphs that nearly coincided with the gold standard. For best results, CSD algorithms should be used with longitudinal data providing as much prior knowledge as possible.
因果结构发现(CSD)是通过计算方法从大量数据中识别因果关系的问题。由于传统基于关联的计算方法发现因果关系的能力有限,CSD 方法越来越受欢迎。本研究的目的是系统地检查(i)CSD 方法是否可以从观察性临床数据中发现已知的因果关系,以及(ii)提供准确发现已知因果关系的指导。我们使用阿尔茨海默病(AD)作为模型,因为已建立的证据为评估提供了一个“黄金标准”因果图。我们评估了两种 CSD 方法,即快速因果推理(FCI)和快速贪婪等价搜索(FGES),以了解它们从阿尔茨海默病神经影像学倡议(ADNI)收集的数据中发现这种结构的能力。我们使用结构方程模型(不是为 CSD 设计的)作为对照。我们在三种情况下应用这些方法,这些情况由向方法提供的背景知识的数量来定义。通过将得到的因果关系与从文献中构建的“黄金标准”图进行比较来评估这些方法。专门的 CSD 方法设法发现了与黄金标准几乎一致的图。为了获得最佳结果,应使用提供尽可能多先验知识的纵向数据来使用 CSD 算法。