• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高维混合数据的因果发现。

Causal discoveries for high dimensional mixed data.

机构信息

Department of Statistics, Iowa State University, Ames, Iowa, USA.

Gilead Sciences, Foster City, California, USA.

出版信息

Stat Med. 2022 Oct 30;41(24):4924-4940. doi: 10.1002/sim.9544. Epub 2022 Aug 15.

DOI:10.1002/sim.9544
PMID:35968913
Abstract

Causal relationships are of crucial importance for biological and medical research. Algorithms have been proposed for causal structure learning with graphical visualizations. While much of the literature focuses on biological studies where data often follow the same distribution, for example, the normal distribution for all variables, challenges emerge from epidemiological and clinical studies where data are often mixed with continuous, binary, and ordinal variables. We propose to use a mixed latent Gaussian copula model to estimate the underlying correlation structure via the rank correlation for mixed data. This correlation structure is then incorporated into a popular causal discovery algorithm, the PC algorithm, to identify causal structures. The proposed algorithm, called the latent-PC algorithm, is able to discover the true causal structure consistently under mild conditions in high dimensional settings. From simulation studies, the latent-PC algorithm delivers a competitive performance in terms of a similar or higher true positive rate and a similar or lower false positive rate, compared with other variants of the PC algorithm. In the high dimensional settings where the number of variables is more than the number of observations, the causal graphs identified by the latent-PC algorithm are closer to the true causal structures, compared to other competing algorithms. Further, we demonstrate the utility of the latent-PC algorithm in a real dataset for hepatocellular carcinoma. Causal structures for patient survival are visualized and connected with clinical interpretations in the literature.

摘要

因果关系对于生物医学研究至关重要。已经提出了一些算法来进行具有图形可视化的因果结构学习。虽然大部分文献都集中在生物学研究上,这些研究中的数据通常遵循相同的分布,例如所有变量的正态分布,但来自流行病学和临床研究的数据通常是混合的,包括连续、二值和有序变量,这就带来了挑战。我们建议使用混合潜在高斯 Copula 模型通过混合数据的秩相关来估计潜在的相关结构。然后,将这种相关结构纳入一种流行的因果发现算法——PC 算法中,以识别因果结构。所提出的算法称为潜在-PC 算法,在高维环境下的温和条件下能够一致地发现真实的因果结构。通过模拟研究,与 PC 算法的其他变体相比,潜在-PC 算法在类似或更高的真阳性率和类似或更低的假阳性率方面表现出了有竞争力的性能。在变量数量超过观测数量的高维环境中,与其他竞争算法相比,潜在-PC 算法识别的因果图更接近真实的因果结构。此外,我们在肝细胞癌的真实数据集上展示了潜在-PC 算法的实用性。可视化了患者生存的因果结构,并与文献中的临床解释联系起来。

相似文献

1
Causal discoveries for high dimensional mixed data.高维混合数据的因果发现。
Stat Med. 2022 Oct 30;41(24):4924-4940. doi: 10.1002/sim.9544. Epub 2022 Aug 15.
2
Estimating causal effects of time-dependent exposures on a binary endpoint in a high-dimensional setting.在高维环境中估计时变暴露对二分类结局的因果效应。
BMC Med Res Methodol. 2018 Jul 3;18(1):67. doi: 10.1186/s12874-018-0527-5.
3
Causality in linear nongaussian acyclic models in the presence of latent gaussian confounders.存在潜在高斯混杂因素时线性非高斯非循环模型中的因果关系。
Neural Comput. 2013 Jun;25(6):1605-41. doi: 10.1162/NECO_a_00444. Epub 2013 Mar 21.
4
An algorithm for direct causal learning of influences on patient outcomes.一种用于直接因果学习对患者预后影响的算法。
Artif Intell Med. 2017 Jan;75:1-15. doi: 10.1016/j.artmed.2016.10.003. Epub 2016 Nov 5.
5
Nonparametric Causal Structure Learning in High Dimensions.高维非参数因果结构学习
Entropy (Basel). 2022 Feb 28;24(3):351. doi: 10.3390/e24030351.
6
Decomposition of variation of mixed variables by a latent mixed Gaussian copula model.混合变量变异的分解由潜在混合高斯 Copula 模型完成。
Biometrics. 2023 Jun;79(2):1187-1200. doi: 10.1111/biom.13660. Epub 2022 Mar 30.
7
Causal Discovery in Linear Non-Gaussian Acyclic Model With Multiple Latent Confounders.具有多个潜在混杂因素的线性非高斯无环模型中的因果发现。
IEEE Trans Neural Netw Learn Syst. 2022 Jul;33(7):2816-2827. doi: 10.1109/TNNLS.2020.3045812. Epub 2022 Jul 6.
8
Developing a novel causal inference algorithm for personalized biomedical causal graph learning using meta machine learning.利用元机器学习开发个性化生物医学因果图学习的新因果推理算法。
BMC Med Inform Decis Mak. 2024 May 27;24(1):137. doi: 10.1186/s12911-024-02510-6.
9
Multiple imputation and test-wise deletion for causal discovery with incomplete cohort data.利用不完全队列数据进行因果发现的多重插补和逐个检验删除。
Stat Med. 2022 Oct 15;41(23):4716-4743. doi: 10.1002/sim.9535. Epub 2022 Jul 31.
10
Transfer learning in high-dimensional semiparametric graphical models with application to brain connectivity analysis.基于高维半参数图模型的迁移学习及其在脑连接分析中的应用。
Stat Med. 2022 Sep 20;41(21):4112-4129. doi: 10.1002/sim.9499. Epub 2022 Jun 21.

引用本文的文献

1
Asymptotic Distribution-Free Independence Test for High Dimension Data.高维数据的渐近无分布独立性检验
J Am Stat Assoc. 2024;119(547):1794-1804. doi: 10.1080/01621459.2023.2218030. Epub 2023 Dec 21.