• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过剥离算法利用广义线性模型进行因果发现

Causal Discovery with Generalized Linear Models through Peeling Algorithms.

作者信息

Wang Minjie, Shen Xiaotong, Pan Wei

机构信息

Department of Mathematics and Statistics, Binghamton University, State University of New York, Binghamton, NY 13902, USA.

School of Statistics, University of Minnesota, Minneapolis, MN 55455, USA.

出版信息

J Mach Learn Res. 2024;25.

PMID:39758585
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11699566/
Abstract

This article presents a novel method for causal discovery with generalized structural equation models suited for analyzing diverse types of outcomes, including discrete, continuous, and mixed data. Causal discovery often faces challenges due to unmeasured confounders that hinder the identification of causal relationships. The proposed approach addresses this issue by developing two peeling algorithms (bottom-up and top-down) to ascertain causal relationships and valid instruments. This approach first reconstructs a super-graph to represent ancestral relationships between variables, using a peeling algorithm based on nodewise GLM regressions that exploit relationships between primary and instrumental variables. Then, it estimates parent-child effects from the ancestral relationships using another peeling algorithm while deconfounding a child's model with information borrowed from its parents' models. The article offers a theoretical analysis of the proposed approach, establishing conditions for model identifiability and providing statistical guarantees for accurately discovering parent-child relationships via the peeling algorithms. Furthermore, the article presents numerical experiments showcasing the effectiveness of our approach in comparison to state-of-the-art structure learning methods without confounders. Lastly, it demonstrates an application to Alzheimer's disease (AD), highlighting the method's utility in constructing gene-to-gene and gene-to-disease regulatory networks involving Single Nucleotide Polymorphisms (SNPs) for healthy and AD subjects.

摘要

本文提出了一种利用广义结构方程模型进行因果发现的新方法,该模型适用于分析包括离散、连续和混合数据在内的各种类型的结果。由于未测量的混杂因素阻碍了因果关系的识别,因果发现常常面临挑战。所提出的方法通过开发两种剥离算法(自下而上和自上而下)来确定因果关系和有效的工具变量,从而解决了这个问题。该方法首先使用基于节点广义线性模型回归的剥离算法重建一个超图,以表示变量之间的祖先关系,该算法利用主变量和工具变量之间的关系。然后,它使用另一种剥离算法从祖先关系中估计父子效应,同时利用从父模型借用的信息对子模型进行去混杂。本文对所提出的方法进行了理论分析,确定了模型可识别性的条件,并为通过剥离算法准确发现父子关系提供了统计保证。此外,本文还展示了数值实验,与无混杂因素的现有最先进结构学习方法相比,展示了我们方法的有效性。最后,它展示了在阿尔茨海默病(AD)中的应用,突出了该方法在构建健康和AD受试者的涉及单核苷酸多态性(SNP)的基因-基因和基因-疾病调控网络中的效用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f1a/11699566/76a1e955cbd8/nihms-2044931-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f1a/11699566/1c8e2bcadda8/nihms-2044931-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f1a/11699566/76a1e955cbd8/nihms-2044931-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f1a/11699566/1c8e2bcadda8/nihms-2044931-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f1a/11699566/76a1e955cbd8/nihms-2044931-f0002.jpg

相似文献

1
Causal Discovery with Generalized Linear Models through Peeling Algorithms.通过剥离算法利用广义线性模型进行因果发现
J Mach Learn Res. 2024;25.
2
Discovery and inference of a causal network with hidden confounding.具有隐藏混杂因素的因果网络的发现与推断。
J Am Stat Assoc. 2024;119(548):2572-2584. doi: 10.1080/01621459.2023.2261658. Epub 2023 Nov 17.
3
Nonlinear causal discovery with confounders.含混杂因素的非线性因果发现
J Am Stat Assoc. 2024;119(546):1205-1214. doi: 10.1080/01621459.2023.2179490. Epub 2023 Mar 15.
4
Inference for a Large Directed Acyclic Graph with Unspecified Interventions.具有未指定干预措施的大型有向无环图的推断
J Mach Learn Res. 2023 Jan-Dec;24.
5
Structural factor equation models for causal network construction via directed acyclic mixed graphs.基于有向无环混合图的因果网络构建的结构因子方程模型。
Biometrics. 2021 Jun;77(2):573-586. doi: 10.1111/biom.13322. Epub 2020 Jul 18.
6
Directed Graphical Models and Causal Discovery for Zero-Inflated Data.零膨胀数据的有向图模型与因果发现
Proc Mach Learn Res. 2023;213:27-67.
7
AnchorFCI: harnessing genetic anchors for enhanced causal discovery of cardiometabolic disease pathways.锚定FCI:利用基因锚定技术增强对心血管代谢疾病通路的因果发现
Front Genet. 2024 Dec 9;15:1436947. doi: 10.3389/fgene.2024.1436947. eCollection 2024.
8
SLIVER: Unveiling large scale gene regulatory networks of single-cell transcriptomic data through causal structure learning and modules aggregation.SLIVER:通过因果结构学习和模块聚合揭示单细胞转录组数据的大规模基因调控网络。
Comput Biol Med. 2024 Aug;178:108690. doi: 10.1016/j.compbiomed.2024.108690. Epub 2024 Jun 9.
9
Learning Subject-Specific Directed Acyclic Graphs With Mixed Effects Structural Equation Models From Observational Data.利用混合效应结构方程模型从观测数据中学习特定主题的有向无环图
Front Genet. 2018 Oct 2;9:430. doi: 10.3389/fgene.2018.00430. eCollection 2018.
10
Testability of Instrumental Variables in Linear Non-Gaussian Acyclic Causal Models.线性非高斯无环因果模型中工具变量的可检验性
Entropy (Basel). 2022 Apr 5;24(4):512. doi: 10.3390/e24040512.

本文引用的文献

1
Testing Directed Acyclic Graph via Structural, Supervised and Generative Adversarial Learning.通过结构、监督和生成对抗学习测试有向无环图
J Am Stat Assoc. 2024;119(547):1833-1846. doi: 10.1080/01621459.2023.2220169. Epub 2023 Jul 12.
2
Inference for a Large Directed Acyclic Graph with Unspecified Interventions.具有未指定干预措施的大型有向无环图的推断
J Mach Learn Res. 2023 Jan-Dec;24.
3
DAGBagM: learning directed acyclic graphs of mixed variables with an application to identify protein biomarkers for treatment response in ovarian cancer.
DAGBagM:学习混合变量的有向无环图及其在卵巢癌治疗反应中识别蛋白质生物标志物的应用。
BMC Bioinformatics. 2022 Aug 5;23(1):321. doi: 10.1186/s12859-022-04864-y.
4
Activating transcription factor 6 regulates cystathionine to increase autophagy and restore memory in Alzheimer' s disease model mice.激活转录因子 6 调节胱硫醚以增加自噬并恢复阿尔茨海默病模型小鼠的记忆。
Biochem Biophys Res Commun. 2022 Jul 30;615:109-115. doi: 10.1016/j.bbrc.2022.05.053. Epub 2022 May 17.
5
A numerical approach for detecting switch-like bistability in mass action chemical reaction networks with conservation laws.一种用于检测具有守恒律的质量作用化学反应网络中开关式双稳性的数值方法。
BMC Bioinformatics. 2022 Jan 4;23(1):1. doi: 10.1186/s12859-021-04477-x.
6
Common genetic signatures of Alzheimer's disease in Down Syndrome.唐氏综合征中阿尔茨海默病的常见遗传特征。
F1000Res. 2020 Nov 5;9:1299. doi: 10.12688/f1000research.27096.2. eCollection 2020.
7
Activating transcription factor 6 reduces Aβ1-42 and restores memory in Alzheimer's disease model mice.激活转录因子 6 减少 Aβ1-42 并恢复阿尔茨海默病模型小鼠的记忆。
Int J Neurosci. 2020 Oct;130(10):1015-1023. doi: 10.1080/00207454.2020.1715977. Epub 2020 Jan 21.
8
On the Use of the Lasso for Instrumental Variables Estimation with Some Invalid Instruments.关于在存在一些无效工具变量的情况下使用套索进行工具变量估计
J Am Stat Assoc. 2018 Nov 13;114(527):1339-1350. doi: 10.1080/01621459.2018.1498346. eCollection 2019.
9
Constrained likelihood for reconstructing a directed acyclic Gaussian graph.用于重建有向无环高斯图的约束似然法。
Biometrika. 2019 Mar;106(1):109-125. doi: 10.1093/biomet/asy057. Epub 2018 Dec 13.
10
Two-stage residual inclusion for survival data and competing risks-An instrumental variable approach with application to SEER-Medicare linked data.两阶段残差纳入法在生存数据和竞争风险中的应用——基于工具变量的方法及其在 SEER-Medicare 关联数据中的应用。
Stat Med. 2019 May 10;38(10):1775-1801. doi: 10.1002/sim.8071. Epub 2018 Dec 26.