• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用准实验量化数据科学中的因果关系。

Quantifying causality in data science with quasi-experiments.

作者信息

Liu Tony, Ungar Lyle, Kording Konrad

机构信息

Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA, USA.

Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, USA.

出版信息

Nat Comput Sci. 2021 Jan;1(1):24-32. doi: 10.1038/s43588-020-00005-8. Epub 2021 Jan 14.

DOI:10.1038/s43588-020-00005-8
PMID:35662911
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9165615/
Abstract

Estimating causality from observational data is essential in many data science questions but can be a challenging task. Here we review approaches to causality that are popular in econometrics and that exploit (quasi) random variation in existing data, called quasi-experiments, and show how they can be combined with machine learning to answer causal questions within typical data science settings. We also highlight how data scientists can help advance these methods to bring causal estimation to high-dimensional data from medicine, industry and society.

摘要

从观测数据中估计因果关系在许多数据科学问题中至关重要,但可能是一项具有挑战性的任务。在这里,我们回顾了计量经济学中流行的因果关系方法,这些方法利用现有数据中的(准)随机变化,即所谓的准实验,并展示了如何将它们与机器学习相结合,以在典型的数据科学环境中回答因果问题。我们还强调了数据科学家如何能够帮助推进这些方法,以便将因果估计应用于来自医学、工业和社会的高维数据。

相似文献

1
Quantifying causality in data science with quasi-experiments.用准实验量化数据科学中的因果关系。
Nat Comput Sci. 2021 Jan;1(1):24-32. doi: 10.1038/s43588-020-00005-8. Epub 2021 Jan 14.
2
Quantifying causal effects from observed data using quasi-intervention.使用拟干预法从观测数据中量化因果效应。
BMC Med Inform Decis Mak. 2022 Dec 21;22(1):337. doi: 10.1186/s12911-022-02086-z.
3
Non-Gaussian Methods for Causal Structure Learning.非高斯方法在因果结构学习中的应用。
Prev Sci. 2019 Apr;20(3):431-441. doi: 10.1007/s11121-018-0901-x.
4
Targeted Maximum Likelihood Estimation for Causal Inference in Observational Studies.观察性研究中因果推断的靶向最大似然估计
Am J Epidemiol. 2017 Jan 1;185(1):65-73. doi: 10.1093/aje/kww165. Epub 2016 Dec 9.
5
Formulating and Answering High-Impact Causal Questions in Physiologic Childbirth Science: Concepts and Assumptions.在生理性分娩科学中提出并回答具有重大影响的因果问题:概念与假设
J Midwifery Womens Health. 2018 Nov;63(6):721-730. doi: 10.1111/jmwh.12868. Epub 2018 Jun 8.
6
Thirteen Questions About Using Machine Learning in Causal Research (You Won't Believe the Answer to Number 10!).使用机器学习进行因果研究的十三个问题(你不会相信问题 10 的答案!)!
Am J Epidemiol. 2021 Aug 1;190(8):1476-1482. doi: 10.1093/aje/kwab047.
7
Detecting causality from time series in a machine learning framework.在机器学习框架中从时间序列中检测因果关系。
Chaos. 2020 Jun;30(6):063116. doi: 10.1063/5.0007670.
8
Emergence and Causality in Complex Systems: A Survey of Causal Emergence and Related Quantitative Studies.复杂系统中的涌现与因果关系:因果涌现及相关定量研究综述
Entropy (Basel). 2024 Jan 24;26(2):108. doi: 10.3390/e26020108.
9
Quasi-experiments to establish causal effects of HIV care and treatment and to improve the cascade of care.旨在确定艾滋病护理与治疗的因果效应并改善护理流程的准实验。
Curr Opin HIV AIDS. 2015 Nov;10(6):495-501. doi: 10.1097/COH.0000000000000191.
10
Strengthening Association through Causal Inference.通过因果推理加强关联。
Plast Reconstr Surg. 2023 Oct 1;152(4):899-907. doi: 10.1097/PRS.0000000000010305. Epub 2023 Feb 15.

引用本文的文献

1
Machine Learning-Based Prediction of Well Logs Guided by Rock Physics and Its Interpretation.基于岩石物理指导的测井曲线机器学习预测及其解释
Sensors (Basel). 2025 Jan 30;25(3):836. doi: 10.3390/s25030836.
2
Causal inference concepts can guide research into the effects of climate on infectious diseases.因果推断概念可指导关于气候对传染病影响的研究。
Nat Ecol Evol. 2025 Feb;9(2):349-363. doi: 10.1038/s41559-024-02594-3. Epub 2024 Nov 25.
3
Layer-by-layer unsupervised clustering of statistically relevant fluctuations in noisy time-series data of complex dynamical systems.复杂动力系统噪声时间序列数据中统计相关波动的逐层无监督聚类。
Proc Natl Acad Sci U S A. 2024 Aug 13;121(33):e2403771121. doi: 10.1073/pnas.2403771121. Epub 2024 Aug 7.
4
Claim causality with clarity.明确主张因果关系。
Psychoradiology. 2023 Jun 9;3:kkad007. doi: 10.1093/psyrad/kkad007. eCollection 2023.
5
Exposure to urban and rural contexts shapes smartphone usage behavior.接触城市和农村环境会塑造智能手机使用行为。
PNAS Nexus. 2023 Nov 28;2(11):pgad357. doi: 10.1093/pnasnexus/pgad357. eCollection 2023 Nov.
6
Inferring causal connectivity from pairwise recordings and optogenetics.从成对记录和光遗传学推断因果连通性。
PLoS Comput Biol. 2023 Nov 7;19(11):e1011574. doi: 10.1371/journal.pcbi.1011574. eCollection 2023 Nov.
7
Bayesian model averaging for nonparametric discontinuity design.贝叶斯模型平均的非参数间断设计。
PLoS One. 2022 Jun 30;17(6):e0270310. doi: 10.1371/journal.pone.0270310. eCollection 2022.
8
Gendered beliefs about mathematics ability transmit across generations through children's peers.性别化的数学能力观念通过儿童的同龄人在代际间传递。
Nat Hum Behav. 2022 Jun;6(6):868-879. doi: 10.1038/s41562-022-01331-9. Epub 2022 Apr 7.
9
The impact of unemployment benefits on birth outcomes: Quasi-experimental evidence from European linked register data.失业救济金对生育结果的影响:来自欧洲关联登记数据的准实验证据。
PLoS One. 2022 Mar 2;17(3):e0264544. doi: 10.1371/journal.pone.0264544. eCollection 2022.
10
Modes of information flow in collective cohesion.集体凝聚力中的信息流模式。
Sci Adv. 2022 Feb 11;8(6):eabj1720. doi: 10.1126/sciadv.abj1720. Epub 2022 Feb 9.

本文引用的文献

1
Inferring causal connectivity from pairwise recordings and optogenetics.从成对记录和光遗传学推断因果连通性。
PLoS Comput Biol. 2023 Nov 7;19(11):e1011574. doi: 10.1371/journal.pcbi.1011574. eCollection 2023 Nov.
2
Regression discontinuity threshold optimization.回归间断点阈值优化。
PLoS One. 2022 Nov 16;17(11):e0276755. doi: 10.1371/journal.pone.0276755. eCollection 2022.
3
CAUSAL INTERPRETATIONS OF BLACK-BOX MODELS.黑箱模型的因果解释
J Bus Econ Stat. 2019;2019. doi: 10.1080/07350015.2019.1624293. Epub 2019 Jul 5.
4
Powerful three-sample genome-wide design and robust statistical inference in summary-data Mendelian randomization.基于汇总数据孟德尔随机化的强大三样本全基因组设计和稳健的统计推断。
Int J Epidemiol. 2019 Oct 1;48(5):1478-1492. doi: 10.1093/ije/dyz142.
5
Quasi-experimental causality in neuroscience and behavioural research.神经科学和行为研究中的准实验因果关系。
Nat Hum Behav. 2018 Dec;2(12):891-898. doi: 10.1038/s41562-018-0466-5. Epub 2018 Nov 26.
6
Causal inference in economics and marketing.经济学与市场营销中的因果推断。
Proc Natl Acad Sci U S A. 2016 Jul 5;113(27):7310-5. doi: 10.1073/pnas.1510479113.
7
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.
8
Regression discontinuity designs are underutilized in medicine, epidemiology, and public health: a review of current and best practice.回归不连续性设计在医学、流行病学和公共卫生领域未得到充分利用:对当前和最佳实践的综述。
J Clin Epidemiol. 2015 Feb;68(2):122-33. doi: 10.1016/j.jclinepi.2014.06.021.
9
Methods for evaluating changes in health care policy: the difference-in-differences approach.评估医疗保健政策变化的方法:双重差分法
JAMA. 2014 Dec 10;312(22):2401-2. doi: 10.1001/jama.2014.16153.
10
Association of the 2011 ACGME resident duty hour reforms with mortality and readmissions among hospitalized Medicare patients.2011年美国研究生医学教育认证委员会(ACGME)住院医师值班时长改革与医疗保险住院患者死亡率及再入院率的关联
JAMA. 2014 Dec 10;312(22):2364-73. doi: 10.1001/jama.2014.15273.