双机器学习估计量速率双重稳健性的无假设证伪检验。

Assumption-lean falsification tests of rate double-robustness of double-machine-learning estimators.

作者信息

Liu Lin, Mukherjee Rajarshi, Robins James M

机构信息

Institute of Natural Sciences, MOE-LSC, School of Mathematical Sciences, CMA-Shanghai, SJTU-Yale Joint Center for Biostatistics and Data Science, Shanghai Jiao Tong University; Shanghai Artificial Intelligence Laboratory.

Department of Biostatistics, Harvard University.

出版信息

J Econom. 2024 Mar;240(2). doi: 10.1016/j.jeconom.2023.105500. Epub 2023 Aug 19.

DOI:10.1016/j.jeconom.2023.105500

PMID:38680250

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11052545/

Abstract

The class of doubly robust (DR) functionals studied by Rotnitzky et al. (2021) is of central importance in economics and biostatistics. It strictly includes both (i) the class of mean-square continuous functionals that can be written as an expectation of an affine functional of a conditional expectation studied by Chernozhukov et al. (2022b) and the class of functionals studied by Robins et al. (2008). The present state-of-the-art estimators for DR functionals are double-machine-learning (DML) estimators (Chernozhukov et al., 2018). A DML estimator of depends on estimates and of a pair of nuisance functions and , and is said to satisfy "rate double-robustness" if the Cauchy-Schwarz upper bound of its bias is . Were it achievable, our scientific goal would have been to construct valid, assumption-lean (i.e. no complexity-reducing assumptions on or ) tests of the validity of a nominal () Wald confidence interval (CI) centered at . But this would require a test of the bias to be , which can be shown not to exist. We therefore adopt the less ambitious goal of falsifying, when possible, an analyst's justification for her claim that the reported () Wald CI is valid. In many instances, an analyst justifies her claim by imposing complexity-reducing assumptions on and to ensure "rate double-robustness". Here we exhibit valid, assumption-lean tests of : "rate double-robustness holds", with non-trivial power against certain alternatives. If is rejected, we will have falsified her justification. However, no assumption-lean test of , including ours, can be a consistent test. Thus, the failure of our test to reject is not meaningful evidence in favor of .

摘要

Rotnitzky等人（2021年）研究的双稳健（DR）泛函类在经济学和生物统计学中具有核心重要性。它严格包含以下两类：（i）可写成Chernozhukov等人（2022b年）研究的条件期望的仿射泛函期望的均方连续泛函类，以及Robins等人（2008年）研究的泛函类。目前DR泛函的最优估计量是双机器学习（DML）估计量（Chernozhukov等人，2018年）。DR泛函的DML估计量取决于一对干扰函数的估计量和，并且如果其偏差的柯西 - 施瓦茨上界为，则称其满足“速率双稳健性”。如果能够实现，我们的科学目标本应是构建关于以为中心的名义（）Wald置信区间（CI）有效性的有效、假设较少（即对或无复杂度降低假设）的检验。但这需要一个偏差为的检验，而这是不存在的。因此，我们采用了一个不那么雄心勃勃的目标，即尽可能证伪分析师声称所报告的（）Wald CI有效的理由。在许多情况下，分析师通过对和施加复杂度降低假设来确保“速率双稳健性”，以此来证明其声称的合理性。在这里，我们展示了关于“速率双稳健性成立”的有效、假设较少的检验，对某些备择假设具有非平凡的检验功效。如果被拒绝，我们就证伪了她的理由。然而，包括我们的检验在内，没有任何假设较少的检验可以是一致检验。因此，我们的检验未能拒绝并不是支持的有意义证据。

相似文献

Assumption-lean falsification tests of rate double-robustness of double-machine-learning estimators.双机器学习估计量速率双重稳健性的无假设证伪检验。

J Econom. 2024 Mar;240(2). doi: 10.1016/j.jeconom.2023.105500. Epub 2023 Aug 19.

Double Robust Efficient Estimators of Longitudinal Treatment Effects: Comparative Performance in Simulations and a Case Study.纵向治疗效果的双重稳健有效估计量：模拟中的比较性能及一个案例研究

Int J Biostat. 2019 Feb 26;15(2):/j/ijb.2019.15.issue-2/ijb-2017-0054/ijb-2017-0054.xml. doi: 10.1515/ijb-2017-0054.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Statistical inference for data-adaptive doubly robust estimators with survival outcomes.基于生存数据的自适应双稳健估计的统计推断。

Stat Med. 2019 Jul 10;38(15):2735-2748. doi: 10.1002/sim.8156. Epub 2019 Apr 4.

Collaborative double robust targeted maximum likelihood estimation.协作双稳健靶向最大似然估计

Int J Biostat. 2010 May 17;6(1):Article 17. doi: 10.2202/1557-4679.1181.

MULTIPLY ROBUST ESTIMATORS OF CAUSAL EFFECTS FOR SURVIVAL OUTCOMES.生存结局因果效应的多重稳健估计量

Scand Stat Theory Appl. 2022 Sep;49(3):1304-1328. doi: 10.1111/sjos.12561. Epub 2021 Nov 11.

Machine Learning for Causal Inference: On the Use of Cross-fit Estimators.机器学习在因果推断中的应用：基于交叉拟合估计量的研究。

Epidemiology. 2021 May 1;32(3):393-401. doi: 10.1097/EDE.0000000000001332.

Challenges in Obtaining Valid Causal Effect Estimates with Machine Learning Algorithms.使用机器学习算法获取有效因果效应估计值面临的挑战。

Am J Epidemiol. 2023 Sep 1;192(9). doi: 10.1093/aje/kwab201. Epub 2021 Jul 15.

Machine learning in causal inference for epidemiology.流行病学中的因果推理中的机器学习。

Eur J Epidemiol. 2024 Oct;39(10):1097-1108. doi: 10.1007/s10654-024-01173-x. Epub 2024 Nov 13.

Doubly robust inference for targeted minimum loss-based estimation in randomized trials with missing outcome data.在存在结局数据缺失的随机试验中，基于目标最小损失估计的双重稳健推断。

Stat Med. 2017 Oct 30;36(24):3807-3819. doi: 10.1002/sim.7389. Epub 2017 Jul 25.

本文引用的文献

Identifying Causal Effects With Proxy Variables of an Unmeasured Confounder.利用未测量混杂因素的替代变量识别因果效应。

Biometrika. 2018 Dec;105(4):987-993. doi: 10.1093/biomet/asy038. Epub 2018 Aug 13.

HIGHER ORDER ESTIMATING EQUATIONS FOR HIGH-DIMENSIONAL MODELS.高维模型的高阶估计方程

Ann Stat. 2017 Oct;45(5):1951-1987. doi: 10.1214/16-AOS1515. Epub 2017 Oct 31.

Semiparametric Minimax Rates.半参数极小极大率

Electron J Stat. 2009;3:1305-1321. doi: 10.1214/09-EJS479. Epub 2009 Dec 4.

Minimax estimation of the integral of a power of a density.密度幂次积分的极小极大估计。

Stat Probab Lett. 2008 Dec 15;78(18):3307-3311. doi: 10.1016/j.spl.2008.07.001. Epub 2008 Jul 15.

Higher Order Inference On A Treatment Effect Under Low Regularity Conditions.低正则性条件下治疗效果的高阶推断

Stat Probab Lett. 2011 Jul 1;81(7):821-828. doi: 10.1016/j.spl.2011.02.030.

Doubly robust estimation in missing data and causal inference models.缺失数据与因果推断模型中的双重稳健估计

Biometrics. 2005 Dec;61(4):962-73. doi: 10.1111/j.1541-0420.2005.00377.x.

Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models.迈向半参数模型的维度诅咒适配（CODA）渐近理论。

Stat Med. 1997;16(1-3):285-319. doi: 10.1002/(sici)1097-0258(19970215)16:3<285::aid-sim535>3.0.co;2-#.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。