Suppr超能文献

双机器学习估计量速率双重稳健性的无假设证伪检验。

Assumption-lean falsification tests of rate double-robustness of double-machine-learning estimators.

作者信息

Liu Lin, Mukherjee Rajarshi, Robins James M

机构信息

Institute of Natural Sciences, MOE-LSC, School of Mathematical Sciences, CMA-Shanghai, SJTU-Yale Joint Center for Biostatistics and Data Science, Shanghai Jiao Tong University; Shanghai Artificial Intelligence Laboratory.

Department of Biostatistics, Harvard University.

出版信息

J Econom. 2024 Mar;240(2). doi: 10.1016/j.jeconom.2023.105500. Epub 2023 Aug 19.

Abstract

The class of doubly robust (DR) functionals studied by Rotnitzky et al. (2021) is of central importance in economics and biostatistics. It strictly includes both (i) the class of mean-square continuous functionals that can be written as an expectation of an affine functional of a conditional expectation studied by Chernozhukov et al. (2022b) and the class of functionals studied by Robins et al. (2008). The present state-of-the-art estimators for DR functionals are double-machine-learning (DML) estimators (Chernozhukov et al., 2018). A DML estimator of depends on estimates and of a pair of nuisance functions and , and is said to satisfy "rate double-robustness" if the Cauchy-Schwarz upper bound of its bias is . Were it achievable, our scientific goal would have been to construct valid, assumption-lean (i.e. no complexity-reducing assumptions on or ) tests of the validity of a nominal () Wald confidence interval (CI) centered at . But this would require a test of the bias to be , which can be shown not to exist. We therefore adopt the less ambitious goal of falsifying, when possible, an analyst's justification for her claim that the reported () Wald CI is valid. In many instances, an analyst justifies her claim by imposing complexity-reducing assumptions on and to ensure "rate double-robustness". Here we exhibit valid, assumption-lean tests of : "rate double-robustness holds", with non-trivial power against certain alternatives. If is rejected, we will have falsified her justification. However, no assumption-lean test of , including ours, can be a consistent test. Thus, the failure of our test to reject is not meaningful evidence in favor of .

摘要

Rotnitzky等人(2021年)研究的双稳健(DR)泛函类在经济学和生物统计学中具有核心重要性。它严格包含以下两类:(i)可写成Chernozhukov等人(2022b年)研究的条件期望的仿射泛函期望的均方连续泛函类,以及Robins等人(2008年)研究的泛函类。目前DR泛函的最优估计量是双机器学习(DML)估计量(Chernozhukov等人,2018年)。DR泛函的DML估计量取决于一对干扰函数的估计量和,并且如果其偏差的柯西 - 施瓦茨上界为,则称其满足“速率双稳健性”。如果能够实现,我们的科学目标本应是构建关于以 为中心的名义()Wald置信区间(CI)有效性的有效、假设较少(即对或无复杂度降低假设)的检验。但这需要一个偏差为的检验,而这是不存在的。因此,我们采用了一个不那么雄心勃勃的目标,即尽可能证伪分析师声称所报告的()Wald CI有效的理由。在许多情况下,分析师通过对和施加复杂度降低假设来确保“速率双稳健性”,以此来证明其声称的合理性。在这里,我们展示了关于“速率双稳健性成立”的有效、假设较少的检验,对某些备择假设具有非平凡的检验功效。如果被拒绝,我们就证伪了她的理由。然而,包括我们的检验在内,没有任何假设较少的检验可以是一致检验。因此,我们的检验未能拒绝并不是支持的有意义证据。

相似文献

5
Collaborative double robust targeted maximum likelihood estimation.协作双稳健靶向最大似然估计
Int J Biostat. 2010 May 17;6(1):Article 17. doi: 10.2202/1557-4679.1181.
6
MULTIPLY ROBUST ESTIMATORS OF CAUSAL EFFECTS FOR SURVIVAL OUTCOMES.生存结局因果效应的多重稳健估计量
Scand Stat Theory Appl. 2022 Sep;49(3):1304-1328. doi: 10.1111/sjos.12561. Epub 2021 Nov 11.
9
Machine learning in causal inference for epidemiology.流行病学中的因果推理中的机器学习。
Eur J Epidemiol. 2024 Oct;39(10):1097-1108. doi: 10.1007/s10654-024-01173-x. Epub 2024 Nov 13.

本文引用的文献

2
HIGHER ORDER ESTIMATING EQUATIONS FOR HIGH-DIMENSIONAL MODELS.高维模型的高阶估计方程
Ann Stat. 2017 Oct;45(5):1951-1987. doi: 10.1214/16-AOS1515. Epub 2017 Oct 31.
3
Semiparametric Minimax Rates.半参数极小极大率
Electron J Stat. 2009;3:1305-1321. doi: 10.1214/09-EJS479. Epub 2009 Dec 4.
4
Minimax estimation of the integral of a power of a density.密度幂次积分的极小极大估计。
Stat Probab Lett. 2008 Dec 15;78(18):3307-3311. doi: 10.1016/j.spl.2008.07.001. Epub 2008 Jul 15.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验