Zhan Tianyu, Kang Jian
Data and Statistical Sciences, AbbVie Inc., North Chicago, IL.
Department of Biostatistics, University of Michigan, Ann Arbor, MI.
J Comput Graph Stat. 2022;31(3):856-865. doi: 10.1080/10618600.2021.2020128. Epub 2022 Feb 18.
In the problem of composite hypothesis testing, identifying the potential uniformly most powerful (UMP) unbiased test is of great interest. Beyond typical hypothesis settings with exponential family, it is usually challenging to prove the existence and further construct such UMP unbiased tests with finite sample size. For example in the COVID-19 pandemic with limited previous assumptions on the treatment for investigation and the standard of care, adaptive clinical trials are appealing due to ethical considerations, and the ability to accommodate uncertainty while conducting the trial. Although several methods have been proposed to control Type I error rates, how to find a more powerful hypothesis testing strategy is still an open question. Motivated by this problem, we propose an automatic framework of constructing test statistics and corresponding critical values via machine learning methods to enhance power in a finite sample. In this article, we particularly illustrate the performance using Deep Neural Networks (DNN) and discuss its advantages. Simulations and two case studies of adaptive designs demonstrate that our method is automatic, general and prespecified to construct statistics with satisfactory power in finite-sample. Supplemental materials are available online including R code and an R shiny app.
在复合假设检验问题中,识别潜在的一致最强大(UMP)无偏检验非常重要。除了指数族的典型假设设置外,证明这种UMP无偏检验在有限样本量下的存在性并进一步构建它们通常具有挑战性。例如,在新冠疫情中,由于对调查治疗和护理标准的先前假设有限,出于伦理考虑以及在进行试验时适应不确定性的能力,适应性临床试验很有吸引力。尽管已经提出了几种方法来控制I型错误率,但如何找到一种更强大的假设检验策略仍然是一个悬而未决的问题。受此问题启发,我们提出了一个通过机器学习方法构建检验统计量和相应临界值的自动框架,以在有限样本中提高检验功效。在本文中,我们特别说明了使用深度神经网络(DNN)的性能并讨论了其优点。模拟和两个适应性设计的案例研究表明,我们的方法是自动、通用且预先指定的,能够在有限样本中构建具有令人满意功效的统计量。补充材料可在网上获取,包括R代码和一个R shiny应用程序。