Suppr超能文献

双重经验贝叶斯检验

Double Empirical Bayes Testing.

作者信息

Tansey Wesley, Wang Yixin, Rabadan Raul, Blei David M

机构信息

Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA.

Department of Statistics, Columbia University, New York, NY, USA.

出版信息

Int Stat Rev. 2020 Dec;88(Suppl 1):S91-S113. doi: 10.1111/insr.12430. Epub 2020 Nov 25.

Abstract

Analyzing data from large-scale, multi-experiment studies requires scientists to both analyze each experiment and to assess the results as a whole. In this article, we develop double empirical Bayes testing (DEBT), an empirical Bayes method for analyzing multi-experiment studies when many covariates are gathered per experiment. DEBT is a two-stage method: in the first stage, it reports which experiments yielded significant outcomes; in the second stage, it hypothesizes which covariates drive the experimental significance. In both of its stages, DEBT builds on Efron (2008), which lays out an elegant empirical Bayes approach to testing. DEBT enhances this framework by learning a series of black box predictive models to boost power and control the false discovery rate (FDR). In Stage 1, it uses a deep neural network prior to report which experiments yielded significant outcomes. In Stage 2, it uses an empirical Bayes version of the knockoff filter (Candes et al., 2018) to select covariates that have significant predictive power of Stage-1 significance. In both simulated and real data, DEBT increases the proportion of discovered significant outcomes and selects more features when signals are weak. In a real study of cancer cell lines, DEBT selects a robust set of biologically-plausible genomic drivers of drug sensitivity and resistance in cancer.

摘要

分析来自大规模多实验研究的数据,要求科学家既要分析每个实验,又要整体评估结果。在本文中,我们开发了双重经验贝叶斯检验(DEBT),这是一种经验贝叶斯方法,用于在每个实验收集了许多协变量时分析多实验研究。DEBT是一种两阶段方法:在第一阶段,它报告哪些实验产生了显著结果;在第二阶段,它假设哪些协变量驱动了实验的显著性。在其两个阶段中,DEBT都基于Efron(2008),该文献提出了一种优雅的经验贝叶斯检验方法。DEBT通过学习一系列黑箱预测模型来提高功效并控制错误发现率(FDR),从而增强了这一框架。在第一阶段,它使用深度神经网络来报告哪些实验产生了显著结果。在第二阶段,它使用仿冒筛选器(Candes等人,2018)的经验贝叶斯版本来选择对第一阶段显著性具有显著预测能力的协变量。在模拟数据和真实数据中,当信号较弱时,DEBT都会增加发现的显著结果的比例并选择更多特征。在一项对癌细胞系的实际研究中,DEBT选择了一组可靠的、具有生物学合理性的癌症药物敏感性和耐药性的基因组驱动因素。

相似文献

1
Double Empirical Bayes Testing.双重经验贝叶斯检验
Int Stat Rev. 2020 Dec;88(Suppl 1):S91-S113. doi: 10.1111/insr.12430. Epub 2020 Nov 25.
2
Deep direct likelihood knockoffs.深度直接似然性仿样
Adv Neural Inf Process Syst. 2020 Dec;33:5036-5046.
4
IPAD: Stable Interpretable Forecasting with Knockoffs Inference.IPAD:基于仿冒品推断的稳定可解释预测
J Am Stat Assoc. 2020;115(532):1822-1834. doi: 10.1080/01621459.2019.1654878. Epub 2019 Sep 17.
8
Empirical Bayes screening of many p-values with applications to microarray studies.用于微阵列研究的多p值经验贝叶斯筛选。
Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2.
9
Competition-based control of the false discovery proportion.基于竞争的假发现率控制。
Biometrics. 2023 Dec;79(4):3472-3484. doi: 10.1111/biom.13830. Epub 2023 Jan 30.

本文引用的文献

2
Causal inference in genetic trio studies.遗传三体型研究中的因果推断。
Proc Natl Acad Sci U S A. 2020 Sep 29;117(39):24117-24126. doi: 10.1073/pnas.2007743117. Epub 2020 Sep 18.
5
A Landscape of Pharmacogenomic Interactions in Cancer.癌症中的药物基因组学相互作用全景
Cell. 2016 Jul 28;166(3):740-754. doi: 10.1016/j.cell.2016.06.017. Epub 2016 Jul 7.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验