双重经验贝叶斯检验

Double Empirical Bayes Testing.

作者信息

Tansey Wesley, Wang Yixin, Rabadan Raul, Blei David M

机构信息

Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA.

Department of Statistics, Columbia University, New York, NY, USA.

出版信息

Int Stat Rev. 2020 Dec;88(Suppl 1):S91-S113. doi: 10.1111/insr.12430. Epub 2020 Nov 25.

DOI:10.1111/insr.12430

PMID:35356801

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8963776/

Abstract

Analyzing data from large-scale, multi-experiment studies requires scientists to both analyze each experiment and to assess the results as a whole. In this article, we develop double empirical Bayes testing (DEBT), an empirical Bayes method for analyzing multi-experiment studies when many covariates are gathered per experiment. DEBT is a two-stage method: in the first stage, it reports which experiments yielded significant outcomes; in the second stage, it hypothesizes which covariates drive the experimental significance. In both of its stages, DEBT builds on Efron (2008), which lays out an elegant empirical Bayes approach to testing. DEBT enhances this framework by learning a series of black box predictive models to boost power and control the false discovery rate (FDR). In Stage 1, it uses a deep neural network prior to report which experiments yielded significant outcomes. In Stage 2, it uses an empirical Bayes version of the knockoff filter (Candes et al., 2018) to select covariates that have significant predictive power of Stage-1 significance. In both simulated and real data, DEBT increases the proportion of discovered significant outcomes and selects more features when signals are weak. In a real study of cancer cell lines, DEBT selects a robust set of biologically-plausible genomic drivers of drug sensitivity and resistance in cancer.

摘要

分析来自大规模多实验研究的数据，要求科学家既要分析每个实验，又要整体评估结果。在本文中，我们开发了双重经验贝叶斯检验（DEBT），这是一种经验贝叶斯方法，用于在每个实验收集了许多协变量时分析多实验研究。DEBT是一种两阶段方法：在第一阶段，它报告哪些实验产生了显著结果；在第二阶段，它假设哪些协变量驱动了实验的显著性。在其两个阶段中，DEBT都基于Efron（2008），该文献提出了一种优雅的经验贝叶斯检验方法。DEBT通过学习一系列黑箱预测模型来提高功效并控制错误发现率（FDR），从而增强了这一框架。在第一阶段，它使用深度神经网络来报告哪些实验产生了显著结果。在第二阶段，它使用仿冒筛选器（Candes等人，2018）的经验贝叶斯版本来选择对第一阶段显著性具有显著预测能力的协变量。在模拟数据和真实数据中，当信号较弱时，DEBT都会增加发现的显著结果的比例并选择更多特征。在一项对癌细胞系的实际研究中，DEBT选择了一组可靠的、具有生物学合理性的癌症药物敏感性和耐药性的基因组驱动因素。

相似文献

Double Empirical Bayes Testing.双重经验贝叶斯检验

Int Stat Rev. 2020 Dec;88(Suppl 1):S91-S113. doi: 10.1111/insr.12430. Epub 2020 Nov 25.

Deep direct likelihood knockoffs.深度直接似然性仿样

Adv Neural Inf Process Syst. 2020 Dec;33:5036-5046.

DeepPIG: deep neural network architecture with pairwise connected layers and stochastic gates using knockoff frameworks for feature selection.深度伪像生成网络（DeepPIG）：一种具有成对连接层和随机门的深度神经网络架构，使用仿冒框架进行特征选择。

Sci Rep. 2024 Jul 6;14(1):15582. doi: 10.1038/s41598-024-66061-6.

IPAD: Stable Interpretable Forecasting with Knockoffs Inference.IPAD：基于仿冒品推断的稳定可解释预测

J Am Stat Assoc. 2020;115(532):1822-1834. doi: 10.1080/01621459.2019.1654878. Epub 2019 Sep 17.

Assessing differential expression in two-color microarrays: a resampling-based empirical Bayes approach.评估双色微阵列中的差异表达：基于重采样的经验贝叶斯方法。

PLoS One. 2013 Nov 27;8(11):e80099. doi: 10.1371/journal.pone.0080099. eCollection 2013.

Statistical detection of EEG synchrony using empirical bayesian inference.使用经验贝叶斯推断进行脑电图同步性的统计检测。

PLoS One. 2015 Mar 30;10(3):e0121795. doi: 10.1371/journal.pone.0121795. eCollection 2015.

DeepLINK: Deep learning inference using knockoffs with applications to genomics.DeepLINK：使用 Knockoffs 进行深度学习推断及其在基因组学中的应用。

Proc Natl Acad Sci U S A. 2021 Sep 7;118(36). doi: 10.1073/pnas.2104683118.

Empirical Bayes screening of many p-values with applications to microarray studies.用于微阵列研究的多p值经验贝叶斯筛选。

Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2.

Competition-based control of the false discovery proportion.基于竞争的假发现率控制。

Biometrics. 2023 Dec;79(4):3472-3484. doi: 10.1111/biom.13830. Epub 2023 Jan 30.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

本文引用的文献

Fast and powerful conditional randomization testing via distillation.通过蒸馏实现快速且强大的条件随机化测试。

Biometrika. 2022 Jun;109(2):277-293. doi: 10.1093/biomet/asab039. Epub 2021 Jul 8.

Causal inference in genetic trio studies.遗传三体型研究中的因果推断。

Proc Natl Acad Sci U S A. 2020 Sep 29;117(39):24117-24126. doi: 10.1073/pnas.2007743117. Epub 2020 Sep 18.

Identification and correction of spatial bias are essential for obtaining quality data in high-throughput screening technologies.识别和纠正空间偏差对于在高通量筛选技术中获取高质量数据至关重要。

Sci Rep. 2017 Sep 20;7(1):11921. doi: 10.1038/s41598-017-11940-4.

The tumour suppressor CYLD regulates the p53 DNA damage response.抑癌基因 CYLD 调节 p53 对 DNA 损伤的反应。

Nat Commun. 2016 Aug 26;7:12508. doi: 10.1038/ncomms12508.

A Landscape of Pharmacogenomic Interactions in Cancer.癌症中的药物基因组学相互作用全景

Cell. 2016 Jul 28;166(3):740-754. doi: 10.1016/j.cell.2016.06.017. Epub 2016 Jul 7.

False discovery rate regression: an application to neural synchrony detection in primary visual cortex.错误发现率回归：在初级视觉皮层神经同步检测中的应用

J Am Stat Assoc. 2015;110(510):459-471. doi: 10.1080/01621459.2014.990973.

MDM2 is an important prognostic and predictive factor for platin-pemetrexed therapy in malignant pleural mesotheliomas and deregulation of P14/ARF (encoded by CDKN2A) seems to contribute to an MDM2-driven inactivation of P53.MDM2是恶性胸膜间皮瘤铂类培美曲塞治疗的重要预后和预测因素，而P14/ARF（由CDKN2A编码）的失调似乎导致了MDM2驱动的P53失活。

Br J Cancer. 2015 Mar 3;112(5):883-90. doi: 10.1038/bjc.2015.27. Epub 2015 Feb 10.

The pharmacodynamics of the p53-Mdm2 targeting drug Nutlin: the role of gene-switching noise.靶向p53-Mdm2的药物Nutlin的药效学：基因转换噪声的作用

PLoS Comput Biol. 2014 Dec 11;10(12):e1003991. doi: 10.1371/journal.pcbi.1003991. eCollection 2014 Dec.

Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells.癌症药物敏感性基因组学（GDSC）：癌症细胞治疗生物标志物发现的资源。

Nucleic Acids Res. 2013 Jan;41(Database issue):D955-61. doi: 10.1093/nar/gks1111. Epub 2012 Nov 23.

Systematic identification of genomic markers of drug sensitivity in cancer cells.系统鉴定癌细胞药物敏感性的基因组标记物。

Nature. 2012 Mar 28;483(7391):570-5. doi: 10.1038/nature11005.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。