Suppr超能文献

相关性的大规模多重检验

Large-Scale Multiple Testing of Correlations.

作者信息

Cai T Tony, Liu Weidong

机构信息

Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104 (

Department of Mathematics, Institute of Natural Sciences and MOE-LSC, Shanghai Jiao Tong University, Shanghai, China (

出版信息

J Am Stat Assoc. 2016;111(513):229-240. doi: 10.1080/01621459.2014.999157. Epub 2016 May 5.

Abstract

Multiple testing of correlations arises in many applications including gene coexpression network analysis and brain connectivity analysis. In this paper, we consider large scale simultaneous testing for correlations in both the one-sample and two-sample settings. New multiple testing procedures are proposed and a bootstrap method is introduced for estimating the proportion of the nulls falsely rejected among all the true nulls. The properties of the proposed procedures are investigated both theoretically and numerically. It is shown that the procedures asymptotically control the overall false discovery rate and false discovery proportion at the nominal level. Simulation results show that the methods perform well numerically in terms of both the size and power of the test and it significantly outperforms two alternative methods. The two-sample procedure is also illustrated by an analysis of a prostate cancer dataset for the detection of changes in coexpression patterns between gene expression levels.

摘要

相关性的多重检验出现在许多应用中,包括基因共表达网络分析和脑连接性分析。在本文中,我们考虑在单样本和两样本设置下对相关性进行大规模同时检验。提出了新的多重检验程序,并引入了一种自助法来估计在所有真零假设中被错误拒绝的零假设的比例。从理论和数值两方面研究了所提出程序的性质。结果表明,这些程序在名义水平上渐近地控制了总体错误发现率和错误发现比例。模拟结果表明,这些方法在检验的大小和功效方面在数值上表现良好,并且显著优于两种替代方法。通过对一个前列腺癌数据集进行分析,以检测基因表达水平之间共表达模式的变化,对两样本程序进行了说明。

相似文献

1
Large-Scale Multiple Testing of Correlations.
J Am Stat Assoc. 2016;111(513):229-240. doi: 10.1080/01621459.2014.999157. Epub 2016 May 5.
3
Global and Simultaneous Hypothesis Testing for High-Dimensional Logistic Regression Models.
J Am Stat Assoc. 2021;116(534):984-998. doi: 10.1080/01621459.2019.1699421. Epub 2020 Jan 21.
4
Multiple Testing of Submatrices of a Precision Matrix with Applications to Identification of Between Pathway Interactions.
J Am Stat Assoc. 2018;113(521):328-339. doi: 10.1080/01621459.2016.1251930. Epub 2017 Sep 26.
5
False Discovery Rate Control With Groups.
J Am Stat Assoc. 2010 Sep 1;105(491):1215-1227. doi: 10.1198/jasa.2010.tm09329.
6
FarmTest: Factor-adjusted robust multiple testing with approximate false discovery control.
J Am Stat Assoc. 2019;114(528):1880-1893. doi: 10.1080/01621459.2018.1527700. Epub 2019 Mar 20.
7
Large-Scale Simultaneous Testing of Cross-Covariance Matrices with Applications to PheWAS.
Stat Sin. 2019 Apr;29(2):983-1005. doi: 10.5705/ss.202017.0189.
8
False discovery rate control for high dimensional networks of quantile associations conditioning on covariates.
J R Stat Soc Series B Stat Methodol. 2018 Nov;80(5):1015-1034. doi: 10.1111/rssb.12288. Epub 2018 Jul 19.
9
Bias and variance reduction in estimating the proportion of true-null hypotheses.
Biostatistics. 2015 Jan;16(1):189-204. doi: 10.1093/biostatistics/kxu029. Epub 2014 Jun 23.

引用本文的文献

1
FLEXIBILITY IN GENE COEXPRESSION AT DEVELOPMENTAL AND EVOLUTIONARY TIMESCALES.
bioRxiv. 2024 Dec 11:2024.12.10.627761. doi: 10.1101/2024.12.10.627761.
3
A double-robust test for high-dimensional gene coexpression networks conditioning on clinical information.
Biometrics. 2023 Dec;79(4):3227-3238. doi: 10.1111/biom.13890. Epub 2023 Jun 13.
4
LARGE-SCALE MULTIPLE INFERENCE OF COLLECTIVE DEPENDENCE WITH APPLICATIONS TO PROTEIN FUNCTION.
Ann Appl Stat. 2021 Jun;15(2):902-924. doi: 10.1214/20-aoas1431. Epub 2021 Jul 12.
5
The significance of neural inter-frequency power correlations.
Sci Rep. 2021 Nov 30;11(1):23190. doi: 10.1038/s41598-021-02277-0.
6
Severe COVID-19 is associated with hyperactivation of the alternative complement pathway.
J Allergy Clin Immunol. 2022 Feb;149(2):550-556.e2. doi: 10.1016/j.jaci.2021.11.004. Epub 2021 Nov 17.
7
Simultaneous Covariance Inference for Multimodal Integrative Analysis.
J Am Stat Assoc. 2020;115(531):1279-1291. doi: 10.1080/01621459.2019.1623040. Epub 2019 Jun 28.
8
Multimodal neurocognitive markers of frontal lobe epilepsy: Insights from ecological text processing.
Neuroimage. 2021 Jul 15;235:117998. doi: 10.1016/j.neuroimage.2021.117998. Epub 2021 Mar 28.
9
A mixture model to detect edges in sparse co-expression graphs with an application for comparing breast cancer subtypes.
PLoS One. 2021 Feb 11;16(2):e0246945. doi: 10.1371/journal.pone.0246945. eCollection 2021.

本文引用的文献

1
False Discovery Control in Large-Scale Spatial Multiple Testing.
J R Stat Soc Series B Stat Methodol. 2015 Jan 1;77(1):59-83. doi: 10.1111/rssb.12064.
3
Multiple common variants for celiac disease influencing immune gene expression.
Nat Genet. 2010 Apr;42(4):295-302. doi: 10.1038/ng.543. Epub 2010 Feb 28.
4
Class-specific correlations of gene expressions: identification and their effects on clustering analyses.
Am J Hum Genet. 2008 Aug;83(2):269-77. doi: 10.1016/j.ajhg.2008.07.009.
5
Socioeconomic status predicts hemispheric specialisation of the left inferior frontal gyrus in young children.
Neuroimage. 2008 Apr 15;40(3):1392-401. doi: 10.1016/j.neuroimage.2008.01.021. Epub 2008 Jan 29.
6
Omics-based identification of Arabidopsis Myb transcription factors regulating aliphatic glucosinolate biosynthesis.
Proc Natl Acad Sci U S A. 2007 Apr 10;104(15):6478-83. doi: 10.1073/pnas.0611629104. Epub 2007 Apr 9.
8
Intellectual ability and cortical development in children and adolescents.
Nature. 2006 Mar 30;440(7084):676-9. doi: 10.1038/nature04513.
10
Finding disease specific alterations in the co-expression of genes.
Bioinformatics. 2004 Aug 4;20 Suppl 1:i194-9. doi: 10.1093/bioinformatics/bth909.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验