假发现率：大规模遗传研究中的关键概念。

The false discovery rate: a key concept in large-scale genetic studies.

机构信息

Division of Personalized Nutrition and Medicine, National Center for Toxicological Research, Food and Drug Administration, HFT-20, Jefferson, AR 72079, USA.

出版信息

Cancer Control. 2010 Jan;17(1):58-62. doi: 10.1177/107327481001700108.

DOI:10.1177/107327481001700108

PMID:20010520

Abstract

BACKGROUND

In experimental research, a statistical test is often used for making decisions on a null hypothesis such as that the means of gene expression in the normal and tumor groups are equal. Typically, a test statistic and its corresponding P value are calculated to measure the extent of the difference between the two groups. The null hypothesis is rejected and a discovery is declared when the P value is less than a prespecified significance level. When more than one test is conducted, use of a significance level intended for use by a single test typically leads to a large chance of false-positive findings.

METHODS

This paper presents an overview of the multiple testing framework and describes the false discovery rate (FDR) approach to determining the significance cutoff when a large number of tests are conducted.

RESULTS

The FDR is the expected proportion of the null hypotheses that are falsely rejected divided by the total number of rejections. An FDR-controlling procedure is described and illustrated with a numerical example.

CONCLUSIONS

In multiple testing, a classical "family-wise error rate" (FWE) approach is commonly used when the number of tests is small. When a study involves a large number of tests, the FDR error measure is a more useful approach to determining a significance cutoff, as the FWE approach is too stringent. The FDR approach allows more claims of significant differences to be made, provided the investigator is willing to accept a small fraction of false-positive findings.

摘要

背景

在实验研究中，通常会使用统计检验来对零假设做出决策，例如正常组和肿瘤组的基因表达平均值相等。通常，会计算检验统计量及其对应的 P 值，以衡量两组之间差异的程度。当 P 值小于预设的显著性水平时，就会拒绝零假设，并宣布发现。当进行多个检验时，使用单个检验的预设显著性水平通常会导致大量假阳性发现的可能性增加。

方法

本文概述了多重检验框架，并描述了当进行大量检验时确定显著性截断值的错误发现率 (FDR) 方法。

结果

FDR 是被错误拒绝的零假设数量除以总拒绝数的预期比例。描述并举例说明了一种 FDR 控制程序。

结论

在多重检验中，当检验数量较小时，通常使用经典的“总体错误率”（FWE）方法。当研究涉及大量检验时，FDR 错误度量是确定显著性截断值的更有用方法，因为 FWE 方法过于严格。FDR 方法允许做出更多的显著差异声明，前提是研究者愿意接受一小部分假阳性发现。

相似文献

The false discovery rate: a key concept in large-scale genetic studies.

Cancer Control. 2010 Jan;17(1):58-62. doi: 10.1177/107327481001700108.

Re-sampling strategy to improve the estimation of number of null hypotheses in FDR control under strong correlation structures.

BMC Bioinformatics. 2007 May 18;8:157. doi: 10.1186/1471-2105-8-157.

Comparison of methods for estimating the number of true null hypotheses in multiplicity testing.

J Biopharm Stat. 2003 Nov;13(4):675-89. doi: 10.1081/BIP-120024202.

Estimation of false discovery rate using sequential permutation p-values.

Biometrics. 2013 Mar;69(1):1-7. doi: 10.1111/j.1541-0420.2012.01825.x. Epub 2013 Feb 4.

Quick calculation for sample size while controlling false discovery rate with application to microarray analysis.

Bioinformatics. 2007 Mar 15;23(6):739-46. doi: 10.1093/bioinformatics/btl664. Epub 2007 Jan 19.

Multidimensional local false discovery rate for microarray studies.

Bioinformatics. 2006 Mar 1;22(5):556-65. doi: 10.1093/bioinformatics/btk013. Epub 2005 Dec 20.

Estimating the false discovery rate using nonparametric deconvolution.

Biometrics. 2007 Sep;63(3):806-15. doi: 10.1111/j.1541-0420.2006.00736.x.

A note on using permutation-based false discovery rate estimates to compare different analysis methods for microarray data.

Bioinformatics. 2005 Dec 1;21(23):4280-8. doi: 10.1093/bioinformatics/bti685. Epub 2005 Sep 27.

Sample size for FDR-control in microarray data analysis.

Bioinformatics. 2005 Jul 15;21(14):3097-104. doi: 10.1093/bioinformatics/bti456. Epub 2005 Apr 21.

Rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data.

BMC Bioinformatics. 2005 Jul 22;6:187. doi: 10.1186/1471-2105-6-187.

引用本文的文献

Genetic insights into the effect of trace elements on cardiovascular diseases: multi-omics Mendelian randomization combined with linkage disequilibrium score regression analysis.

Front Immunol. 2024 Dec 3;15:1459465. doi: 10.3389/fimmu.2024.1459465. eCollection 2024.

Field application of de novo transcriptomic analysis to evaluate the effects of sublethal freshwater salinization on Gasterosteus aculeatus in urban streams.

PLoS One. 2024 Mar 13;19(3):e0298213. doi: 10.1371/journal.pone.0298213. eCollection 2024.

Association of Thymidylate Synthase () Gene Polymorphisms with Incidence and Prognosis of Coronary Artery Disease.

Int J Mol Sci. 2023 Aug 9;24(16):12591. doi: 10.3390/ijms241612591.

Mitochondrial TXNRD2 and ME3 Genetic Risk Scores Are Associated with Specific Primary Open-Angle Glaucoma Phenotypes.

Ophthalmology. 2023 Jul;130(7):756-763. doi: 10.1016/j.ophtha.2023.02.018. Epub 2023 Feb 20.

Networks of placental DNA methylation correlate with maternal serum PCB concentrations and child neurodevelopment.

Environ Res. 2023 Mar 1;220:115227. doi: 10.1016/j.envres.2023.115227. Epub 2023 Jan 4.

Global FDR control across multiple RNAseq experiments.

Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac718.

Analysis of Thromboembolic and Thrombocytopenic Events After the AZD1222, BNT162b2, and MRNA-1273 COVID-19 Vaccines in 3 Nordic Countries.

JAMA Netw Open. 2022 Jun 1;5(6):e2217375. doi: 10.1001/jamanetworkopen.2022.17375.

Default-Mode Network Connectivity Changes Correlate with Attention Deficits in ALL Long-Term Survivors Treated with Radio- and/or Chemotherapy.

Biology (Basel). 2022 Mar 24;11(4):499. doi: 10.3390/biology11040499.

Disentangling sex differences in the shared genetic architecture of posttraumatic stress disorder, traumatic experiences, and social support with body size and composition.

Neurobiol Stress. 2021 Sep 17;15:100400. doi: 10.1016/j.ynstr.2021.100400. eCollection 2021 Nov.

DAGM: A novel modelling framework to assess the risk of HER2-negative breast cancer based on germline rare coding mutations.

EBioMedicine. 2021 Jul;69:103446. doi: 10.1016/j.ebiom.2021.103446. Epub 2021 Jun 19.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

假发现率：大规模遗传研究中的关键概念。

The false discovery rate: a key concept in large-scale genetic studies.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献