Suppr超能文献

假发现率:大规模遗传研究中的关键概念。

The false discovery rate: a key concept in large-scale genetic studies.

机构信息

Division of Personalized Nutrition and Medicine, National Center for Toxicological Research, Food and Drug Administration, HFT-20, Jefferson, AR 72079, USA.

出版信息

Cancer Control. 2010 Jan;17(1):58-62. doi: 10.1177/107327481001700108.

Abstract

BACKGROUND

In experimental research, a statistical test is often used for making decisions on a null hypothesis such as that the means of gene expression in the normal and tumor groups are equal. Typically, a test statistic and its corresponding P value are calculated to measure the extent of the difference between the two groups. The null hypothesis is rejected and a discovery is declared when the P value is less than a prespecified significance level. When more than one test is conducted, use of a significance level intended for use by a single test typically leads to a large chance of false-positive findings.

METHODS

This paper presents an overview of the multiple testing framework and describes the false discovery rate (FDR) approach to determining the significance cutoff when a large number of tests are conducted.

RESULTS

The FDR is the expected proportion of the null hypotheses that are falsely rejected divided by the total number of rejections. An FDR-controlling procedure is described and illustrated with a numerical example.

CONCLUSIONS

In multiple testing, a classical "family-wise error rate" (FWE) approach is commonly used when the number of tests is small. When a study involves a large number of tests, the FDR error measure is a more useful approach to determining a significance cutoff, as the FWE approach is too stringent. The FDR approach allows more claims of significant differences to be made, provided the investigator is willing to accept a small fraction of false-positive findings.

摘要

背景

在实验研究中,通常会使用统计检验来对零假设做出决策,例如正常组和肿瘤组的基因表达平均值相等。通常,会计算检验统计量及其对应的 P 值,以衡量两组之间差异的程度。当 P 值小于预设的显著性水平时,就会拒绝零假设,并宣布发现。当进行多个检验时,使用单个检验的预设显著性水平通常会导致大量假阳性发现的可能性增加。

方法

本文概述了多重检验框架,并描述了当进行大量检验时确定显著性截断值的错误发现率 (FDR) 方法。

结果

FDR 是被错误拒绝的零假设数量除以总拒绝数的预期比例。描述并举例说明了一种 FDR 控制程序。

结论

在多重检验中,当检验数量较小时,通常使用经典的“总体错误率”(FWE)方法。当研究涉及大量检验时,FDR 错误度量是确定显著性截断值的更有用方法,因为 FWE 方法过于严格。FDR 方法允许做出更多的显著差异声明,前提是研究者愿意接受一小部分假阳性发现。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验