• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于使用基于排列的错误发现率估计来比较微阵列数据不同分析方法的说明。

A note on using permutation-based false discovery rate estimates to compare different analysis methods for microarray data.

作者信息

Xie Yang, Pan Wei, Khodursky Arkady B

机构信息

Division of Biostatistics, School of Public Health, University of Minnesota Minneapolis, MN 55455, USA.

出版信息

Bioinformatics. 2005 Dec 1;21(23):4280-8. doi: 10.1093/bioinformatics/bti685. Epub 2005 Sep 27.

DOI:10.1093/bioinformatics/bti685
PMID:16188930
Abstract

MOTIVATION

False discovery rate (FDR) is defined as the expected percentage of false positives among all the claimed positives. In practice, with the true FDR unknown, an estimated FDR can serve as a criterion to evaluate the performance of various statistical methods under the condition that the estimated FDR approximates the true FDR well, or at least, it does not improperly favor or disfavor any particular method. Permutation methods have become popular to estimate FDR in genomic studies. The purpose of this paper is 2-fold. First, we investigate theoretically and empirically whether the standard permutation-based FDR estimator is biased, and if so, whether the bias inappropriately favors or disfavors any method. Second, we propose a simple modification of the standard permutation to yield a better FDR estimator, which can in turn serve as a more fair criterion to evaluate various statistical methods.

RESULTS

Both simulated and real data examples are used for illustration and comparison. Three commonly used test statistics, the sample mean, SAM statistic and Student's t-statistic, are considered. The results show that the standard permutation method overestimates FDR. The overestimation is the most severe for the sample mean statistic while the least for the t-statistic with the SAM-statistic lying between the two extremes, suggesting that one has to be cautious when using the standard permutation-based FDR estimates to evaluate various statistical methods. In addition, our proposed FDR estimation method is simple and outperforms the standard method.

摘要

动机

错误发现率(FDR)被定义为所有宣称的阳性结果中假阳性结果的预期百分比。在实际应用中,由于真实的FDR未知,在估计的FDR能较好地近似真实FDR的条件下,或者至少它不会不适当地偏袒或不利于任何特定方法时,估计的FDR可作为评估各种统计方法性能的一个标准。排列方法在基因组研究中已广泛用于估计FDR。本文的目的有两个。首先,我们从理论和实证两方面研究基于标准排列的FDR估计器是否存在偏差,如果存在偏差,该偏差是否不适当地偏袒或不利于任何方法。其次,我们对标准排列提出一种简单的修正,以得到一个更好的FDR估计器,进而可作为一个更公平的标准来评估各种统计方法。

结果

使用模拟数据和实际数据示例进行说明和比较。考虑了三种常用的检验统计量:样本均值、SAM统计量和学生t统计量。结果表明,标准排列方法高估了FDR。对于样本均值统计量,高估最为严重,而对于t统计量则最小,SAM统计量介于两者之间,这表明在使用基于标准排列的FDR估计来评估各种统计方法时必须谨慎。此外,我们提出的FDR估计方法简单且优于标准方法。

相似文献

1
A note on using permutation-based false discovery rate estimates to compare different analysis methods for microarray data.关于使用基于排列的错误发现率估计来比较微阵列数据不同分析方法的说明。
Bioinformatics. 2005 Dec 1;21(23):4280-8. doi: 10.1093/bioinformatics/bti685. Epub 2005 Sep 27.
2
Construction of null statistics in permutation-based multiple testing for multi-factorial microarray experiments.基于排列的多因素微阵列实验多重检验中零统计量的构建。
Bioinformatics. 2006 Jun 15;22(12):1486-94. doi: 10.1093/bioinformatics/btl109. Epub 2006 Mar 30.
3
Multidimensional local false discovery rate for microarray studies.微阵列研究的多维局部错误发现率
Bioinformatics. 2006 Mar 1;22(5):556-65. doi: 10.1093/bioinformatics/btk013. Epub 2005 Dec 20.
4
Robust estimation of the false discovery rate.错误发现率的稳健估计
Bioinformatics. 2006 Aug 15;22(16):1979-87. doi: 10.1093/bioinformatics/btl328. Epub 2006 Jun 15.
5
Rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data.基于秩不变重采样的小样本微阵列数据分析中错误发现率估计
BMC Bioinformatics. 2005 Jul 22;6:187. doi: 10.1186/1471-2105-6-187.
6
Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.基于疾病谱数据中错误发现率的七种生成Affymetrix表达分数方法的比较。
BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.
7
A practical false discovery rate approach to identifying patterns of differential expression in microarray data.一种用于识别微阵列数据中差异表达模式的实用错误发现率方法。
Bioinformatics. 2005 Jun 1;21(11):2684-90. doi: 10.1093/bioinformatics/bti407. Epub 2005 Mar 29.
8
Estimation of false discovery proportion under general dependence.一般相关性下错误发现比例的估计
Bioinformatics. 2006 Dec 15;22(24):3025-31. doi: 10.1093/bioinformatics/btl527. Epub 2006 Oct 17.
9
False discovery rate, sensitivity and sample size for microarray studies.微阵列研究的错误发现率、敏感性和样本量
Bioinformatics. 2005 Jul 1;21(13):3017-24. doi: 10.1093/bioinformatics/bti448. Epub 2005 Apr 19.
10
Bias in the estimation of false discovery rate in microarray studies.微阵列研究中错误发现率估计的偏差。
Bioinformatics. 2005 Oct 15;21(20):3865-72. doi: 10.1093/bioinformatics/bti626. Epub 2005 Aug 16.

引用本文的文献

1
Effective integration of multi-omics with prior knowledge to identify biomarkers via explainable graph neural networks.通过可解释图神经网络将多组学与先验知识有效整合以识别生物标志物。
NPJ Syst Biol Appl. 2025 May 8;11(1):43. doi: 10.1038/s41540-025-00519-9.
2
An explainable graph neural network approach for effectively integrating multi-omics with prior knowledge to identify biomarkers from interacting biological domains.一种可解释的图神经网络方法,用于有效整合多组学与先验知识,以从相互作用的生物领域中识别生物标志物。
bioRxiv. 2024 Sep 26:2024.08.23.609465. doi: 10.1101/2024.08.23.609465.
3
Large changes in detected selection signatures after a selection limit in mice bred for voluntary wheel-running behavior.
在为自愿轮转行为培育的小鼠中达到选择极限后,检测到的选择特征发生了巨大变化。
PLoS One. 2024 Aug 1;19(8):e0306397. doi: 10.1371/journal.pone.0306397. eCollection 2024.
4
Midkine Attenuates Aβ Fibril Assembly and AmyloidPlaque Formation.中期因子可减弱β-淀粉样蛋白原纤维组装及淀粉样斑块形成。
Res Sq. 2024 Jun 7:rs.3.rs-4361125. doi: 10.21203/rs.3.rs-4361125/v1.
5
Cancer metabolites: promising biomarkers for cancer liquid biopsy.癌症代谢物:癌症液体活检中颇具前景的生物标志物。
Biomark Res. 2023 Jun 30;11(1):66. doi: 10.1186/s40364-023-00507-3.
6
The systematic comparison between Gaussian mirror and Model-X knockoff models.高斯镜与 Model-X 伪影模型的系统比较。
Sci Rep. 2023 Apr 4;13(1):5478. doi: 10.1038/s41598-023-32605-5.
7
JUMPptm: Integrated software for sensitive identification of post-translational modifications and its application in Alzheimer's disease study.JUMPptm:用于灵敏鉴定翻译后修饰的集成软件及其在阿尔茨海默病研究中的应用。
Proteomics. 2023 Feb;23(3-4):e2100369. doi: 10.1002/pmic.202100369. Epub 2022 Sep 20.
8
Quantitative Comparison of Statistical Methods for Analyzing Human Metabolomics Data.分析人类代谢组学数据的统计方法的定量比较
Metabolites. 2022 Jun 4;12(6):519. doi: 10.3390/metabo12060519.
9
Null-free False Discovery Rate Control Using Decoy Permutations.使用诱饵排列的无空值错误发现率控制
Acta Math Appl Sin. 2022;38(2):235-253. doi: 10.1007/s10255-022-1077-5. Epub 2022 Apr 9.
10
Neural-level associations of non-verbal pragmatic comprehension in young Finnish autistic adults.年轻芬兰自闭症成年人非言语语用理解的神经水平关联。
Int J Circumpolar Health. 2021 Dec;80(1):1909333. doi: 10.1080/22423982.2021.1909333.