Goeman Jelle J, Solari Aldo
Biostatistics, Department for Health Evidence, Radboud University Medical Center, Nijmegen, The Netherlands; Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, The Netherlands.
Stat Med. 2014 May 20;33(11):1946-78. doi: 10.1002/sim.6082. Epub 2014 Jan 8.
This paper presents an overview of the current state of the art in multiple testing in genomics data from a user's perspective. We describe methods for familywise error control, false discovery rate control and false discovery proportion estimation and confidence, both conceptually and practically, and explain when to use which type of error rate. We elaborate on the assumptions underlying the methods and discuss pitfalls in the interpretation of results. In our discussion, we take into account the exploratory nature of genomics experiments, looking at selection of genes before or after testing, and at the role of validation experiments.
本文从用户的角度概述了基因组数据多重检验的当前技术现状。我们从概念和实践两方面描述了控制家族性错误率、错误发现率以及估计和置信错误发现比例的方法,并解释了何时使用哪种错误率类型。我们详细阐述了这些方法背后的假设,并讨论了结果解释中的陷阱。在讨论中,我们考虑了基因组实验的探索性本质,探讨了检验前或检验后基因的选择以及验证实验的作用。