Golden Helix Inc., Bozeman, MT 59719, USA.
Biostatistics. 2012 Apr;13(2):195-203. doi: 10.1093/biostatistics/kxr055. Epub 2012 Jan 27.
Many public and private genome-wide association studies that we have analyzed include flaws in design, with avoidable confounding appearing as a norm rather than the exception. Rather than recognizing flawed research design and addressing that, a category of quality-control statistical methods has arisen to treat only the symptoms. Reflecting more deeply, we examine elements of current genomic research in light of the traditional scientific method and find that hypotheses are often detached from data collection, experimental design, and causal theories. Association studies independent of causal theories, along with multiple testing errors, too often drive health care and public policy decisions. In an era of large-scale biological research, we ask questions about the role of statistical analyses in advancing coherent theories of diseases and their mechanisms. We advocate for reinterpretation of the scientific method in the context of large-scale data analysis opportunities and for renewed appreciation of falsifiable hypotheses, so that we can learn more from our best mistakes.
我们分析过的许多公共和私人全基因组关联研究都存在设计缺陷,可避免的混杂现象似乎是常态而非例外。人们没有认识到有缺陷的研究设计并加以解决,而是出现了一类质量控制统计方法,这些方法只是针对症状进行处理。我们进一步深入思考,根据传统的科学方法来审视当前基因组研究的各个要素,结果发现假设往往与数据收集、实验设计和因果理论脱节。与因果理论无关的关联研究以及多次测试错误,往往会左右医疗保健和公共政策决策。在大规模生物研究时代,我们对统计分析在推进疾病及其机制的连贯理论方面所起的作用提出了质疑。我们提倡在大规模数据分析机会的背景下重新解释科学方法,并重新重视可证伪的假设,以便我们能够从最好的错误中吸取更多的教训。