Merck Research Laboratories, North Wales, PA 19454, USA.
Pharmacoepidemiol Drug Saf. 2010 May;19(5):533-6. doi: 10.1002/pds.1928.
Dr. Walker asserts that a hypothesis always can be tested using the same data source that generated the data if the test data are independent of the data generating the hypothesis. One way to do this is to use part of the totality of data to generate the hypothesis and the other to test the hypothesis. The validity of this assertion depends on what one means by 'independent'. This note addresses the logical and statistical implications of Dr. Walker's assertion. The key conclusion is that what constitutes 'independent' data has to be considered carefully, and that hypothesis-generating and test data from the same data source generally can not be considered 'independent'.
沃克博士断言,如果测试数据与产生假设的数据无关,那么总是可以使用生成数据的相同数据源来检验假设。一种方法是使用数据的一部分来生成假设,而另一部分则用于检验假设。这一断言的有效性取决于“独立”的含义。本注释探讨了沃克博士断言的逻辑和统计含义。关键结论是,必须仔细考虑什么构成“独立”的数据,并且通常不能认为来自同一数据源的假设生成和测试数据是“独立”的。