Psarou Eleni, Katsanevaki Christini, Maris Eric, Fries Pascal
Ernst Strüngmann Institute (ESI) for Neuroscience in Cooperation with Max Planck Society, 60528 Frankfurt, Germany.
International Max Planck Research School for Neural Circuits, 60438 Frankfurt, Germany.
bioRxiv. 2025 Jun 2:2024.08.26.609821. doi: 10.1101/2024.08.26.609821.
Non-human primate studies traditionally use two or three animals. We previously used standard statistics to argue for using either one animal, for an inference about that sample, or five or more animals, for a useful inference about the population. A recently proposed framework argued for testing three animals and accepting the outcome found in the majority as the outcome that is most representative for the population. The proposal tests this framework under various assumptions about the true probability of the representative outcome in the population, i.e. its typicality. On this basis, it argues that the framework is valid across a wide range of typicalities. Here, we show (1) that the error rate of the framework depends strongly on the typicality of the representative outcome, (2) that an acceptable error rate requires this typicality to be very high (87% for a single type of outlier), which actually renders empirical testing beyond a single animal obsolete, (3) that moving from one to three animals decreases error rates mainly for typicality values of 70-90%, and much less for both lower and higher values. Furthermore, we use conjunction analysis to demonstrate that two out of three animals with a given outcome only allow to infer a lower bound to typicality of 9%, which is of limited value. Thus, the use of two or three animals does not allow a useful inference about the population, and if this option is nevertheless chosen, the inferred lower bound of typicality should be reported.
传统上,非人灵长类动物研究使用两到三只动物。我们之前运用标准统计学方法主张,要么使用一只动物来推断该样本,要么使用五只或更多动物来对总体进行有效推断。最近提出的一个框架主张测试三只动物,并将多数动物出现的结果作为最能代表总体的结果接受下来。该提议在关于总体中代表性结果的真实概率(即其典型性)的各种假设下对这个框架进行了测试。在此基础上,它认为该框架在广泛的典型性范围内都是有效的。在这里,我们表明:(1)该框架的错误率强烈依赖于代表性结果的典型性;(2)可接受的错误率要求这种典型性非常高(对于单一类型的异常值为87%),这实际上使得对单只以上动物的实证测试变得过时;(3)从一只动物增加到三只动物主要是在典型性值为70%至90%时降低错误率,而在较低和较高值时降低幅度要小得多。此外,我们使用联合分析来证明,三只动物中有两只出现给定结果仅能推断出典型性的下限为9%,这价值有限。因此,使用两到三只动物无法对总体进行有效推断,如果仍然选择此选项,则应报告推断出的典型性下限。