Stark Emily N, Turton Terece L, Miller Jonah, Barenholtz Elan, Hong Sang, Bujack Roxana
Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, United States of America.
Department of Psychology, Florida Atlantic University, Boca Raton, FL, United States of America.
PLoS One. 2024 Dec 23;19(12):e0315853. doi: 10.1371/journal.pone.0315853. eCollection 2024.
Crowdsource platforms have been used to study a range of perceptual stimuli such as the graphical perception of scatterplots and various aspects of human color perception. Given the lack of control over a crowdsourced participant's experimental setup, there are valid concerns on the use of crowdsourcing for color studies as the perception of the stimuli is highly dependent on the stimulus presentation. Here, we propose that the error due to a crowdsourced experimental design can be effectively averaged out because the crowdsourced experiment can be accommodated by the Thurstonian model as the convolution of two normal distributions, one that is perceptual in nature and one that captures the error due to variability in stimulus presentation. Based on this, we provide a mathematical estimate for the sample size needed to produce a crowdsourced experiment with the same power as the corresponding in-person study. We tested this claim by replicating a large-scale, crowdsourced study of human lightness perception with a diverse sample with a highly controlled, in-person study with a sample taken from psychology undergraduates. Our claim was supported by the replication of the results from the latter. These findings suggest that, with sufficient sample size, color vision studies may be completed online, giving access to a larger and more representative sample. With this framework at hand, experimentalists have the validation that choosing either many online participants or few in person participants will not sacrifice the impact of their results.
众包平台已被用于研究一系列感知刺激,如散点图的图形感知和人类颜色感知的各个方面。鉴于对众包参与者的实验设置缺乏控制,对于将众包用于颜色研究存在合理担忧,因为刺激的感知高度依赖于刺激呈现。在此,我们提出,由于众包实验设计导致的误差可以有效地平均掉,因为众包实验可以被瑟斯顿模型所容纳,该模型是两个正态分布的卷积,一个本质上是感知性的,另一个捕捉由于刺激呈现的变异性而产生的误差。基于此,我们提供了一个数学估计,用于确定与相应的面对面研究具有相同效力的众包实验所需的样本量。我们通过对一项大规模的众包人类明度感知研究进行复制来验证这一说法,该研究使用了多样化的样本,同时进行了一项高度受控的面对面研究,样本取自心理学本科生。后者的结果复制支持了我们的说法。这些发现表明,有了足够的样本量,颜色视觉研究可以在线完成,从而能够接触到更大且更具代表性的样本。有了这个框架,实验者就有了这样的验证:选择许多在线参与者或少数面对面参与者都不会牺牲其结果的影响力。