Shimoni Hagar, Axelrod Vadim
The Gonda Multidisciplinary Brain Research Center, Bar-Ilan University, Ramat Gan, Israel.
R Soc Open Sci. 2025 Jul 16;12(7):250361. doi: 10.1098/rsos.250361. eCollection 2025 Jul.
Amazon Mechanical Turk (MTurk) has been one of the most popular platforms for online research in psychology and the social sciences more broadly. Although concerns about MTurk data quality have been raised, the platform continues to be widely used. The question is whether MTurk is suitable for research and, if so, whether it is being used optimally. We conducted a systematic investigation of MTurk data quality and reliability, including main and replication experiments, with more than 1300 participants subdivided into three cohorts: (i) workers (i.e. participants on the MTurk platform) with master requirement (i.e. high-performing workers selected by MTurk), (ii) workers without master requirement, and (iii) workers without master requirement, but with a 95% or above approval rate. We found that master workers almost never missed attentional checks, exhibited high reliability and showed no tendency towards straightlining. We therefore recommend these workers, especially when the naivety of participants is not a strong prerequisite and a large sample size is not required. In contrast, workers without restrictions, and workers with an approval-rate threshold of 95% or above, missed many attentional checks, exhibited low reliability and showed a tendency towards straightlining, raising serious concerns about the suitability of these workers for research.