Department of Mathematics & Statistics, University of Vermont, Burlington, VT, United States of America.
Vermont Complex Systems Center, University of Vermont, Burlington, VT, United States of America.
PLoS One. 2020 Dec 17;15(12):e0244245. doi: 10.1371/journal.pone.0244245. eCollection 2020.
Allowing members of the crowd to propose novel microtasks for one another is an effective way to combine the efficiencies of traditional microtask work with the inventiveness and hypothesis generation potential of human workers. However, microtask proposal leads to a growing set of tasks that may overwhelm limited crowdsourcer resources. Crowdsourcers can employ methods to utilize their resources efficiently, but algorithmic approaches to efficient crowdsourcing generally require a fixed task set of known size. In this paper, we introduce cost forecasting as a means for a crowdsourcer to use efficient crowdsourcing algorithms with a growing set of microtasks. Cost forecasting allows the crowdsourcer to decide between eliciting new tasks from the crowd or receiving responses to existing tasks based on whether or not new tasks will cost less to complete than existing tasks, efficiently balancing resources as crowdsourcing occurs. Experiments with real and synthetic crowdsourcing data show that cost forecasting leads to improved accuracy. Accuracy and efficiency gains for crowd-generated microtasks hold the promise to further leverage the creativity and wisdom of the crowd, with applications such as generating more informative and diverse training data for machine learning applications and improving the performance of user-generated content and question-answering platforms.
允许人群成员相互提出新的微任务是将传统微任务工作的效率与人类工人的创造力和假设生成潜力相结合的有效方法。然而,微任务提案会导致任务集不断增加,从而可能超出有限的众包资源的处理能力。众包人员可以采用有效的资源利用方法,但算法方法通常需要固定的已知大小的任务集。在本文中,我们介绍了成本预测,这是众包人员在具有不断增长的微任务集的情况下使用高效众包算法的一种手段。成本预测允许众包人员根据新任务的完成成本是否低于现有任务,来决定是从人群中征集新任务还是接收现有任务的回复,从而有效地平衡资源在众包过程中的使用。使用真实和合成众包数据进行的实验表明,成本预测可提高准确性。对于众包生成的微任务的准确性和效率的提高有望进一步利用群体的创造力和智慧,其应用包括为机器学习应用程序生成更具信息量和更多样化的训练数据,以及提高用户生成内容和问答平台的性能。