Radcliffe Department of Medicine, University of Oxford, Level 4, Academic Block, John Radcliffe Hospital, Headington, Oxford OX3 9DU, UK; People Services Department, Cochrane, St Albans House, 57-59 Haymarket, London SW1Y 4QX, UK.
Metaxis Ltd, Elmbank Offices, Main Road Curbridge, Witney, Oxfordshire OX29 7NT, UK.
J Clin Epidemiol. 2021 May;133:130-139. doi: 10.1016/j.jclinepi.2021.01.006. Epub 2021 Jan 18.
Filtering the deluge of new research to facilitate evidence synthesis has proven to be unmanageable using current paradigms of search and retrieval. Crowdsourcing, a way of harnessing the collective effort of a "crowd" of people, has the potential to support evidence synthesis by addressing this information overload created by the exponential growth in primary research outputs. Cochrane Crowd, Cochrane's citizen science platform, offers a range of tasks aimed at identifying studies related to health care. Accompanying each task are brief, interactive training modules, and agreement algorithms that help ensure accurate collective decision-making.The aims of the study were to evaluate the performance of Cochrane Crowd in terms of its accuracy, capacity, and autonomy and to examine contributor engagement across three tasks aimed at identifying randomized trials.
Crowd accuracy was evaluated by measuring the sensitivity and specificity of crowd screening decisions on a sample of titles and abstracts, compared with "quasi gold-standard" decisions about the same records using the conventional methods of dual screening. Crowd capacity, in the form of output volume, was evaluated by measuring the number of records processed by the crowd, compared with baseline. Crowd autonomy, the capability of the crowd to produce accurate collectively derived decisions without the need for expert resolution, was measured by the proportion of records that needed resolving by an expert.
The Cochrane Crowd community currently has 18,897 contributors from 163 countries. Collectively, the Crowd has processed 1,021,227 records, helping to identify 178,437 reports of randomized controlled trials (RCTs) for Cochrane's Central Register of Controlled Trials. The sensitivity for each task was 99.1% for the RCT identification task (RCT ID), 99.7% for the RCT identification task of trials from ClinicalTrials.gov (CT ID), and 97.7% for the identification of RCTs from the International Clinical Trials Registry Platform (ICTRP ID). The specificity for each task was 99% for RCT ID, 98.6% for CT ID, and 99.1% for CT ICTRP ID. The capacity of the combined Crowd and machine learning workflow has increased fivefold in 6 years, compared with baseline. The proportion of records requiring expert resolution across the tasks ranged from 16.6% to 19.7%.
Cochrane Crowd is sufficiently accurate and scalable to keep pace with the current rate of publication (and registration) of new primary studies. It has also proved to be a popular, efficient, and accurate way for a large number of people to play an important voluntary role in health evidence production. Cochrane Crowd is now an established part of Cochrane's effort to manage the deluge of primary research being produced.
使用当前的搜索和检索模式来筛选新研究的大量信息以促进证据综合已被证明是无法管理的。众包是一种利用“人群”的集体努力的方法,通过解决主要研究成果呈指数级增长所带来的信息过载问题,有可能支持证据综合。Cochrane Crowd 是 Cochrane 的公民科学平台,提供了一系列旨在确定与医疗保健相关研究的任务。每项任务都伴随着简短的互动培训模块和一致的算法,以帮助确保准确的集体决策。本研究的目的是评估 Cochrane Crowd 在准确性、容量和自主性方面的表现,并检查三个旨在确定随机试验的任务中的参与者参与情况。
通过测量人群筛选决策在标题和摘要样本上的敏感性和特异性,与使用双重筛选的常规方法对同一记录的“准黄金标准”决策进行比较,来评估人群的准确性。以处理的记录数量来衡量人群的容量(以输出量的形式),并与基线进行比较。人群的自主性,即人群无需专家解决即可生成准确的集体决策的能力,通过需要专家解决的记录比例来衡量。
Cochrane Crowd 社区目前拥有来自 163 个国家/地区的 18897 名贡献者。该人群总共处理了 1021227 条记录,帮助为 Cochrane 中央对照试验注册库确定了 178437 项随机对照试验 (RCT) 的报告。每个任务的敏感性分别为 RCT 识别任务 (RCT ID) 的 99.1%、来自 ClinicalTrials.gov 的 RCT 识别任务 (CT ID) 的 99.7%和来自国际临床试验注册平台 (ICTRP ID) 的 RCT 识别任务的 97.7%。每个任务的特异性分别为 RCT ID 的 99%、CT ID 的 98.6%和 CT ICTRP ID 的 99.1%。结合人群和机器学习工作流程的容量在 6 年内增加了五倍,与基线相比。三个任务中需要专家解决的记录比例范围为 16.6%至 19.7%。
Cochrane Crowd 足够准确且具有可扩展性,能够跟上新的主要研究的当前出版(和注册)速度。它还被证明是一种流行、高效和准确的方法,可以让大量人员在健康证据生产中发挥重要的自愿作用。Cochrane Crowd 现已成为 Cochrane 管理大量主要研究的重要组成部分。