Suppr超能文献

众包和自动化有助于在实时综述中识别和分类随机对照试验。

Crowd-sourcing and automation facilitated the identification and classification of randomized controlled trials in a living review.

作者信息

Kamso Mohammed Mujaab, Pardo Jordi Pardo, Whittle Samuel L, Buchbinder Rachelle, Wells George, Glennon Vanessa, Tugwell Peter, Deardon Rob, Sajobi Tolulope, Tomlinson George, Elliott Jesse, Kelly Shannon E, Hazlewood Glen S

机构信息

Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Canada.

Centre for Practice-Changing Research, Ottawa Hospital Research Institute, The Ottawa Hospital - General Campus, Ottawa, Canada.

出版信息

J Clin Epidemiol. 2023 Dec;164:1-8. doi: 10.1016/j.jclinepi.2023.10.007. Epub 2023 Oct 21.

Abstract

OBJECTIVES

To evaluate an approach using automation and crowdsourcing to identify and classify randomized controlled trials (RCTs) for rheumatoid arthritis (RA) in a living systematic review (LSR).

METHODS

Records from a database search for RCTs in RA were screened first by machine learning and Cochrane Crowd to exclude non-RCTs, then by trainee reviewers using a Population, Intervention, Comparison, and Outcome (PICO) annotator platform to assess eligibility and classify the trial to the appropriate review. Disagreements were resolved by experts using a custom online tool. We evaluated the efficiency gains, sensitivity, accuracy, and interrater agreement (kappa scores) between reviewers.

RESULTS

From 42,452 records, machine learning and Cochrane Crowd excluded 28,777 (68%), trainee reviewers excluded 4,529 (11%), and experts excluded 7,200 (17%). The 1,946 records eligible for our LSR represented 220 RCTs and included 148/149 (99.3%) of known eligible trials from prior reviews. Although excluded from our LSRs, 6,420 records were classified as other RCTs in RA to inform future reviews. False negative rates among trainees were highest for the RCT domain (12%), although only 1.1% of these were for the primary record. Kappa scores for two reviewers ranged from moderate to substantial agreement (0.40-0.69).

CONCLUSION

A screening approach combining machine learning, crowdsourcing, and trainee participation substantially reduced the screening burden for expert reviewers and was highly sensitive.

摘要

目的

评估一种利用自动化和众包来识别和分类类风湿性关节炎(RA)随机对照试验(RCT)的方法,用于实时系统评价(LSR)。

方法

首先通过机器学习和Cochrane众包筛选类风湿性关节炎随机对照试验数据库搜索记录,以排除非随机对照试验,然后由实习评审员使用人群、干预措施、对照和结果(PICO)注释平台评估其合格性,并将试验分类到适当的评价中。分歧由专家使用定制在线工具解决。我们评估了评审员之间的效率提升、敏感性、准确性和评分者间一致性(kappa值)。

结果

在42452条记录中,机器学习和Cochrane众包排除了28777条(68%),实习评审员排除了4529条(11%),专家排除了7200条(17%)。符合我们实时系统评价的1946条记录代表220项随机对照试验,包括先前评价中已知合格试验的148/149项(99.3%)。虽然被排除在我们的实时系统评价之外,但6420条记录被分类为类风湿性关节炎的其他随机对照试验,为未来的评价提供参考。实习评审员中随机对照试验领域的假阴性率最高(12%),不过其中仅1.1%是针对主要记录。两位评审员的kappa值从中度到高度一致(0.40 - 0.69)。

结论

一种结合机器学习、众包和实习人员参与的筛选方法显著减轻了专家评审员的筛选负担,且具有高度敏感性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验