• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

比较机器和人工评审员评估随机对照试验偏倚风险。

Comparing machine and human reviewers to evaluate the risk of bias in randomized controlled trials.

机构信息

Institute of Health Economics (IHE), Edmonton, Alberta, Canada.

Faculty of Rehabilitation Medicine, Department of Physical Therapy/Faculty of Medicine and Dentistry, University of Alberta, Edmonton, Alberta, Canada.

出版信息

Res Synth Methods. 2020 May;11(3):484-493. doi: 10.1002/jrsm.1398. Epub 2020 Mar 3.

DOI:10.1002/jrsm.1398
PMID:32065732
Abstract

BACKGROUND

Evidence from new health technologies is growing, along with demands for evidence to inform policy decisions, creating challenges in completing health technology assessments (HTAs)/systematic reviews (SRs) in a timely manner. Software can decrease the time and burden by automating the process, but evidence validating such software is limited. We tested the accuracy of RobotReviewer, a semi-autonomous risk of bias (RoB) assessment tool, and its agreement with human reviewers.

METHODS

Two reviewers independently conducted RoB assessments on a sample of randomized controlled trials (RCTs), and their consensus ratings were compared with those generated by RobotReviewer. Agreement with the human reviewers was assessed using percent agreement and weighted kappa (κ). The accuracy of RobotReviewer was also assessed by calculating the sensitivity, specificity, and area under the curve in comparison to the consensus agreement of the human reviewers.

RESULTS

The study included 372 RCTs. Inter-rater reliability ranged from κ = -0.06 (no agreement) for blinding of participants and personnel to κ = 0.62 (good agreement) for random sequence generation (excluding overall RoB). RobotReviewer was found to use a high percentage of "irrelevant supporting quotations" to complement RoB assessments for blinding of participants and personnel (72.6%), blinding of outcome assessment (70.4%), and allocation concealment (54.3%).

CONCLUSION

RobotReviewer can help with risk of bias assessment of RCTs but cannot replace human evaluations. Thus, reviewers should check and validate RoB assessments from RobotReviewer by consulting the original article when not relevant supporting quotations are provided by RobotReviewer. This consultation is in line with the recommendation provided by the developers.

摘要

背景

新的医疗技术证据不断增加,同时也需要证据来为政策决策提供信息,这给及时完成医疗技术评估(HTA)/系统评价(SR)带来了挑战。软件可以通过自动化流程来减少时间和负担,但验证此类软件的证据有限。我们测试了 RobotReviewer 的准确性,这是一种半自动偏倚(RoB)评估工具,以及它与人评阅者的一致性。

方法

两位评阅者分别对一组随机对照试验(RCT)进行 RoB 评估,然后将他们的共识评分与 RobotReviewer 生成的评分进行比较。使用百分比一致性和加权 κ(κ)来评估与人类评阅者的一致性。通过计算与人类评阅者共识一致性的灵敏度、特异性和曲线下面积,评估 RobotReviewer 的准确性。

结果

该研究共纳入 372 项 RCT。评阅者间可靠性范围从参与者和人员盲法的 κ = -0.06(无一致性)到随机序列生成(不包括总体 RoB)的 κ = 0.62(良好一致性)。发现 RobotReviewer 大量使用“不相关的支持引语”来补充参与者和人员盲法(72.6%)、结局评估盲法(70.4%)和分配隐藏盲法(54.3%)的 RoB 评估。

结论

RobotReviewer 可以帮助评估 RCT 的偏倚风险,但不能替代人类评估。因此,当 RobotReviewer 未提供相关支持引语时,评阅者应查阅原文,检查并验证来自 RobotReviewer 的 RoB 评估。这种咨询符合开发人员的建议。

相似文献

1
Comparing machine and human reviewers to evaluate the risk of bias in randomized controlled trials.比较机器和人工评审员评估随机对照试验偏倚风险。
Res Synth Methods. 2020 May;11(3):484-493. doi: 10.1002/jrsm.1398. Epub 2020 Mar 3.
2
Agreement in Risk of Bias Assessment Between RobotReviewer and Human Reviewers: An Evaluation Study on Randomised Controlled Trials in Nursing-Related Cochrane Reviews.机器人评估者与人工评估者在偏倚风险评估中的一致性:一项针对 Cochrane 护理相关综述中随机对照试验的评估研究。
J Nurs Scholarsh. 2021 Mar;53(2):246-254. doi: 10.1111/jnu.12628. Epub 2021 Feb 8.
3
Accuracy and Efficiency of Machine Learning-Assisted Risk-of-Bias Assessments in "Real-World" Systematic Reviews : A Noninferiority Randomized Controlled Trial.机器学习辅助“真实世界”系统评价偏倚风险评估的准确性和效率:一项非劣效性随机对照试验。
Ann Intern Med. 2022 Jul;175(7):1001-1009. doi: 10.7326/M22-0092. Epub 2022 May 31.
4
Automating risk of bias assessment in systematic reviews: a real-time mixed methods comparison of human researchers to a machine learning system.自动化系统评价中的偏倚风险评估:人类研究人员与机器学习系统的实时混合方法比较。
BMC Med Res Methodol. 2022 Jun 8;22(1):167. doi: 10.1186/s12874-022-01649-y.
5
Towards the automatic risk of bias assessment on randomized controlled trials: A comparison of RobotReviewer and humans.迈向随机对照试验自动偏倚风险评估:RobotReviewer 与人类的比较。
Res Synth Methods. 2024 Nov;15(6):1111-1119. doi: 10.1002/jrsm.1761. Epub 2024 Sep 26.
6
Poor reliability between Cochrane reviewers and blinded external reviewers when applying the Cochrane risk of bias tool in physical therapy trials.在物理治疗试验中应用Cochrane偏倚风险工具时,Cochrane综述作者与盲法外部评审者之间的可靠性较差。
PLoS One. 2014 May 13;9(5):e96920. doi: 10.1371/journal.pone.0096920. eCollection 2014.
7
Disagreements in risk of bias assessment for randomized controlled trials in hypertension-related Cochrane reviews.高血压相关 Cochrane 综述中随机对照试验偏倚风险评估的分歧。
Trials. 2024 Jun 21;25(1):405. doi: 10.1186/s13063-024-08145-2.
8
Technology-assisted risk of bias assessment in systematic reviews: a prospective cross-sectional evaluation of the RobotReviewer machine learning tool.技术辅助的系统评价偏倚风险评估:对 RobotReviewer 机器学习工具的前瞻性横断面评估。
J Clin Epidemiol. 2018 Apr;96:54-62. doi: 10.1016/j.jclinepi.2017.12.015. Epub 2017 Dec 28.
9
Practicalities of using a modified version of the Cochrane Collaboration risk of bias tool for randomised and non-randomised study designs applied in a health technology assessment setting.在健康技术评估环境中应用改良版 Cochrane 协作偏倚风险工具评估随机和非随机研究设计的实用性。
Res Synth Methods. 2014 Sep;5(3):200-11. doi: 10.1002/jrsm.1102. Epub 2013 Nov 14.
10

引用本文的文献

1
Using a large language model (ChatGPT) to assess risk of bias in randomized controlled trials of medical interventions: protocol for a pilot study of interrater agreement with human reviewers.使用大语言模型(ChatGPT)评估医学干预随机对照试验中的偏倚风险:与人类评审员进行评分者间一致性的初步研究方案
BMC Med Res Methodol. 2025 Jul 31;25(1):182. doi: 10.1186/s12874-025-02631-0.
2
Digital Tools to Support the Systematic Review Process: An Introduction.支持系统评价过程的数字工具:简介
J Eval Clin Pract. 2025 Apr;31(3):e70100. doi: 10.1111/jep.70100.
3
Concordance between humans and GPT-4 in appraising the methodological quality of case reports and case series using the Murad tool.
人类与 GPT-4 评估病例报告和病例系列研究方法学质量的一致性:使用 Murad 工具。
BMC Med Res Methodol. 2024 Nov 4;24(1):266. doi: 10.1186/s12874-024-02372-6.
4
An exploration of available methods and tools to improve the efficiency of systematic review production: a scoping review.探索提高系统评价制作效率的可用方法和工具:范围综述。
BMC Med Res Methodol. 2024 Sep 18;24(1):210. doi: 10.1186/s12874-024-02320-4.
5
Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed.生物医学文献系统评价自动化:PubMed 索引研究的范围综述。
Syst Rev. 2024 Jul 8;13(1):174. doi: 10.1186/s13643-024-02592-3.
6
Rapid review methods series: Guidance on the use of supportive software.快速审查方法系列:支持性软件使用指南。
BMJ Evid Based Med. 2024 Jul 23;29(4):264-271. doi: 10.1136/bmjebm-2023-112530.