• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习帮助研究人员评估临床试验中的偏倚:一项前瞻性、随机用户研究。

Machine learning to help researchers evaluate biases in clinical trials: a prospective, randomized user study.

机构信息

School of Population Health & Environmental Sciences, Faculty of Life Sciences and Medicine, King's College London, 3rd Floor, Addison House, Guy's Campus, London, SE1 1UL, UK.

Center for Evidence Synthesis in Health, Brown University, Providence, USA.

出版信息

BMC Med Inform Decis Mak. 2019 May 8;19(1):96. doi: 10.1186/s12911-019-0814-z.

DOI:10.1186/s12911-019-0814-z
PMID:31068178
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6505190/
Abstract

OBJECTIVE

Assessing risks of bias in randomized controlled trials (RCTs) is an important but laborious task when conducting systematic reviews. RobotReviewer (RR), an open-source machine learning (ML) system, semi-automates bias assessments. We conducted a user study of RobotReviewer, evaluating time saved and usability of the tool.

MATERIALS AND METHODS

Systematic reviewers applied the Cochrane Risk of Bias tool to four randomly selected RCT articles. Reviewers judged: whether an RCT was at low, or high/unclear risk of bias for each bias domain in the Cochrane tool (Version 1); and highlighted article text justifying their decision. For a random two of the four articles, the process was semi-automated: users were provided with ML-suggested bias judgments and text highlights. Participants could amend the suggestions if necessary. We measured time taken for the task, ML suggestions, usability via the System Usability Scale (SUS) and collected qualitative feedback.

RESULTS

For 41 volunteers, semi-automation was quicker than manual assessment (mean 755 vs. 824 s; relative time 0.75, 95% CI 0.62-0.92). Reviewers accepted 301/328 (91%) of the ML Risk of Bias (RoB) judgments, and 202/328 (62%) of text highlights without change. Overall, ML suggested text highlights had a recall of 0.90 (SD 0.14) and precision of 0.87 (SD 0.21) with respect to the users' final versions. Reviewers assigned the system a mean 77.7 SUS score, corresponding to a rating between "good" and "excellent".

CONCLUSIONS

Semi-automation (where humans validate machine learning suggestions) can improve the efficiency of evidence synthesis. Our system was rated highly usable, and expedited bias assessment of RCTs.

摘要

目的

在进行系统评价时,评估随机对照试验(RCT)的偏倚风险是一项重要但繁琐的任务。RobotReviewer(RR)是一个开源的机器学习(ML)系统,可半自动地进行偏倚评估。我们对 RobotReviewer 进行了用户研究,评估了该工具节省的时间和可用性。

材料和方法

系统评价员应用 Cochrane 偏倚风险工具对随机选择的四篇 RCT 文章进行评估。评价员判断:每篇 Cochrane 工具(第 1 版)中各偏倚领域的 RCT 是否为低风险或高/不确定风险;并突出显示文章文本,以证明其决策的合理性。对于随机选择的四篇文章中的两篇,该过程是半自动的:用户获得了 ML 建议的偏倚判断和文本突出显示。如有必要,参与者可以修改建议。我们测量了完成任务的时间、ML 建议、通过系统可用性量表(SUS)进行的可用性以及收集定性反馈。

结果

对于 41 名志愿者,半自动评估比手动评估更快(平均 755 秒与 824 秒;相对时间 0.75,95%CI 0.62-0.92)。审查员接受了 301/328(91%)的 ML 风险偏倚(RoB)判断,并且无需更改 202/328(62%)的文本突出显示。总体而言,ML 建议的文本突出显示在与用户最终版本的召回率为 0.90(SD 0.14)和精确度为 0.87(SD 0.21)。审查员为该系统分配了 77.7 SUS 平均得分,对应于“好”和“优秀”之间的评级。

结论

半自动(其中人类验证机器学习建议)可以提高证据综合的效率。我们的系统被评为高度可用,并加快了 RCT 的偏倚评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/917787bb80d6/12911_2019_814_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/4a4295deba3e/12911_2019_814_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/6270cb94faf5/12911_2019_814_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/f7383d37d6b8/12911_2019_814_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/0461319a21fa/12911_2019_814_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/3776be69ae6b/12911_2019_814_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/596a354a3db8/12911_2019_814_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/917787bb80d6/12911_2019_814_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/4a4295deba3e/12911_2019_814_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/6270cb94faf5/12911_2019_814_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/f7383d37d6b8/12911_2019_814_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/0461319a21fa/12911_2019_814_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/3776be69ae6b/12911_2019_814_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/596a354a3db8/12911_2019_814_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9855/6505190/917787bb80d6/12911_2019_814_Fig7_HTML.jpg

相似文献

1
Machine learning to help researchers evaluate biases in clinical trials: a prospective, randomized user study.机器学习帮助研究人员评估临床试验中的偏倚:一项前瞻性、随机用户研究。
BMC Med Inform Decis Mak. 2019 May 8;19(1):96. doi: 10.1186/s12911-019-0814-z.
2
Automating risk of bias assessment in systematic reviews: a real-time mixed methods comparison of human researchers to a machine learning system.自动化系统评价中的偏倚风险评估:人类研究人员与机器学习系统的实时混合方法比较。
BMC Med Res Methodol. 2022 Jun 8;22(1):167. doi: 10.1186/s12874-022-01649-y.
3
Comparing machine and human reviewers to evaluate the risk of bias in randomized controlled trials.比较机器和人工评审员评估随机对照试验偏倚风险。
Res Synth Methods. 2020 May;11(3):484-493. doi: 10.1002/jrsm.1398. Epub 2020 Mar 3.
4
Agreement in Risk of Bias Assessment Between RobotReviewer and Human Reviewers: An Evaluation Study on Randomised Controlled Trials in Nursing-Related Cochrane Reviews.机器人评估者与人工评估者在偏倚风险评估中的一致性:一项针对 Cochrane 护理相关综述中随机对照试验的评估研究。
J Nurs Scholarsh. 2021 Mar;53(2):246-254. doi: 10.1111/jnu.12628. Epub 2021 Feb 8.
5
Accuracy and Efficiency of Machine Learning-Assisted Risk-of-Bias Assessments in "Real-World" Systematic Reviews : A Noninferiority Randomized Controlled Trial.机器学习辅助“真实世界”系统评价偏倚风险评估的准确性和效率:一项非劣效性随机对照试验。
Ann Intern Med. 2022 Jul;175(7):1001-1009. doi: 10.7326/M22-0092. Epub 2022 May 31.
6
RobotReviewer: evaluation of a system for automatically assessing bias in clinical trials.机器人评审员:用于自动评估临床试验偏倚的系统评估
J Am Med Inform Assoc. 2016 Jan;23(1):193-201. doi: 10.1093/jamia/ocv044. Epub 2015 Jun 22.
7
Risk of bias judgments for random sequence generation in Cochrane systematic reviews were frequently not in line with Cochrane Handbook.Cochrane 系统评价中随机序列生成的偏倚风险判断常常与 Cochrane 手册不一致。
BMC Med Res Methodol. 2019 Aug 5;19(1):170. doi: 10.1186/s12874-019-0804-y.
8
Technology-assisted risk of bias assessment in systematic reviews: a prospective cross-sectional evaluation of the RobotReviewer machine learning tool.技术辅助的系统评价偏倚风险评估:对 RobotReviewer 机器学习工具的前瞻性横断面评估。
J Clin Epidemiol. 2018 Apr;96:54-62. doi: 10.1016/j.jclinepi.2017.12.015. Epub 2017 Dec 28.
9
The judgement of biases included in the category "other bias" in Cochrane systematic reviews of interventions: a systematic survey.Cochrane 系统评价干预措施中“其他偏倚”类别中偏倚的判断:系统调查。
BMC Med Res Methodol. 2019 Apr 11;19(1):77. doi: 10.1186/s12874-019-0718-8.
10
Rayyan-a web and mobile app for systematic reviews.Rayyan——一款用于系统评价的网络和移动应用程序。
Syst Rev. 2016 Dec 5;5(1):210. doi: 10.1186/s13643-016-0384-4.

引用本文的文献

1
Using a large language model (ChatGPT) to assess risk of bias in randomized controlled trials of medical interventions: protocol for a pilot study of interrater agreement with human reviewers.使用大语言模型(ChatGPT)评估医学干预随机对照试验中的偏倚风险:与人类评审员进行评分者间一致性的初步研究方案
BMC Med Res Methodol. 2025 Jul 31;25(1):182. doi: 10.1186/s12874-025-02631-0.
2
Assessing risk of bias in toxicological studies in the era of artificial intelligence.评估人工智能时代毒理学研究中的偏倚风险。
Arch Toxicol. 2025 Aug;99(8):3065-3090. doi: 10.1007/s00204-025-03978-5.
3
Digital Tools to Support the Systematic Review Process: An Introduction.

本文引用的文献

1
Moving toward the automation of the systematic review process: a summary of discussions at the second meeting of International Collaboration for the Automation of Systematic Reviews (ICASR).迈向系统评价流程自动化:第二届国际系统评价自动化合作组织(ICASR)会议讨论摘要。
Syst Rev. 2018 Jan 9;7(1):3. doi: 10.1186/s13643-017-0667-4.
2
Technology-assisted risk of bias assessment in systematic reviews: a prospective cross-sectional evaluation of the RobotReviewer machine learning tool.技术辅助的系统评价偏倚风险评估:对 RobotReviewer 机器学习工具的前瞻性横断面评估。
J Clin Epidemiol. 2018 Apr;96:54-62. doi: 10.1016/j.jclinepi.2017.12.015. Epub 2017 Dec 28.
3
支持系统评价过程的数字工具:简介
J Eval Clin Pract. 2025 Apr;31(3):e70100. doi: 10.1111/jep.70100.
4
Artificial Intelligence Software to Accelerate Screening for Living Systematic Reviews.加速实时系统评价筛选的人工智能软件
Clin Child Fam Psychol Rev. 2025 Apr 18. doi: 10.1007/s10567-025-00519-5.
5
Artificial intelligence in food and nutrition evidence: The challenges and opportunities.食品与营养领域人工智能的证据:挑战与机遇
PNAS Nexus. 2024 Oct 15;3(12):pgae461. doi: 10.1093/pnasnexus/pgae461. eCollection 2024 Dec.
6
Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed.生物医学文献系统评价自动化:PubMed 索引研究的范围综述。
Syst Rev. 2024 Jul 8;13(1):174. doi: 10.1186/s13643-024-02592-3.
7
Rapid reviews methods series: Guidance on team considerations, study selection, data extraction and risk of bias assessment.快速综述方法系列:团队考虑因素、研究选择、数据提取和偏倚风险评估指南。
BMJ Evid Based Med. 2023 Nov 22;28(6):418-423. doi: 10.1136/bmjebm-2022-112185.
8
Automating Quality Assessment of Medical Evidence in Systematic Reviews: Model Development and Validation Study.系统评价中医疗证据质量评估的自动化:模型开发和验证研究。
J Med Internet Res. 2023 Mar 13;25:e35568. doi: 10.2196/35568.
9
Reporting and transparent research practices in sports medicine and orthopaedic clinical trials: a meta-research study.运动医学和矫形外科临床试验中的报告和透明研究实践:一项元研究。
BMJ Open. 2022 Aug 8;12(8):e059347. doi: 10.1136/bmjopen-2021-059347.
10
Automating risk of bias assessment in systematic reviews: a real-time mixed methods comparison of human researchers to a machine learning system.自动化系统评价中的偏倚风险评估:人类研究人员与机器学习系统的实时混合方法比较。
BMC Med Res Methodol. 2022 Jun 8;22(1):167. doi: 10.1186/s12874-022-01649-y.
Automating Biomedical Evidence Synthesis: RobotReviewer.
生物医学证据合成自动化:机器人审阅者
Proc Conf Assoc Comput Linguist Meet. 2017 Jul;2017:7-12. doi: 10.18653/v1/P17-4002.
4
Rationale-Augmented Convolutional Neural Networks for Text Classification.用于文本分类的基于原理增强的卷积神经网络。
Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:795-804. doi: 10.18653/v1/d16-1076.
5
Extracting PICO Sentences from Clinical Trial Reports using .使用……从临床试验报告中提取PICO句子
J Mach Learn Res. 2016;17.
6
RobotReviewer: evaluation of a system for automatically assessing bias in clinical trials.机器人评审员:用于自动评估临床试验偏倚的系统评估
J Am Med Inform Assoc. 2016 Jan;23(1):193-201. doi: 10.1093/jamia/ocv044. Epub 2015 Jun 22.
7
Automating data extraction in systematic reviews: a systematic review.系统评价中数据提取的自动化:一项系统评价。
Syst Rev. 2015 Jun 15;4:78. doi: 10.1186/s13643-015-0066-7.
8
The automation of systematic reviews.系统评价的自动化
BMJ. 2013 Jan 10;346:f139. doi: 10.1136/bmj.f139.
9
The Cochrane Collaboration's tool for assessing risk of bias in randomised trials.Cochrane 协作网评估随机试验偏倚风险的工具。
BMJ. 2011 Oct 18;343:d5928. doi: 10.1136/bmj.d5928.
10
Seventy-five trials and eleven systematic reviews a day: how will we ever keep up?每天要处理七十五个试验和十一个系统评价:我们怎么才能跟得上?
PLoS Med. 2010 Sep 21;7(9):e1000326. doi: 10.1371/journal.pmed.1000326.