Performance of active learning models for screening prioritization in systematic reviews: a simulation study into the Average Time to Discover relevant records.

Affiliations

Department of Methodology and Statistics, Faculty of Social and Behavioral Sciences, Utrecht University, Utrecht, Netherlands.

Department of Research and Data Management Services, Information Technology Services, Utrecht University, Utrecht, The Netherlands.

Publication information

Syst Rev. 2023 Jun 20;12(1):100. doi: 10.1186/s13643-023-02257-7.

DOI:10.1186/s13643-023-02257-7
PMID:37340494
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10280866/
Abstract

BACKGROUND

Conducting a systematic review demands a significant amount of effort in screening titles and abstracts. To accelerate this process, various tools that utilize active learning have been proposed. These tools allow the reviewer to interact with machine learning software to identify relevant publications as early as possible. The goal of this study is to gain a comprehensive understanding of active learning models for reducing the workload in systematic reviews through a simulation study.

METHODS

The simulation study mimics the process of a human reviewer screening records while interacting with an active learning model. Different active learning models were compared based on four classification techniques (naive Bayes, logistic regression, support vector machines, and random forest) and two feature extraction strategies (TF-IDF and doc2vec). The performance of the models was compared for six systematic review datasets from different research areas. The evaluation of the models was based on the Work Saved over Sampling (WSS) and recall. Additionally, this study introduces two new statistics, Time to Discovery (TD) and Average Time to Discovery (ATD).
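
The simulated screening process described above can be illustrated with a minimal sketch. It is not the authors' implementation: it uses a small hand-written multinomial naive Bayes on raw word counts (not TF-IDF or doc2vec), assumes the prior knowledge contains at least one relevant and one irrelevant record, and always presents the record the model currently ranks as most likely relevant. The function names, toy texts, and labels are hypothetical.

```python
import math
from collections import Counter


def train_nb(docs, labels):
    """Fit a tiny multinomial naive Bayes on tokenized documents."""
    vocab = {w for d in docs for w in d}
    counts = {0: Counter(), 1: Counter()}  # word counts per class
    n_docs = Counter(labels)               # documents per class
    for d, y in zip(docs, labels):
        counts[y].update(d)
    return vocab, counts, n_docs


def relevance_score(doc, model):
    """Log-odds that a document is relevant (add-one smoothing)."""
    vocab, counts, n_docs = model
    # Assumption: both classes are present in the labeled set.
    score = math.log(n_docs[1] / n_docs[0])
    totals = {y: sum(counts[y].values()) + len(vocab) for y in (0, 1)}
    for w in doc:
        if w in vocab:  # words never seen in training are ignored
            score += math.log((counts[1][w] + 1) / totals[1])
            score -= math.log((counts[0][w] + 1) / totals[0])
    return score


def simulate_screening(texts, true_labels, prior_idx):
    """Return the order in which records are screened.

    The known true labels stand in for the human reviewer's decisions.
    """
    docs = [t.lower().split() for t in texts]
    screened = list(prior_idx)
    pool = [i for i in range(len(docs)) if i not in screened]
    while pool:
        model = train_nb([docs[i] for i in screened],
                         [true_labels[i] for i in screened])
        # Present the record the model considers most likely relevant.
        best = max(pool, key=lambda i: relevance_score(docs[i], model))
        screened.append(best)
        pool.remove(best)
    return screened


# Hypothetical toy dataset: 1 = relevant, 0 = irrelevant.
texts = [
    "active learning screening systematic review",
    "weather forecast rainfall model",
    "machine learning screening abstracts review",
    "football league results season",
    "systematic review screening prioritization",
    "rainfall season weather report",
]
true_labels = [1, 0, 1, 0, 1, 0]

# Prior knowledge: one relevant (0) and one irrelevant (1) record.
order = simulate_screening(texts, true_labels, prior_idx=[0, 1])
print(order)  # the relevant records 2 and 4 come right after the priors
```

Retraining after every labeling decision keeps the sketch short; on datasets of realistic size one would likely retrain in batches and use a library classifier instead.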

RESULTS

The models reduce the number of publications that need to be screened by 63.9 to 91.7% while still finding 95% of all relevant records (WSS@95). Recall of the models was defined as the proportion of relevant records found after screening 10% of all records and ranges from 53.6 to 99.8%. The ATD values range from 1.4 to 11.7% and indicate the average proportion of labeling decisions the researcher needs to make to detect a relevant record. The ATD values display a similar ranking across the simulations as the recall and WSS values.
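
The statistics reported here can be computed from the order in which records were screened. The sketch below rests on assumptions that may differ in detail from the paper: it uses the common definition WSS@r = (N - n_r)/N - (1 - r), where n_r is the number of records screened when recall r is reached; it takes the TD of a relevant record to be its position in the screening order; and it takes the ATD to be the mean TD divided by the total number of records. The handling of prior-knowledge records is ignored, and the example labels are hypothetical.

```python
from math import ceil

EPS = 1e-9  # guards against floating-point noise before rounding up


def time_to_discovery(labels):
    """1-based screening positions at which relevant records (1) appear."""
    return [pos for pos, y in enumerate(labels, start=1) if y == 1]


def average_time_to_discovery(labels):
    """Mean TD of the relevant records, as a fraction of all records."""
    td = time_to_discovery(labels)
    return sum(td) / len(td) / len(labels)


def recall_at(labels, fraction=0.10):
    """Share of relevant records found after screening `fraction` of all."""
    n_screened = ceil(fraction * len(labels) - EPS)
    return sum(labels[:n_screened]) / sum(labels)


def wss_at(labels, recall=0.95):
    """Work Saved over Sampling at the given recall level."""
    td = time_to_discovery(labels)
    needed = ceil(recall * len(td) - EPS)  # relevant records to be found
    n_screened = td[needed - 1]            # records screened to get there
    return (len(labels) - n_screened) / len(labels) - (1 - recall)


# Hypothetical screening order: 20 records, 4 relevant,
# found at positions 1, 2, 4, and 8.
labels = [1, 1, 0, 1, 0, 0, 0, 1] + [0] * 12

print(time_to_discovery(labels))                    # [1, 2, 4, 8]
print(round(average_time_to_discovery(labels), 4))  # 0.1875
print(round(recall_at(labels, 0.10), 4))            # 0.5
print(round(wss_at(labels, 0.95), 4))               # 0.55
```

Unlike recall@10% and WSS@95, the ATD in this sketch uses every relevant record's position, so it needs no cut-off parameter.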

CONCLUSIONS

Active learning models for screening prioritization demonstrate significant potential for reducing the workload in systematic reviews. The Naive Bayes + TF-IDF model yielded the best results overall. The Average Time to Discovery (ATD) measures performance of active learning models throughout the entire screening process without the need for an arbitrary cut-off point. This makes the ATD a promising metric for comparing the performance of different models across different datasets.
