• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用自然语言处理和机器学习简化和加速研究的自动化文献综述工具(LiteRev):描述性性能评估研究。

An Automated Literature Review Tool (LiteRev) for Streamlining and Accelerating Research Using Natural Language Processing and Machine Learning: Descriptive Performance Evaluation Study.

机构信息

Institute of Global Health, University of Geneva, Geneva, Switzerland.

Médecins Sans Frontières, Geneva, Switzerland.

出版信息

J Med Internet Res. 2023 Sep 15;25:e39736. doi: 10.2196/39736.

DOI:10.2196/39736
PMID:37713261
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10541641/
Abstract

BACKGROUND

Literature reviews (LRs) identify, evaluate, and synthesize relevant papers to a particular research question to advance understanding and support decision-making. However, LRs, especially traditional systematic reviews, are slow, resource-intensive, and become outdated quickly.

OBJECTIVE

LiteRev is an advanced and enhanced version of an existing automation tool designed to assist researchers in conducting LRs through the implementation of cutting-edge technologies such as natural language processing and machine learning techniques. In this paper, we present a comprehensive explanation of LiteRev's capabilities, its methodology, and an evaluation of its accuracy and efficiency to a manual LR, highlighting the benefits of using LiteRev.

METHODS

Based on the user's query, LiteRev performs an automated search on a wide range of open-access databases and retrieves relevant metadata on the resulting papers, including abstracts or full texts when available. These abstracts (or full texts) are text processed and represented as a term frequency-inverse document frequency matrix. Using dimensionality reduction (pairwise controlled manifold approximation) and clustering (hierarchical density-based spatial clustering of applications with noise) techniques, the corpus is divided into different topics described by a list of the most important keywords. The user can then select one or several topics of interest, enter additional keywords to refine its search, or provide key papers to the research question. Based on these inputs, LiteRev performs a k-nearest neighbor (k-NN) search and suggests a list of potentially interesting papers. By tagging the relevant ones, the user triggers new k-NN searches until no additional paper is suggested for screening. To assess the performance of LiteRev, we ran it in parallel to a manual LR on the burden and care for acute and early HIV infection in sub-Saharan Africa. We assessed the performance of LiteRev using true and false predictive values, recall, and work saved over sampling.

RESULTS

LiteRev extracted, processed, and transformed text into a term frequency-inverse document frequency matrix of 631 unique papers from PubMed. The topic modeling module identified 16 topics and highlighted 2 topics of interest to the research question. Based on 18 key papers, the k-NNs module suggested 193 papers for screening out of 613 papers in total (31.5% of the whole corpus) and correctly identified 64 relevant papers out of the 87 papers found by the manual abstract screening (recall rate of 73.6%). Compared to the manual full text screening, LiteRev identified 42 relevant papers out of the 48 papers found manually (recall rate of 87.5%). This represents a total work saved over sampling of 56%.

CONCLUSIONS

We presented the features and functionalities of LiteRev, an automation tool that uses natural language processing and machine learning methods to streamline and accelerate LRs and support researchers in getting quick and in-depth overviews on any topic of interest.

摘要

背景

文献综述(LRs)通过识别、评估和综合与特定研究问题相关的论文,来促进理解并支持决策。然而,LRs,特别是传统的系统综述,速度慢、资源密集且很快就会过时。

目的

LiteRev 是一款现有自动化工具的高级增强版本,旨在通过实施自然语言处理和机器学习技术等前沿技术,帮助研究人员进行 LRs。在本文中,我们全面介绍了 LiteRev 的功能、方法以及与手动 LR 相比的准确性和效率评估,突出了使用 LiteRev 的优势。

方法

根据用户的查询,LiteRev 在广泛的开放获取数据库上执行自动搜索,并检索有关论文的相关元数据,包括摘要或全文(如果可用)。这些摘要(或全文)经过文本处理并表示为词频-逆文档频率矩阵。使用降维(成对控制流形逼近)和聚类(基于密度的层次空间聚类应用噪声)技术,语料库被分为不同的主题,每个主题由一系列最重要的关键词描述。然后,用户可以选择一个或多个感兴趣的主题,输入其他关键词来细化搜索,或提供关键论文来回答研究问题。基于这些输入,LiteRev 执行 k-最近邻(k-NN)搜索,并建议一系列可能感兴趣的论文。通过标记相关论文,用户触发新的 k-NN 搜索,直到没有进一步的论文可供筛选。为了评估 LiteRev 的性能,我们在撒哈拉以南非洲急性和早期 HIV 感染的负担和护理方面,与手动 LR 并行运行。我们使用真阳性和假阳性预测值、召回率和节省的工作来评估 LiteRev 的性能。

结果

LiteRev 从 PubMed 中提取、处理和转换了 631 篇独特论文的文本,生成了词频-逆文档频率矩阵。主题建模模块识别出 16 个主题,并突出了 2 个与研究问题相关的主题。基于 18 篇关键论文,k-NN 模块建议筛选 193 篇论文,而总共(整个语料库的 31.5%)有 613 篇论文,正确识别出手动摘要筛选中找到的 87 篇相关论文中的 64 篇(召回率为 73.6%)。与手动全文筛选相比,LiteRev 从手动筛选中找到的 48 篇论文中识别出 42 篇相关论文(召回率为 87.5%)。这代表着采样节省了 56%的工作。

结论

我们介绍了 LiteRev 的功能和特点,这是一款使用自然语言处理和机器学习方法的自动化工具,可以简化和加速 LRs,并帮助研究人员快速深入地了解任何感兴趣的主题。

相似文献

1
An Automated Literature Review Tool (LiteRev) for Streamlining and Accelerating Research Using Natural Language Processing and Machine Learning: Descriptive Performance Evaluation Study.一种使用自然语言处理和机器学习简化和加速研究的自动化文献综述工具(LiteRev):描述性性能评估研究。
J Med Internet Res. 2023 Sep 15;25:e39736. doi: 10.2196/39736.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Machine Learning-Based Approach for Identifying Research Gaps: COVID-19 as a Case Study.基于机器学习的研究空白识别方法:以COVID-19为例
JMIR Form Res. 2024 Mar 5;8:e49411. doi: 10.2196/49411.
4
The Effectiveness of Integrated Care Pathways for Adults and Children in Health Care Settings: A Systematic Review.综合护理路径在医疗环境中对成人和儿童的有效性:一项系统评价。
JBI Libr Syst Rev. 2009;7(3):80-129. doi: 10.11124/01938924-200907030-00001.
5
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
6
Text mining to support abstract screening for knowledge syntheses: a semi-automated workflow.文本挖掘支持知识综合的摘要筛选:一种半自动化工作流程。
Syst Rev. 2021 May 26;10(1):156. doi: 10.1186/s13643-021-01700-x.
7
Evaluation of a semi-automated data extraction tool for public health literature-based reviews: Dextr.评估一种用于公共卫生文献综述的半自动数据提取工具:Dextr。
Environ Int. 2022 Jan 15;159:107025. doi: 10.1016/j.envint.2021.107025. Epub 2021 Dec 14.
8
SWIFT-Review: a text-mining workbench for systematic review.SWIFT-Review:一个用于系统评价的文本挖掘工作台。
Syst Rev. 2016 May 23;5:87. doi: 10.1186/s13643-016-0263-z.
9
Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed.生物医学文献系统评价自动化:PubMed 索引研究的范围综述。
Syst Rev. 2024 Jul 8;13(1):174. doi: 10.1186/s13643-024-02592-3.
10
Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives.开发和评估 RapTAT:一种用于从医学叙述中映射短语概念的机器学习系统。
J Biomed Inform. 2014 Apr;48:54-65. doi: 10.1016/j.jbi.2013.11.008. Epub 2013 Dec 4.

引用本文的文献

1
Year 2023 in Biomedical Natural Language Processing: a Tribute to Large Language Models and Generative AI.2023年生物医学自然语言处理领域:向大语言模型和生成式人工智能致敬。
Yearb Med Inform. 2024 Aug;33(1):241-248. doi: 10.1055/s-0044-1800751. Epub 2025 Apr 8.
2
Predicting blood-brain barrier permeability of molecules with a large language model and machine learning.利用大语言模型和机器学习预测分子的血脑屏障通透性。
Sci Rep. 2024 Jul 9;14(1):15844. doi: 10.1038/s41598-024-66897-y.

本文引用的文献

1
The Systematic Review Toolbox: keeping up to date with tools to support evidence synthesis.系统评价工具包:保持对支持证据综合工具的更新。
Syst Rev. 2022 Dec 1;11(1):258. doi: 10.1186/s13643-022-02122-z.
2
The Impact of Systematic Review Automation Tools on Methodological Quality and Time Taken to Complete Systematic Review Tasks: Case Study.系统评价自动化工具对方法学质量及完成系统评价任务所需时间的影响:案例研究
JMIR Med Educ. 2021 May 31;7(2):e24418. doi: 10.2196/24418.
3
Social, Behavioral, and Cultural factors of HIV in Malawi: Semi-Automated Systematic Review.
马拉维艾滋病毒的社会、行为和文化因素:半自动系统评价。
J Med Internet Res. 2020 Aug 14;22(8):e18747. doi: 10.2196/18747.
4
A full systematic review was completed in 2 weeks using automation tools: a case study.在两周内使用自动化工具完成了全面的系统回顾:案例研究。
J Clin Epidemiol. 2020 May;121:81-90. doi: 10.1016/j.jclinepi.2020.01.008. Epub 2020 Jan 28.
5
Software tools to support title and abstract screening for systematic reviews in healthcare: an evaluation.支持医疗保健系统评价标题和摘要筛选的软件工具:评价。
BMC Med Res Methodol. 2020 Jan 13;20(1):7. doi: 10.1186/s12874-020-0897-3.
6
Treatment as Prevention: Concepts and Challenges for Reducing HIV Incidence.治疗即预防:降低艾滋病毒感染率的概念与挑战
J Acquir Immune Defic Syndr. 2019 Dec 1;82 Suppl 2(2):S104-S112. doi: 10.1097/QAI.0000000000002168.
7
Meeting the review family: exploring review types and associated information retrieval requirements.满足审稿人需求:探索审稿类型及相关信息检索要求。
Health Info Libr J. 2019 Sep;36(3):202-222. doi: 10.1111/hir.12276.
8
The disconnect between individual-level and population-level HIV prevention benefits of antiretroviral treatment.抗逆转录病毒治疗在个体和人群层面预防 HIV 效果之间的脱节。
Lancet HIV. 2019 Sep;6(9):e632-e638. doi: 10.1016/S2352-3018(19)30226-7. Epub 2019 Jul 19.
9
Making progress with the automation of systematic reviews: principles of the International Collaboration for the Automation of Systematic Reviews (ICASR).在系统评价自动化方面取得进展:国际系统评价自动化合作(ICASR)的原则。
Syst Rev. 2018 May 19;7(1):77. doi: 10.1186/s13643-018-0740-7.
10
Acute HIV infection detection and immediate treatment estimated to reduce transmission by 89% among men who have sex with men in Bangkok.据估计,在曼谷男男性行为者中,急性HIV感染检测及立即治疗可使传播率降低89%。
J Int AIDS Soc. 2017 Jun 28;20(1):21708. doi: 10.7448/IAS.20.1.21708.