• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

加速观察性研究中的证据综合:开发一个基于自然语言处理的智能化实时文献综述系统。

Accelerating Evidence Synthesis in Observational Studies: Development of a Living Natural Language Processing-Assisted Intelligent Systematic Literature Review System.

机构信息

IMO Health, 9600 W Bryn Mawr Ave # 100, Rosemont, IL, 60018, United States.

Merck & Co, Inc, 126 East Lincoln Ave, Rahway, NJ, United States, 1 619-643-2693.

出版信息

JMIR Med Inform. 2024 Oct 23;12:e54653. doi: 10.2196/54653.

DOI:10.2196/54653
PMID:39441204
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11523763/
Abstract

BACKGROUND

Systematic literature review (SLR), a robust method to identify and summarize evidence from published sources, is considered to be a complex, time-consuming, labor-intensive, and expensive task.

OBJECTIVE

This study aimed to present a solution based on natural language processing (NLP) that accelerates and streamlines the SLR process for observational studies using real-world data.

METHODS

We followed an agile software development and iterative software engineering methodology to build a customized intelligent end-to-end living NLP-assisted solution for observational SLR tasks. Multiple machine learning-based NLP algorithms were adopted to automate article screening and data element extraction processes. The NLP prediction results can be further reviewed and verified by domain experts, following the human-in-the-loop design. The system integrates explainable articificial intelligence to provide evidence for NLP algorithms and add transparency to extracted literature data elements. The system was developed based on 3 existing SLR projects of observational studies, including the epidemiology studies of human papillomavirus-associated diseases, the disease burden of pneumococcal diseases, and cost-effectiveness studies on pneumococcal vaccines.

RESULTS

Our Intelligent SLR Platform covers major SLR steps, including study protocol setting, literature retrieval, abstract screening, full-text screening, data element extraction from full-text articles, results summary, and data visualization. The NLP algorithms achieved accuracy scores of 0.86-0.90 on article screening tasks (framed as text classification tasks) and macroaverage F1 scores of 0.57-0.89 on data element extraction tasks (framed as named entity recognition tasks).

CONCLUSIONS

Cutting-edge NLP algorithms expedite SLR for observational studies, thus allowing scientists to have more time to focus on the quality of data and the synthesis of evidence in observational studies. Aligning the living SLR concept, the system has the potential to update literature data and enable scientists to easily stay current with the literature related to observational studies prospectively and continuously.

摘要

背景

系统文献综述(SLR)是一种从已发表的文献中识别和总结证据的强大方法,被认为是一项复杂、耗时、劳动密集且昂贵的任务。

目的

本研究旨在提出一种基于自然语言处理(NLP)的解决方案,以加速和简化使用真实世界数据的观察性研究的 SLR 过程。

方法

我们遵循敏捷软件开发和迭代软件工程方法,为观察性 SLR 任务构建了一个定制的智能端到端自然语言处理辅助解决方案。采用了多种基于机器学习的 NLP 算法来自动化文章筛选和数据元素提取过程。NLP 预测结果可以由领域专家进一步审查和验证,遵循人机交互设计。该系统集成了可解释的人工智能,为 NLP 算法提供证据,并为提取的文献数据元素增加透明度。该系统是基于 3 个现有的观察性研究 SLR 项目开发的,包括人乳头瘤病毒相关疾病的流行病学研究、肺炎球菌疾病的疾病负担以及肺炎球菌疫苗的成本效益研究。

结果

我们的智能 SLR 平台涵盖了主要的 SLR 步骤,包括研究方案设定、文献检索、摘要筛选、全文筛选、从全文文章中提取数据元素、结果总结和数据可视化。NLP 算法在文章筛选任务(框定为文本分类任务)中的准确率达到 0.86-0.90,在数据元素提取任务(框定为命名实体识别任务)中的宏平均 F1 得分达到 0.57-0.89。

结论

最先进的 NLP 算法加速了观察性研究的 SLR,从而使科学家有更多的时间专注于观察性研究中数据的质量和证据的综合。与现有的 SLR 概念保持一致,该系统具有更新文献数据的潜力,使科学家能够轻松地前瞻性和持续地跟踪与观察性研究相关的文献。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/c64119da6917/medinform-v12-e54653-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/6f67028e7b40/medinform-v12-e54653-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/5623baf76df7/medinform-v12-e54653-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/554fe897759d/medinform-v12-e54653-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/e73b8a5f2d8e/medinform-v12-e54653-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/d0988fc3878c/medinform-v12-e54653-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/c64119da6917/medinform-v12-e54653-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/6f67028e7b40/medinform-v12-e54653-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/5623baf76df7/medinform-v12-e54653-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/554fe897759d/medinform-v12-e54653-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/e73b8a5f2d8e/medinform-v12-e54653-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/d0988fc3878c/medinform-v12-e54653-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd2/11523763/c64119da6917/medinform-v12-e54653-g006.jpg

相似文献

1
Accelerating Evidence Synthesis in Observational Studies: Development of a Living Natural Language Processing-Assisted Intelligent Systematic Literature Review System.加速观察性研究中的证据综合:开发一个基于自然语言处理的智能化实时文献综述系统。
JMIR Med Inform. 2024 Oct 23;12:e54653. doi: 10.2196/54653.
2
Machine Learning and Natural Language Processing in Mental Health: Systematic Review.机器学习和自然语言处理在心理健康中的应用:系统综述。
J Med Internet Res. 2021 May 4;23(5):e15708. doi: 10.2196/15708.
3
Can large language models replace humans in systematic reviews? Evaluating GPT-4's efficacy in screening and extracting data from peer-reviewed and grey literature in multiple languages.大型语言模型能否在系统评价中取代人类?评估 GPT-4 从多种语言的同行评议文献和灰色文献中进行筛选和提取数据的效果。
Res Synth Methods. 2024 Jul;15(4):616-626. doi: 10.1002/jrsm.1715. Epub 2024 Mar 14.
4
Machine learning models for abstract screening task - A systematic literature review application for health economics and outcome research.用于摘要筛选任务的机器学习模型 - 健康经济学和结果研究的系统文献综述应用。
BMC Med Res Methodol. 2024 May 9;24(1):108. doi: 10.1186/s12874-024-02224-3.
5
Ascle-A Python Natural Language Processing Toolkit for Medical Text Generation: Development and Evaluation Study.Ascle-A 是一个用于医疗文本生成的 Python 自然语言处理工具包:开发和评估研究。
J Med Internet Res. 2024 Oct 3;26:e60601. doi: 10.2196/60601.
6
Evaluation of a prototype machine learning tool to semi-automate data extraction for systematic literature reviews.评价一个机器学习原型工具,以半自动提取系统文献综述的数据。
Syst Rev. 2023 Oct 6;12(1):187. doi: 10.1186/s13643-023-02351-w.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
Clinical named entity recognition and relation extraction using natural language processing of medical free text: A systematic review.临床命名实体识别和关系抽取技术在医学自然语言处理中的应用:系统综述。
Int J Med Inform. 2023 Sep;177:105122. doi: 10.1016/j.ijmedinf.2023.105122. Epub 2023 Jun 5.
9
Natural language processing was effective in assisting rapid title and abstract screening when updating systematic reviews.自然语言处理在更新系统评价时,有助于快速进行标题和摘要筛选。
J Clin Epidemiol. 2021 May;133:121-129. doi: 10.1016/j.jclinepi.2021.01.010. Epub 2021 Jan 21.
10
Methods for using Bing's AI-powered search engine for data extraction for a systematic review.使用必应的人工智能搜索引擎进行数据提取以进行系统评价的方法。
Res Synth Methods. 2024 Mar;15(2):347-353. doi: 10.1002/jrsm.1689. Epub 2023 Dec 8.

引用本文的文献

1
High-precision information retrieval for rapid clinical guideline updates.用于快速更新临床指南的高精度信息检索。
NPJ Digit Med. 2025 Apr 27;8(1):227. doi: 10.1038/s41746-025-01648-5.
2
Artificial intelligence in orthopedic research assistance: a resident's perspective.骨科研究辅助中的人工智能:住院医师视角
Musculoskelet Surg. 2025 Feb 17. doi: 10.1007/s12306-025-00894-w.

本文引用的文献

1
LitMC-BERT: Transformer-Based Multi-Label Classification of Biomedical Literature With An Application on COVID-19 Literature Curation.LitMC-BERT:基于 Transformer 的生物医学文献多标签分类及其在 COVID-19 文献整理中的应用。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Sep-Oct;19(5):2584-2595. doi: 10.1109/TCBB.2022.3173562. Epub 2022 Oct 10.
2
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
3
The significant cost of systematic reviews and meta-analyses: A call for greater involvement of machine learning to assess the promise of clinical trials.
系统评价和荟萃分析的高昂成本:呼吁机器学习更多地参与评估临床试验的前景。
Contemp Clin Trials Commun. 2019 Aug 25;16:100443. doi: 10.1016/j.conctc.2019.100443. eCollection 2019 Dec.
4
A Deep Learning Method to Automatically Identify Reports of Scientifically Rigorous Clinical Research from the Biomedical Literature: Comparative Analytic Study.一种从生物医学文献中自动识别科学严谨的临床研究报告的深度学习方法:比较分析研究。
J Med Internet Res. 2018 Jun 25;20(6):e10281. doi: 10.2196/10281.
5
What kind of systematic review should I conduct? A proposed typology and guidance for systematic reviewers in the medical and health sciences.我应该进行什么样的系统评价?医学和健康科学系统评价者的建议分类法和指南。
BMC Med Res Methodol. 2018 Jan 10;18(1):5. doi: 10.1186/s12874-017-0468-4.
6
Systematic review automation technologies.系统评价自动化技术。
Syst Rev. 2014 Jul 9;3:74. doi: 10.1186/2046-4053-3-74.
7
Living systematic reviews: an emerging opportunity to narrow the evidence-practice gap.实时系统评价:缩小证据-实践差距的新契机。
PLoS Med. 2014 Feb 18;11(2):e1001603. doi: 10.1371/journal.pmed.1001603. eCollection 2014 Feb.
8
What is a support vector machine?什么是支持向量机?
Nat Biotechnol. 2006 Dec;24(12):1565-7. doi: 10.1038/nbt1206-1565.