
Automated tabulation of clinical trial results: A joint entity and relation extraction approach with transformer-based language representations.

Affiliation

Department of Computer Science, University College London, Gower Street, London, WC1E 6BT, UK.

Publication information

Artif Intell Med. 2023 Oct;144:102661. doi: 10.1016/j.artmed.2023.102661. Epub 2023 Sep 7.

DOI: 10.1016/j.artmed.2023.102661
PMID: 37783549

Abstract

Evidence-based medicine, the practice in which healthcare professionals refer to the best available evidence when making decisions, forms the foundation of modern healthcare. However, it relies on labour-intensive systematic reviews, where domain specialists must aggregate and extract information from thousands of publications, primarily of randomised controlled trial (RCT) results, into evidence tables. This paper investigates automating evidence table generation by decomposing the problem across two language processing tasks: named entity recognition, which identifies key entities within text, such as drug names, and relation extraction, which maps their relationships for separating them into ordered tuples. We focus on the automatic tabulation of sentences from published RCT abstracts that report the results of the study outcomes. Two deep neural net models were developed as part of a joint extraction pipeline, using the principles of transfer learning and transformer-based language representations. To train and test these models, a new gold-standard corpus was developed, comprising over 550 result sentences from six disease areas. This approach demonstrated significant advantages, with our system performing well across multiple natural language processing tasks and disease areas, as well as in generalising to disease domains unseen during training. Furthermore, we show these results were achievable through training our models on as few as 170 example sentences. The final system is a proof of concept that the generation of evidence tables can be semi-automated, representing a step towards fully automating systematic reviews.
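
The two-task decomposition described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' system: the label set (`OUTCOME`, `ARM`, `MEASURE`), the example sentence, the hand-written tags and the hand-written relation pairs are all hypothetical stand-ins for what trained named entity recognition and relation extraction models would produce. The sketch only shows how tagged entities and their relations could be combined into ordered tuples, one per row of an evidence table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entity:
    text: str
    label: str  # hypothetical label set: "OUTCOME", "ARM", "MEASURE"


def decode_bio(tokens, tags):
    """Group BIO-tagged tokens into entity spans (stand-in for NER output)."""
    entities, current, label = [], [], None
    for token, tag in zip(tokens, tags):
        starts_new = tag.startswith("B-") or (
            tag.startswith("I-") and tag[2:] != label
        )
        if starts_new:
            if current:
                entities.append(Entity(" ".join(current), label))
            current, label = [token], tag[2:]
        elif tag.startswith("I-"):
            current.append(token)  # continue the open entity
        else:  # "O" tag closes any open entity
            if current:
                entities.append(Entity(" ".join(current), label))
            current, label = [], None
    if current:
        entities.append(Entity(" ".join(current), label))
    return entities


def build_rows(entities, relations):
    """Turn (measure_index, linked_index) pairs into (outcome, arm, measure) tuples."""
    rows = []
    for i, ent in enumerate(entities):
        if ent.label != "MEASURE":
            continue
        # Collect the entities this measure is linked to, keyed by label.
        linked = {
            entities[tail].label: entities[tail].text
            for head, tail in relations
            if head == i
        }
        rows.append((linked.get("OUTCOME"), linked.get("ARM"), ent.text))
    return rows


# Made-up result sentence with hand-assigned tags (assumption, not corpus data).
tokens = ["Mean", "HbA1c", "fell", "by", "1.2%", "with", "metformin",
          "and", "0.4%", "with", "placebo", "."]
tags = ["B-OUTCOME", "I-OUTCOME", "O", "O", "B-MEASURE", "O", "B-ARM",
        "O", "B-MEASURE", "O", "B-ARM", "O"]

entities = decode_bio(tokens, tags)
# Hand-written relations standing in for a relation extraction model:
# each pair links a MEASURE entity index to an OUTCOME or ARM entity index.
relations = [(1, 0), (1, 2), (3, 0), (3, 4)]

for row in build_rows(entities, relations):
    print(row)
```

Each printed tuple corresponds to one evidence table row. In a real pipeline the tags and relation pairs would come from the two models rather than being written by hand.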
