Automated information extraction of key trial design elements from clinical trial publications.

Authors

de Bruijn Berry, Carini Simona, Kiritchenko Svetlana, Martin Joel, Sim Ida

Affiliation

Institute for Information Technology, National Research Council, Ottawa, Ontario, Canada.

Publication

AMIA Annu Symp Proc. 2008 Nov 6;2008:141-5.

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2655966/

Abstract

Clinical trials are one of the most valuable sources of scientific evidence for improving the practice of medicine. The Trial Bank project aims to improve structured access to trial findings by incorporating formalized trial information into a knowledge base. Manually extracting trial information from published articles is costly, but automated information extraction techniques can assist. The current study presents a single architecture for extracting a wide array of information elements from full-text publications of randomized clinical trials (RCTs). This architecture combines a text classifier with a weak regular expression matcher. We tested this two-stage architecture on 88 RCT reports from 5 leading medical journals, extracting 23 elements of key trial information such as eligibility rules, sample size, intervention, and outcome names. The results suggest that this is a promising avenue for helping critical appraisers, systematic reviewers, and curators quickly identify key information elements in published RCT articles.
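
The two-stage idea described in the abstract (a text classifier that picks candidate sentences, then a weak regular expression that pulls out the fragment) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy naive Bayes scorer, the `SAMPLE_SIZE` pattern, the function names, and all training and test sentences are hypothetical, chosen only to show the control flow for one element (sample size).

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase alphabetic tokens; digits are ignored by the classifier."""
    return re.findall(r"[a-z]+", text.lower())


class TinySentenceClassifier:
    """Toy multinomial naive Bayes, a stand-in for a trained text classifier."""

    def fit(self, sentences, labels):
        self.counts = {True: Counter(), False: Counter()}
        self.n_docs = Counter(labels)
        for sentence, label in zip(sentences, labels):
            self.counts[label].update(tokenize(sentence))
        self.vocab = set(self.counts[True]) | set(self.counts[False])
        return self

    def score(self, sentence):
        """Log-odds that the sentence mentions the target element."""
        log_odds = math.log(self.n_docs[True] / self.n_docs[False])
        for token in tokenize(sentence):
            if token not in self.vocab:
                continue  # unseen words carry no evidence either way
            for label, sign in ((True, 1), (False, -1)):
                total = sum(self.counts[label].values())
                # Add-one smoothing over the shared vocabulary.
                p = (self.counts[label][token] + 1) / (total + len(self.vocab))
                log_odds += sign * math.log(p)
        return log_odds


# Stage 2: a deliberately loose ("weak") pattern for a sample-size fragment.
SAMPLE_SIZE = re.compile(
    r"\b(\d[\d,]*)\s+(?:patients|participants|subjects)\b", re.IGNORECASE
)


def extract_sample_size(sentences, classifier):
    """Stage 1 picks the best-scoring sentence; stage 2 applies the regex to it."""
    best = max(sentences, key=classifier.score)
    match = SAMPLE_SIZE.search(best)
    return best, (match.group(1) if match else None)


# Hypothetical labeled sentences: True means "mentions sample size".
train_sentences = [
    "A total of 120 patients were randomly assigned to two groups.",
    "We enrolled 85 participants between 2001 and 2003.",
    "The primary outcome was mortality at 30 days.",
    "Adverse events were recorded at each visit.",
]
train_labels = [True, True, False, False]
classifier = TinySentenceClassifier().fit(train_sentences, train_labels)

# Hypothetical article text, already split into sentences.
article = [
    "Blood pressure was measured at each clinic visit.",
    "In total, 240 patients were enrolled and randomly assigned.",
    "The primary outcome was change in systolic pressure.",
]
sentence, size = extract_sample_size(article, classifier)
print(sentence)
print(size)
```

Running the classifier first means the regular expression only has to be precise within one likely sentence, which is why a weak pattern can suffice; a real system would need one classifier and pattern per information element and far more training data.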
