随机对照试验文章的自动置信度分级分类：循证医学的辅助手段

Automated confidence ranked classification of randomized controlled trial articles: an aid to evidence-based medicine.

作者信息

Cohen Aaron M, Smalheiser Neil R, McDonagh Marian S, Yu Clement, Adams Clive E, Davis John M, Yu Philip S

机构信息

Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR 97239 USA

Department of Psychiatry, University of Illinois at Chicago, Chicago, IL 60612 USA.

出版信息

J Am Med Inform Assoc. 2015 May;22(3):707-17. doi: 10.1093/jamia/ocu025. Epub 2015 Feb 5.

DOI:10.1093/jamia/ocu025

PMID:25656516

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4457112/

Abstract

OBJECTIVE

For many literature review tasks, including systematic review (SR) and other aspects of evidence-based medicine, it is important to know whether an article describes a randomized controlled trial (RCT). Current manual annotation is not complete or flexible enough for the SR process. In this work, highly accurate machine learning predictive models were built that include confidence predictions of whether an article is an RCT.

MATERIALS AND METHODS

The LibSVM classifier was used with forward selection of potential feature sets on a large human-related subset of MEDLINE to create a classification model requiring only the citation, abstract, and MeSH terms for each article.

RESULTS

The model achieved an area under the receiver operating characteristic curve of 0.973 and mean squared error of 0.013 on the held out year 2011 data. Accurate confidence estimates were confirmed on a manually reviewed set of test articles. A second model not requiring MeSH terms was also created, and performs almost as well.

DISCUSSION

Both models accurately rank and predict article RCT confidence. Using the model and the manually reviewed samples, it is estimated that about 8000 (3%) additional RCTs can be identified in MEDLINE, and that 5% of articles tagged as RCTs in Medline may not be identified.

CONCLUSION

Retagging human-related studies with a continuously valued RCT confidence is potentially more useful for article ranking and review than a simple yes/no prediction. The automated RCT tagging tool should offer significant savings of time and effort during the process of writing SRs, and is a key component of a multistep text mining pipeline that we are building to streamline SR workflow. In addition, the model may be useful for identifying errors in MEDLINE publication types. The RCT confidence predictions described here have been made available to users as a web service with a user query form front end at: http://arrowsmith.psych.uic.edu/cgi-bin/arrowsmith_uic/RCT_Tagger.cgi.

摘要

目的

对于许多文献综述任务，包括系统综述（SR）以及循证医学的其他方面，了解一篇文章是否描述了随机对照试验（RCT）很重要。当前的手动标注对于SR过程而言不够完整或灵活。在这项研究中，构建了高度准确的机器学习预测模型，该模型包括关于一篇文章是否为RCT的置信度预测。

材料与方法

使用LibSVM分类器，并在MEDLINE中一个与人类相关的大型子集中对潜在特征集进行前向选择，以创建一个仅需每篇文章的引用、摘要和医学主题词（MeSH）的分类模型。

结果

该模型在2011年留出的数据上实现了受试者操作特征曲线下面积为0.973，均方误差为0.013。在一组经人工审核的测试文章上证实了准确的置信度估计。还创建了一个不需要MeSH词的第二个模型，其表现几乎同样出色。

讨论

两个模型都能准确地对文章的RCT置信度进行排名和预测。使用该模型和人工审核的样本估计，在MEDLINE中可额外识别出约8000篇（3%）RCT，并且Medline中标记为RCT的文章可能有5%未被识别。

结论

用连续值的RCT置信度对与人类相关的研究进行重新标注，对于文章排名和综述而言可能比简单的是/否预测更有用。自动化的RCT标注工具在撰写SR的过程中应能显著节省时间和精力，并且是我们正在构建的用于简化SR工作流程的多步骤文本挖掘管道的关键组成部分。此外，该模型可能有助于识别MEDLINE出版物类型中的错误。这里描述的RCT置信度预测已作为一项网络服务提供给用户，其前端有用户查询表单，网址为：http://arrowsmith.psych.uic.edu/cgi-bin/arrowsmith_uic/RCT_Tagger.cgi。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff4a/4457112/6181a8888b39/ocu025f1p.jpg

相似文献

Automated confidence ranked classification of randomized controlled trial articles: an aid to evidence-based medicine.

J Am Med Inform Assoc. 2015 May;22(3):707-17. doi: 10.1093/jamia/ocu025. Epub 2015 Feb 5.

A quantitative model for linking two disparate sets of articles in MEDLINE.

Bioinformatics. 2007 Jul 1;23(13):1658-65. doi: 10.1093/bioinformatics/btm161. Epub 2007 Apr 26.

A probabilistic automated tagger to identify human-related publications.

Database (Oxford). 2018 Jan 1;2018:1-8. doi: 10.1093/database/bay079.

Identifying reports of randomized controlled trials (RCTs) via a hybrid machine learning and crowdsourcing approach.

J Am Med Inform Assoc. 2017 Nov 1;24(6):1165-1168. doi: 10.1093/jamia/ocx053.

Machine learning for identifying Randomized Controlled Trials: An evaluation and practitioner's guide.

Res Synth Methods. 2018 Dec;9(4):602-614. doi: 10.1002/jrsm.1287. Epub 2018 Feb 7.

Evaluation of publication type tagging as a strategy to screen randomized controlled trial articles in preparing systematic reviews.

JAMIA Open. 2022 Mar 30;5(1):ooac015. doi: 10.1093/jamiaopen/ooac015. eCollection 2022 Apr.

Wnt pathway curation using automated natural language processing: combining statistical methods with partial and full parse for knowledge extraction.

Bioinformatics. 2005 Apr 15;21(8):1653-8. doi: 10.1093/bioinformatics/bti165. Epub 2004 Nov 25.

Automated information extraction of key trial design elements from clinical trial publications.

AMIA Annu Symp Proc. 2008 Nov 6;2008:141-5.

Ranking the whole MEDLINE database according to a large training set using text indexing.

BMC Bioinformatics. 2005 Mar 24;6:75. doi: 10.1186/1471-2105-6-75.

Extracting drug-drug interaction articles from MEDLINE to improve the content of drug databases.

AMIA Annu Symp Proc. 2005;2005:216-20.

引用本文的文献

Enhancing Automatic PT Tagging for MEDLINE Citations Using Transformer-Based Models.

ArXiv. 2025 Jun 3:arXiv:2506.03321v1.

Publication Type Tagging using Transformer Models and Multi-Label Classification.

AMIA Annu Symp Proc. 2025 May 22;2024:818-827. eCollection 2024.

Enhancing automated indexing of publication types and study designs in biomedical literature using full-text features.

medRxiv. 2025 Apr 28:2025.04.23.25326300. doi: 10.1101/2025.04.23.25326300.

Issues regarding the Indexing of Adaptive Clinical Trial Articles.

medRxiv. 2025 Mar 11:2025.03.10.25323694. doi: 10.1101/2025.03.10.25323694.

Publication Type Tagging using Transformer Models and Multi-Label Classification.

medRxiv. 2025 Mar 7:2025.03.06.25323516. doi: 10.1101/2025.03.06.25323516.

PLoS One. 2024 Nov 18;19(11):e0313991. doi: 10.1371/journal.pone.0313991. eCollection 2024.

Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed.

Syst Rev. 2024 Jul 8;13(1):174. doi: 10.1186/s13643-024-02592-3.

How to optimize the systematic review process using AI tools.

JCPP Adv. 2024 Apr 23;4(2):e12234. doi: 10.1002/jcv2.12234. eCollection 2024 Jun.

Insights into the nutritional prevention of macular degeneration based on a comparative topic modeling approach.

PeerJ Comput Sci. 2024 Mar 20;10:e1940. doi: 10.7717/peerj-cs.1940. eCollection 2024.

Bat4RCT: A suite of benchmark data and baseline methods for text classification of randomized controlled trials.

PLoS One. 2023 Mar 24;18(3):e0283342. doi: 10.1371/journal.pone.0283342. eCollection 2023.

本文引用的文献

Design and implementation of Metta, a metasearch engine for biomedical literature retrieval intended for systematic reviewers.

Health Inf Sci Syst. 2014 Jan 10;2:1. doi: 10.1186/2047-2501-2-1. eCollection 2014.

Aggregator: a machine learning approach to identifying MEDLINE articles that derive from the same underlying clinical trial.

Methods. 2015 Mar;74:65-70. doi: 10.1016/j.ymeth.2014.11.006. Epub 2014 Nov 20.

A large-scale analysis of the reasons given for excluding articles that are retrieved by literature search during systematic review.

AMIA Annu Symp Proc. 2013 Nov 16;2013:379-87. eCollection 2013.

Feature engineering and a proposed decision-support system for systematic reviewers of medical evidence.

PLoS One. 2014 Jan 27;9(1):e86277. doi: 10.1371/journal.pone.0086277. eCollection 2014.

Rule-based deduplication of article records from bibliographic databases.

Database (Oxford). 2014 Jan 16;2014:bat086. doi: 10.1093/database/bat086. Print 2014.

Early versus delayed laparoscopic cholecystectomy for people with acute cholecystitis.

Cochrane Database Syst Rev. 2013 Jun 30(6):CD005440. doi: 10.1002/14651858.CD005440.pub3.

What is a rapid review? A methodological exploration of rapid reviews in Health Technology Assessments.

Int J Evid Based Healthc. 2012 Dec;10(4):397-410. doi: 10.1111/j.1744-1609.2012.00290.x.

MEDLINE clinical queries are robust when searching in recent publishing years.

J Am Med Inform Assoc. 2013 Mar-Apr;20(2):363-8. doi: 10.1136/amiajnl-2012-001075. Epub 2012 Sep 27.

Methods for the drug effectiveness review project.

BMC Med Res Methodol. 2012 Sep 12;12:140. doi: 10.1186/1471-2288-12-140.

Beyond PICO: the SPIDER tool for qualitative evidence synthesis.

Qual Health Res. 2012 Oct;22(10):1435-43. doi: 10.1177/1049732312452938. Epub 2012 Jul 24.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

随机对照试验文章的自动置信度分级分类：循证医学的辅助手段

Automated confidence ranked classification of randomized controlled trial articles: an aid to evidence-based medicine.

作者信息

机构信息

出版信息

OBJECTIVE

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

目的

材料与方法

结果

讨论

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献