• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

迈向评估临床试验出版物报告的透明度。

Toward assessing clinical trial publications for reporting transparency.

作者信息

Kilicoglu Halil, Rosemblat Graciela, Hoang Linh, Wadhwa Sahil, Peng Zeshan, Malički Mario, Schneider Jodi, Ter Riet Gerben

机构信息

School of Information Sciences, University of Illinois at Urbana-Champaign, Champaign, IL, USA; U.S. National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.

U.S. National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.

出版信息

J Biomed Inform. 2021 Apr;116:103717. doi: 10.1016/j.jbi.2021.103717. Epub 2021 Feb 26.

DOI:10.1016/j.jbi.2021.103717
PMID:33647518
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8112250/
Abstract

OBJECTIVE

To annotate a corpus of randomized controlled trial (RCT) publications with the checklist items of CONSORT reporting guidelines and using the corpus to develop text mining methods for RCT appraisal.

METHODS

We annotated a corpus of 50 RCT articles at the sentence level using 37 fine-grained CONSORT checklist items. A subset (31 articles) was double-annotated and adjudicated, while 19 were annotated by a single annotator and reconciled by another. We calculated inter-annotator agreement at the article and section level using MASI (Measuring Agreement on Set-Valued Items) and at the CONSORT item level using Krippendorff's α. We experimented with two rule-based methods (phrase-based and section header-based) and two supervised learning approaches (support vector machine and BioBERT-based neural network classifiers), for recognizing 17 methodology-related items in the RCT Methods sections.

RESULTS

We created CONSORT-TM consisting of 10,709 sentences, 4,845 (45%) of which were annotated with 5,246 labels. A median of 28 CONSORT items (out of possible 37) were annotated per article. Agreement was moderate at the article and section levels (average MASI: 0.60 and 0.64, respectively). Agreement varied considerably among individual checklist items (Krippendorff's α= 0.06-0.96). The model based on BioBERT performed best overall for recognizing methodology-related items (micro-precision: 0.82, micro-recall: 0.63, micro-F1: 0.71). Combining models using majority vote and label aggregation further improved precision and recall, respectively.

CONCLUSION

Our annotated corpus, CONSORT-TM, contains more fine-grained information than earlier RCT corpora. Low frequency of some CONSORT items made it difficult to train effective text mining models to recognize them. For the items commonly reported, CONSORT-TM can serve as a testbed for text mining methods that assess RCT transparency, rigor, and reliability, and support methods for peer review and authoring assistance. Minor modifications to the annotation scheme and a larger corpus could facilitate improved text mining models. CONSORT-TM is publicly available at https://github.com/kilicogluh/CONSORT-TM.

摘要

目的

用CONSORT报告指南的清单项目注释随机对照试验(RCT)出版物语料库,并使用该语料库开发用于RCT评估的文本挖掘方法。

方法

我们使用37个细粒度的CONSORT清单项目在句子层面注释了一个包含50篇RCT文章的语料库。一个子集(31篇文章)进行了双人注释和裁决,而19篇由一名注释者注释并由另一名注释者核对。我们使用MASI(集值项目一致性测量)在文章和章节层面以及使用Krippendorff's α在CONSORT项目层面计算注释者间一致性。我们试验了两种基于规则的方法(基于短语和基于章节标题)和两种监督学习方法(支持向量机和基于BioBERT的神经网络分类器),用于识别RCT方法部分中的17个与方法相关的项目。

结果

我们创建了CONSORT-TM,它由10,709个句子组成,其中4,845个(45%)被标注了5,246个标签。每篇文章标注的CONSORT项目中位数为28个(可能的37个项目中)。在文章和章节层面一致性为中等(平均MASI分别为0.60和0.64)。各个清单项目之间的一致性差异很大(Krippendorff's α = 0.06 - 0.96)。基于BioBERT的模型在识别与方法相关的项目方面总体表现最佳(微精度:0.82,微召回率:0.63,微F1值:0.71)。使用多数投票和标签聚合组合模型分别进一步提高了精度和召回率。

结论

我们注释的语料库CONSORT-TM比早期的RCT语料库包含更细粒度的信息。一些CONSORT项目的低频使得难以训练有效的文本挖掘模型来识别它们。对于常见报告的项目,CONSORT-TM可以作为评估RCT透明度、严谨性和可靠性的文本挖掘方法的测试平台,并支持同行评审和作者辅助方法。对注释方案进行小的修改并增加语料库规模可以促进改进文本挖掘模型。CONSORT-TM可在https://github.com/kilicogluh/CONSORT-TM上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3226/8112250/ed8ccf7f64cc/nihms-1681592-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3226/8112250/db739e518f31/nihms-1681592-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3226/8112250/ed8ccf7f64cc/nihms-1681592-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3226/8112250/db739e518f31/nihms-1681592-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3226/8112250/ed8ccf7f64cc/nihms-1681592-f0002.jpg

相似文献

1
Toward assessing clinical trial publications for reporting transparency.迈向评估临床试验出版物报告的透明度。
J Biomed Inform. 2021 Apr;116:103717. doi: 10.1016/j.jbi.2021.103717. Epub 2021 Feb 26.
2
Text classification models for assessing the completeness of randomized controlled trial publications based on CONSORT reporting guidelines.基于 CONSORT 报告规范的评估随机对照试验出版物完整性的文本分类模型。
Sci Rep. 2024 Sep 17;14(1):21721. doi: 10.1038/s41598-024-72130-7.
3
SPIRIT-CONSORT-TM: a corpus for assessing transparency of clinical trial protocol and results publications.SPIRIT-CONSORT-TM:一个用于评估临床试验方案和结果出版物透明度的语料库。
Sci Data. 2025 Feb 28;12(1):355. doi: 10.1038/s41597-025-04629-1.
4
SPIRIT-CONSORT-TM: a corpus for assessing transparency of clinical trial protocol and results publications.SPIRIT-CONSORT-TM:一个用于评估临床试验方案和结果出版物透明度的语料库。
medRxiv. 2025 Jan 15:2025.01.14.25320543. doi: 10.1101/2025.01.14.25320543.
5
CONSORT-TM: Text classification models for assessing the completeness of randomized controlled trial publications.CONSORT-TM:用于评估随机对照试验出版物完整性的文本分类模型。
medRxiv. 2024 Apr 1:2024.03.31.24305138. doi: 10.1101/2024.03.31.24305138.
6
Reporting Quality of Randomized Controlled Trials of Periodontal Diseases in Journal Abstracts-A Cross-sectional Survey and Bibliometric Analysis.期刊摘要中牙周病随机对照试验的报告质量:横断面调查和文献计量分析。
J Evid Based Dent Pract. 2018 Jun;18(2):130-141.e22. doi: 10.1016/j.jebdp.2017.08.005. Epub 2017 Sep 21.
7
Methodological information extraction from randomized controlled trial publications: a pilot study.从随机对照试验出版物中提取方法学信息:一项初步研究。
AMIA Annu Symp Proc. 2023 Apr 29;2022:542-551. eCollection 2022.
8
Investigating the impact of weakly supervised data on text mining models of publication transparency: a case study on randomized controlled trials.研究弱监督数据对出版物透明度文本挖掘模型的影响:以随机对照试验为例。
AMIA Jt Summits Transl Sci Proc. 2022 May 23;2022:254-263. eCollection 2022.
9
Consolidated standards of reporting trials (CONSORT) and the completeness of reporting of randomised controlled trials (RCTs) published in medical journals.试验报告的统一标准(CONSORT)以及医学期刊上发表的随机对照试验(RCT)的报告完整性。
Cochrane Database Syst Rev. 2012 Nov 14;11(11):MR000030. doi: 10.1002/14651858.MR000030.pub2.
10
Methodology reporting improved over time in 176,469 randomized controlled trials.方法学报告在 176469 项随机对照试验中随着时间的推移而改善。
J Clin Epidemiol. 2023 Oct;162:19-28. doi: 10.1016/j.jclinepi.2023.08.004. Epub 2023 Aug 9.

引用本文的文献

1
Large Language Model Analysis of Reporting Quality of Randomized Clinical Trial Articles: A Systematic Review.随机临床试验文章报告质量的大语言模型分析:一项系统评价
JAMA Netw Open. 2025 Aug 1;8(8):e2529418. doi: 10.1001/jamanetworkopen.2025.29418.
2
SPIRIT-CONSORT-TM: a corpus for assessing transparency of clinical trial protocol and results publications.SPIRIT-CONSORT-TM:一个用于评估临床试验方案和结果出版物透明度的语料库。
Sci Data. 2025 Feb 28;12(1):355. doi: 10.1038/s41597-025-04629-1.
3
The Maastricht Intensive Care COVID Cohort: A Critical Appraisal of the Predefined Research Questions.

本文引用的文献

1
The past, present and future of Registered Reports.注册报告的过去、现在与未来。
Nat Hum Behav. 2022 Jan;6(1):29-42. doi: 10.1038/s41562-021-01193-7. Epub 2021 Nov 15.
2
The Rigor and Transparency Index Quality Metric for Assessing Biological and Medical Science Methods.用于评估生物医学科学方法的严谨性与透明度指数质量指标
iScience. 2020 Oct 20;23(11):101698. doi: 10.1016/j.isci.2020.101698. eCollection 2020 Nov 20.
3
Menagerie: A text-mining tool to support animal-human translation in neurodegeneration research.动物园:一种文本挖掘工具,用于支持神经退行性疾病研究中的动物-人类翻译。
马斯特里赫特重症监护 COVID 队列研究:对预定义研究问题的批判性评估
Crit Care Explor. 2025 Feb 3;7(2):e1211. doi: 10.1097/CCE.0000000000001211. eCollection 2025 Feb 1.
4
SPIRIT-CONSORT-TM: a corpus for assessing transparency of clinical trial protocol and results publications.SPIRIT-CONSORT-TM:一个用于评估临床试验方案和结果出版物透明度的语料库。
medRxiv. 2025 Jan 15:2025.01.14.25320543. doi: 10.1101/2025.01.14.25320543.
5
The Impact of Temperature on Extracting Information From Clinical Trial Publications Using Large Language Models.温度对使用大语言模型从临床试验出版物中提取信息的影响
Cureus. 2024 Dec 15;16(12):e75748. doi: 10.7759/cureus.75748. eCollection 2024 Dec.
6
Predicting the sample size of randomized controlled trials using natural language processing.使用自然语言处理预测随机对照试验的样本量。
JAMIA Open. 2024 Oct 25;7(4):ooae116. doi: 10.1093/jamiaopen/ooae116. eCollection 2024 Dec.
7
Text classification models for assessing the completeness of randomized controlled trial publications based on CONSORT reporting guidelines.基于 CONSORT 报告规范的评估随机对照试验出版物完整性的文本分类模型。
Sci Rep. 2024 Sep 17;14(1):21721. doi: 10.1038/s41598-024-72130-7.
8
CONSORT-TM: Text classification models for assessing the completeness of randomized controlled trial publications.CONSORT-TM:用于评估随机对照试验出版物完整性的文本分类模型。
medRxiv. 2024 Apr 1:2024.03.31.24305138. doi: 10.1101/2024.03.31.24305138.
9
Automatic categorization of self-acknowledged limitations in randomized controlled trial publications.自我承认的随机对照试验出版物局限性的自动分类。
J Biomed Inform. 2024 Apr;152:104628. doi: 10.1016/j.jbi.2024.104628. Epub 2024 Mar 26.
10
Retrieval augmented scientific claim verification.检索增强型科学声明验证
JAMIA Open. 2024 Feb 21;7(1):ooae021. doi: 10.1093/jamiaopen/ooae021. eCollection 2024 Apr.
PLoS One. 2019 Dec 17;14(12):e0226176. doi: 10.1371/journal.pone.0226176. eCollection 2019.
4
Improving reference prioritisation with PICO recognition.通过 PICO 识别提高文献优先排序。
BMC Med Inform Decis Mak. 2019 Dec 5;19(1):256. doi: 10.1186/s12911-019-0992-8.
5
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
6
Checklists work to improve science.清单有助于改进科学。
Nature. 2018 Apr;556(7701):273-274. doi: 10.1038/d41586-018-04590-7.
7
A manual corpus of annotated main findings of clinical case reports.一份标注了临床病例报告主要发现的人工语料库。
Database (Oxford). 2019 Jan 1;2019:bay143. doi: 10.1093/database/bay143.
8
A Corpus with Multi-Level Annotations of Patients, Interventions and Outcomes to Support Language Processing for Medical Literature.一个带有患者、干预措施和结果的多层次注释的语料库,以支持医学文献的语言处理。
Proc Conf Assoc Comput Linguist Meet. 2018 Jul;2018:197-207.
9
Automatic recognition of self-acknowledged limitations in clinical research literature.临床研究文献中自我承认局限性的自动识别。
J Am Med Inform Assoc. 2018 Jul 1;25(7):855-861. doi: 10.1093/jamia/ocy038.
10
Biomedical text mining for research rigor and integrity: tasks, challenges, directions.生物医学文本挖掘的研究严谨性和完整性:任务、挑战和方向。
Brief Bioinform. 2018 Nov 27;19(6):1400-1414. doi: 10.1093/bib/bbx057.