Suppr超能文献

验证GAN-BioBERT:一种评估临床试验报告趋势的方法。

Validating GAN-BioBERT: A Methodology for Assessing Reporting Trends in Clinical Trials.

作者信息

Myszewski Joshua J, Klossowski Emily, Meyer Patrick, Bevil Kristin, Klesius Lisa, Schroeder Kristopher M

机构信息

School of Medicine and Public Health, University of Wisconsin, Madison, WI, United States.

University of Wisconsin-Milwaukee, Milwaukee, WI, United States.

出版信息

Front Digit Health. 2022 May 24;4:878369. doi: 10.3389/fdgth.2022.878369. eCollection 2022.

Abstract

BACKGROUND

The aim of this study was to validate a three-class sentiment classification model for clinical trial abstracts combining adversarial learning and the BioBERT language processing model as a tool to assess trends in biomedical literature in a clearly reproducible manner. We then assessed the model's performance for this application and compared it to previous models used for this task.

METHODS

Using 108 expert-annotated clinical trial abstracts and 2,000 unlabeled abstracts this study develops a three-class sentiment classification algorithm for clinical trial abstracts. The model uses a semi-supervised model based on the Bidirectional Encoder Representation from Transformers (BERT) model, a much more advanced and accurate method compared to previously used models based upon traditional machine learning methods. The prediction performance was compared to those previous studies.

RESULTS

The algorithm was found to have a classification accuracy of 91.3%, with a macro F1-Score of 0.92, significantly outperforming previous studies used to classify sentiment in clinical trial literature, while also making the sentiment classification finer grained with greater reproducibility.

CONCLUSION

We demonstrate an easily applied sentiment classification model for clinical trial abstracts that significantly outperforms previous models with greater reproducibility and applicability to large-scale study of reporting trends.

摘要

背景

本研究的目的是验证一种用于临床试验摘要的三类情感分类模型,该模型结合了对抗学习和BioBERT语言处理模型,作为以清晰可重复的方式评估生物医学文献趋势的工具。然后,我们评估了该模型在此应用中的性能,并将其与以前用于此任务的模型进行了比较。

方法

本研究使用108篇专家注释的临床试验摘要和2000篇未标记的摘要,开发了一种用于临床试验摘要的三类情感分类算法。该模型使用基于变换器双向编码器表示(BERT)模型的半监督模型,与以前基于传统机器学习方法的模型相比,这是一种更先进、更准确的方法。将预测性能与以前的研究进行了比较。

结果

发现该算法的分类准确率为91.3%,宏F1分数为0.92,显著优于以前用于对临床试验文献中的情感进行分类的研究,同时还使情感分类更细化,具有更高的可重复性。

结论

我们展示了一种易于应用的临床试验摘要情感分类模型,该模型显著优于以前的模型,具有更高的可重复性和对报告趋势大规模研究的适用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cace/9170913/96501322c3a3/fdgth-04-878369-g0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验