Tejani Ali S, Ng Yee S, Xi Yin, Fielding Julia R, Browning Travis G, Rayan Jesse C
Department of Radiology, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390.
Radiol Artif Intell. 2022 Jun 29;4(4):e220007. doi: 10.1148/ryai.220007. eCollection 2022 Jul.
To develop and evaluate domain-specific and pretrained bidirectional encoder representations from transformers (BERT) models in a transfer learning task across varying training dataset sizes, with the goal of annotating a larger overall dataset.
The authors retrospectively reviewed 69 095 anonymized adult chest radiograph reports (reports dated April 2020-March 2021). From the overall cohort, 1004 reports were randomly selected and labeled for the presence or absence of each of the following devices: endotracheal tube (ETT), enterogastric tube (NGT, or Dobhoff tube), central venous catheter (CVC), and Swan-Ganz catheter (SGC). Pretrained transformer models (BERT, PubMedBERT, DistilBERT, RoBERTa, and DeBERTa) were trained, validated, and tested on 60%, 20%, and 20% of these reports, respectively, through fivefold cross-validation. Additional training runs varied the training dataset size, using 5%, 10%, 15%, 20%, and 40% of the 1004 reports. The best-performing epochs were used to assess area under the receiver operating characteristic curve (AUC) and to determine run time on the overall dataset.
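The abstract does not include code; the following is a minimal sketch of how fine-tuning a pretrained transformer for multi-label device detection could look with the Hugging Face transformers library, not the authors' implementation. The file name, column names, base model, and hyperparameters are assumptions for illustration only.

```python
# Illustrative sketch only -- not the authors' code. Assumes a CSV of labeled
# reports with columns "report_text", "ETT", "NGT", "CVC", "SGC" (0/1 labels).
import torch
import pandas as pd
from sklearn.model_selection import train_test_split
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

DEVICES = ["ETT", "NGT", "CVC", "SGC"]
MODEL_NAME = "bert-base-uncased"  # could be swapped for PubMedBERT, RoBERTa, DeBERTa, etc.

df = pd.read_csv("labeled_reports.csv")  # hypothetical file of annotated reports
train_df, eval_df = train_test_split(df, test_size=0.2, random_state=42)

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)

class ReportDataset(torch.utils.data.Dataset):
    """Tokenizes report text and packages multi-label device targets."""
    def __init__(self, frame):
        self.enc = tokenizer(list(frame["report_text"]), truncation=True,
                             padding="max_length", max_length=256)
        self.labels = frame[DEVICES].values.astype("float32")
    def __len__(self):
        return len(self.labels)
    def __getitem__(self, idx):
        item = {k: torch.tensor(v[idx]) for k, v in self.enc.items()}
        item["labels"] = torch.tensor(self.labels[idx])
        return item

# Multi-label head: sigmoid outputs with binary cross-entropy loss.
model = AutoModelForSequenceClassification.from_pretrained(
    MODEL_NAME, num_labels=len(DEVICES),
    problem_type="multi_label_classification")

args = TrainingArguments(output_dir="out", num_train_epochs=3,
                         per_device_train_batch_size=16,
                         evaluation_strategy="epoch")

trainer = Trainer(model=model, args=args,
                  train_dataset=ReportDataset(train_df),
                  eval_dataset=ReportDataset(eval_df))
trainer.train()
```

In the study itself, this kind of fine-tuning was repeated under fivefold cross-validation and with training subsets ranging from 5% to 40% of the 1004 labeled reports.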
The highest average AUCs from fivefold cross-validation were 0.996 for ETT (RoBERTa), 0.994 for NGT (RoBERTa), 0.991 for CVC (PubMedBERT), and 0.98 for SGC (PubMedBERT). DeBERTa achieved the highest AUC for each support device when trained on 5% of the training set. PubMedBERT maintained a higher AUC than BERT as the training set size decreased. Training and validation time was shortest for DistilBERT, at 3 minutes 39 seconds on the annotated cohort.
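As a hedged illustration of how per-device AUCs like those reported above could be computed from a fine-tuned model's held-out predictions (the array names and shapes here are assumptions, not the authors' evaluation code):

```python
# Illustrative evaluation sketch -- variable names are placeholders.
# `logits` is an (N, 4) array of raw model outputs on held-out reports;
# `y_true` is the corresponding (N, 4) array of 0/1 device labels.
import numpy as np
from scipy.special import expit
from sklearn.metrics import roc_auc_score

DEVICES = ["ETT", "NGT", "CVC", "SGC"]

def per_device_auc(logits: np.ndarray, y_true: np.ndarray) -> dict:
    """Return the AUC for each support device from raw multi-label logits."""
    probs = expit(logits)  # sigmoid: convert logits to per-label probabilities
    return {name: roc_auc_score(y_true[:, i], probs[:, i])
            for i, name in enumerate(DEVICES)}
```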
Pretrained and domain-specific transformer models required small training datasets and short training times to create a highly accurate final model that expedites autonomous annotation of large datasets. Keywords: Informatics, Named Entity Recognition, Transfer Learning. ©RSNA, 2022. See also the commentary by Zech in this issue.