基于机器学习模型集成与 BERT 语言模型的脑 CT 报告文本描述分析用于判断颅内出血的比较研究

Comparison of an Ensemble of Machine Learning Models and the BERT Language Model for Analysis of Text Descriptions of Brain CT Reports to Determine the Presence of Intracranial Hemorrhage.

机构信息

Junior Researcher, Department of Innovative Technologies; Scientific and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Department of Health, Bldg 1, 24 Petrovka St., Moscow, 127051, Russia.

Junior Researcher, Department of Medical Informatics, Radiomics and Radiogenomics; Scientific and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Department of Health, Bldg 1, 24 Petrovka St., Moscow, 127051, Russia.

出版信息

Sovrem Tekhnologii Med. 2024;16(1):27-34. doi: 10.17691/stm2024.16.1.03. Epub 2024 Feb 28.

DOI:10.17691/stm2024.16.1.03

PMID:39421632

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11482096/

Abstract

UNLABELLED

is to train and test an ensemble of machine learning models, as well as to compare its performance with the BERT language model pre-trained on medical data to perform simple binary classification, i.e., determine the presence/absence of the signs of intracranial hemorrhage (ICH) in brain CT reports.

MATERIALS AND METHODS

Seven machine learning algorithms and three text vectorization techniques were selected as models to solve the binary classification problem. These models were trained on textual data represented by 3980 brain CT reports from 56 inpatient medical facilities in Moscow. The study utilized three text vectorization techniques: bag of words, TF-IDF, and word2vec. The resulting data were then processed by the following machine learning algorithms: decision tree, random forest, logistic regression, nearest neighbors, support vector machines, Catboost, and XGboost. Data analysis and pre-processing were performed using NLTK (Natural Language Toolkit, version 3.6.5), libraries for character-based and statistical processing of natural language, and Scikit-learn (version 0.24.2), a library for machine learning containing tools to tackle classification challenges. MedRuBertTiny2 was taken as a BERT transformer model pre-trained on medical data.

RESULTS

Based on the training and testing outcomes from seven machine learning algorithms, the authors selected three algorithms that yielded the highest metrics (i.e. sensitivity and specificity): CatBoost, logistic regression, and nearest neighbors. The highest metrics were achieved by the bag of words technique. These algorithms were assembled into an ensemble using the stacking technique. The sensitivity and specificity for the validation dataset separated from the original sample were 0.93 and 0.90, respectively. Next, the ensemble and the BERT model were trained on an independent dataset containing 9393 textual radiology reports also divided into training and test sets. Once the ensemble was tested on this dataset, the resulting sensitivity and specificity were 0.92 and 0.90, respectively. The BERT model tested on these data demonstrated a sensitivity of 0.97 and a specificity of 0.90.

CONCLUSION

When analyzing textual reports of brain CT scans with signs of intracranial hemorrhage, the trained ensemble demonstrated high accuracy metrics. Still, manual quality control of the results is required during its application. The pre-trained BERT transformer model, additionally trained on diagnostic textual reports, demonstrated higher accuracy metrics (p<0.05). The results show promise in terms of finding specific values for both binary classification task and in-depth analysis of unstructured medical information.

摘要

未加标签

目的是训练和测试一组机器学习模型，并将其性能与基于医学数据预训练的 BERT 语言模型进行比较，以执行简单的二分类任务，即确定脑 CT 报告中是否存在颅内出血（ICH）的迹象。

材料和方法

选择了七种机器学习算法和三种文本向量化技术作为模型来解决二分类问题。这些模型是基于来自莫斯科 56 家住院医疗机构的 3980 份脑 CT 报告的文本数据进行训练的。研究采用了三种文本向量化技术：词袋、TF-IDF 和 word2vec。然后，通过以下机器学习算法对生成的数据进行处理：决策树、随机森林、逻辑回归、最近邻、支持向量机、Catboost 和 XGboost。数据分析和预处理使用了 NLTK（自然语言工具包，版本 3.6.5），这是一个用于字符和自然语言统计处理的库，以及 Scikit-learn（版本 0.24.2），这是一个包含用于解决分类挑战的工具的机器学习库。MedRuBertTiny2 被用作基于医学数据预训练的 BERT 转换器模型。

结果

基于七种机器学习算法的训练和测试结果，作者选择了三种产生最高指标（即敏感性和特异性）的算法：Catboost、逻辑回归和最近邻。词袋技术获得了最高的指标。这些算法使用堆叠技术组合成一个集成。从原始样本中分离出来的验证数据集的灵敏度和特异性分别为 0.93 和 0.90。接下来，在包含 9393 份文本放射学报告的独立数据集上对集成和 BERT 模型进行了训练，这些报告也分为训练集和测试集。在对该数据集进行测试后，得到的灵敏度和特异性分别为 0.92 和 0.90。在这些数据上测试的 BERT 模型表现出 0.97 的灵敏度和 0.90 的特异性。

结论

在分析具有颅内出血迹象的脑 CT 扫描的文本报告时，训练好的集成模型表现出了较高的准确性指标。然而，在应用过程中仍需要对结果进行人工质量控制。此外，经过训练的 BERT 转换器模型在诊断文本报告上进行了进一步训练，表现出了更高的准确性指标（p<0.05）。这些结果在二进制分类任务和深入分析非结构化医疗信息方面具有一定的价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26c3/11482096/3e24f22b6db3/STM-16-1-03-f1.jpg

相似文献

Comparison of an Ensemble of Machine Learning Models and the BERT Language Model for Analysis of Text Descriptions of Brain CT Reports to Determine the Presence of Intracranial Hemorrhage.

Sovrem Tekhnologii Med. 2024;16(1):27-34. doi: 10.17691/stm2024.16.1.03. Epub 2024 Feb 28.

Text Analysis of Radiology Reports with Signs of Intracranial Hemorrhage on Brain CT Scans Using the Decision Tree Algorithm.

Sovrem Tekhnologii Med. 2022;14(6):34-40. doi: 10.17691/stm2022.14.6.04. Epub 2022 Nov 28.

Natural language processing of head CT reports to identify intracranial mass effect: CTIME algorithm.

Am J Emerg Med. 2022 Jan;51:388-392. doi: 10.1016/j.ajem.2021.11.001. Epub 2021 Nov 9.

Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT).

BMC Med Inform Decis Mak. 2022 Jul 30;22(1):200. doi: 10.1186/s12911-022-01946-y.

Transformer versus traditional natural language processing: how much data is enough for automated radiology report classification?

Br J Radiol. 2023 Sep;96(1149):20220769. doi: 10.1259/bjr.20220769. Epub 2023 May 25.

Automated Outcome Classification of Computed Tomography Imaging Reports for Pediatric Traumatic Brain Injury.

Acad Emerg Med. 2016 Feb;23(2):171-8. doi: 10.1111/acem.12859. Epub 2016 Jan 14.

Natural Language Processing for Imaging Protocol Assignment: Machine Learning for Multiclass Classification of Abdominal CT Protocols Using Indication Text Data.

J Digit Imaging. 2022 Oct;35(5):1120-1130. doi: 10.1007/s10278-022-00633-8. Epub 2022 Jun 2.

Natural language processing augments comorbidity documentation in neurosurgical inpatient admissions.

PLoS One. 2024 May 9;19(5):e0303519. doi: 10.1371/journal.pone.0303519. eCollection 2024.

Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports.

J Digit Imaging. 2018 Apr;31(2):178-184. doi: 10.1007/s10278-017-0027-x.

Classifying the lifestyle status for Alzheimer's disease from clinical notes using deep learning with weak supervision.

BMC Med Inform Decis Mak. 2022 Jul 7;22(Suppl 1):88. doi: 10.1186/s12911-022-01819-4.

本文引用的文献

Text Analysis of Radiology Reports with Signs of Intracranial Hemorrhage on Brain CT Scans Using the Decision Tree Algorithm.

Sovrem Tekhnologii Med. 2022;14(6):34-40. doi: 10.17691/stm2022.14.6.04. Epub 2022 Nov 28.

Machine learning technologies in CT-based diagnostics and classification of intracranial hemorrhages.

Zh Vopr Neirokhir Im N N Burdenko. 2023;87(2):85-91. doi: 10.17116/neiro20238702185.

A pre-trained BERT for Korean medical natural language processing.

Sci Rep. 2022 Aug 16;12(1):13847. doi: 10.1038/s41598-022-17806-8.

Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT).

BMC Med Inform Decis Mak. 2022 Jul 30;22(1):200. doi: 10.1186/s12911-022-01946-y.

Ensemble Approaches to Recognize Protected Health Information in Radiology Reports.

J Digit Imaging. 2022 Dec;35(6):1694-1698. doi: 10.1007/s10278-022-00673-0. Epub 2022 Jun 17.

Natural language processing of head CT reports to identify intracranial mass effect: CTIME algorithm.

Am J Emerg Med. 2022 Jan;51:388-392. doi: 10.1016/j.ajem.2021.11.001. Epub 2021 Nov 9.

The reporting quality of natural language processing studies: systematic review of studies of radiology reports.

BMC Med Imaging. 2021 Oct 2;21(1):142. doi: 10.1186/s12880-021-00671-8.

Machine learning in medicine: a practical introduction to natural language processing.

BMC Med Res Methodol. 2021 Jul 31;21(1):158. doi: 10.1186/s12874-021-01347-1.

Applying natural language processing and machine learning techniques to patient experience feedback: a systematic review.

BMJ Health Care Inform. 2021 Mar;28(1). doi: 10.1136/bmjhci-2020-100262.

Exploration of text matching methods in Chinese disease Q&A systems: A method using ensemble based on BERT and boosted tree models.

J Biomed Inform. 2021 Mar;115:103683. doi: 10.1016/j.jbi.2021.103683. Epub 2021 Jan 20.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于机器学习模型集成与 BERT 语言模型的脑 CT 报告文本描述分析用于判断颅内出血的比较研究

Comparison of an Ensemble of Machine Learning Models and the BERT Language Model for Analysis of Text Descriptions of Brain CT Reports to Determine the Presence of Intracranial Hemorrhage.

机构信息

出版信息

UNLABELLED

MATERIALS AND METHODS

RESULTS

CONCLUSION

未加标签

材料和方法

结果

结论

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献