• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TECRR:一个基于机器学习、深度学习和大语言模型基线的用于 BI-RADS 分类的放射学报告基准数据集。

TECRR: a benchmark dataset of radiological reports for BI-RADS classification with machine learning, deep learning, and large language model baselines.

机构信息

School of Engineering and Sciences, Tecnológico de Monterrey, Monterrey, 64849, Nuevo Leon, Mexico.

School of Computing, Macquarie University, Sydney, 2109, NSW, Australia.

出版信息

BMC Med Inform Decis Mak. 2024 Oct 24;24(1):310. doi: 10.1186/s12911-024-02717-7.

DOI:10.1186/s12911-024-02717-7
PMID:39444035
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11515610/
Abstract

BACKGROUND

Recently, machine learning (ML), deep learning (DL), and natural language processing (NLP) have provided promising results in the free-form radiological reports' classification in the respective medical domain. In order to classify radiological reports properly, a high-quality annotated and curated dataset is required. Currently, no publicly available breast imaging-based radiological dataset exists for the classification of Breast Imaging Reporting and Data System (BI-RADS) categories and breast density scores, as characterized by the American College of Radiology (ACR). To tackle this problem, we construct and annotate a breast imaging-based radiological reports dataset and its benchmark results. The dataset was originally in Spanish. Board-certified radiologists collected and annotated it according to the BI-RADS lexicon and categories at the Breast Radiology department, TecSalud Hospitals Monterrey, Mexico. Initially, it was translated into English language using Google Translate. Afterwards, it was preprocessed by removing duplicates and missing values. After preprocessing, the final dataset consists of 5046 unique reports from 5046 patients with an average age of 53 years and 100% women. Furthermore, we used word-level NLP-based embedding techniques, term frequency-inverse document frequency (TF-IDF) and word2vec to extract semantic and syntactic information. We also compared the performance of ML, DL and large language models (LLMs) classifiers for BI-RADS category classification.

RESULTS

The final breast imaging-based radiological reports dataset contains 5046 unique reports. We compared K-Nearest Neighbour (KNN), Support Vector Machine (SVM), Naive Bayes (NB), Random Forest (RF), Adaptive Boosting (AdaBoost), Gradient-Boosting (GB), Extreme Gradient Boosting (XGB), Long Short-Term Memory (LSTM), Bidirectional Encoder Representations from Transformers (BERT) and Biomedical Generative Pre-trained Transformer (BioGPT) classifiers. It is observed that the BioGPT classifier with preprocessed data performed 6% better with a mean sensitivity of 0.60 (95% confidence interval (CI), 0.391-0.812) compared to the second best performing classifier BERT, which achieved mean sensitivity of 0.54 (95% CI, 0.477-0.607).

CONCLUSION

In this work, we propose a curated and annotated benchmark dataset that can be used for BI-RADS and breast density category classification. We also provide baseline results of most ML, DL and LLMs models for BI-RADS classification that can be used as a starting point for future investigation. The main objective of this investigation is to provide a repository for the investigators who wish to enter the field to push the boundaries further.

摘要

背景

最近,机器学习(ML)、深度学习(DL)和自然语言处理(NLP)在各自的医学领域中为自由格式放射报告的分类提供了有前景的结果。为了正确地对放射报告进行分类,需要高质量的标注和整理数据集。目前,还没有可用于 BI-RADS 类别和乳房密度评分分类的基于乳房成像的放射报告公共数据集,这些数据集由美国放射学院(ACR)定义。为了解决这个问题,我们构建并标注了一个基于乳房成像的放射报告数据集及其基准结果。该数据集最初是西班牙语的。在墨西哥蒙特雷的 TecSalud 医院的乳腺放射科,经过董事会认证的放射科医生根据 BI-RADS 词汇表和类别进行了收集和标注。最初,它使用谷歌翻译翻译成英语。之后,通过删除重复项和缺失值对其进行预处理。预处理后,最终数据集包含 5046 名来自 5046 名患者的独特报告,平均年龄为 53 岁,100%为女性。此外,我们使用基于词级的 NLP 嵌入技术、词频-逆文档频率(TF-IDF)和 word2vec 来提取语义和句法信息。我们还比较了 ML、DL 和大型语言模型(LLM)分类器在 BI-RADS 类别分类方面的性能。

结果

最终的基于乳房成像的放射报告数据集包含 5046 个独特的报告。我们比较了 K-最近邻(KNN)、支持向量机(SVM)、朴素贝叶斯(NB)、随机森林(RF)、自适应提升(AdaBoost)、梯度提升(GB)、极端梯度提升(XGB)、长短期记忆(LSTM)、双向转换器编码器表示(BERT)和生物生成预训练转换器(BioGPT)分类器。观察到,与第二好的分类器 BERT 相比,经过预处理的数据的 BioGPT 分类器的平均灵敏度提高了 6%,达到 0.60(95%置信区间(CI),0.391-0.812),而 BERT 的平均灵敏度为 0.54(95%CI,0.477-0.607)。

结论

在这项工作中,我们提出了一个经过整理和标注的基准数据集,可用于 BI-RADS 和乳房密度分类。我们还提供了最 ML、DL 和 LLM 模型用于 BI-RADS 分类的基准结果,可作为未来研究的起点。这项研究的主要目的是为希望进入该领域的研究人员提供一个存储库,以进一步推动研究的发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e570/11515610/a7b79f5ee11a/12911_2024_2717_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e570/11515610/0ebbf92f7e75/12911_2024_2717_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e570/11515610/fe0999ed418e/12911_2024_2717_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e570/11515610/20c78d5f9b30/12911_2024_2717_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e570/11515610/a7b79f5ee11a/12911_2024_2717_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e570/11515610/0ebbf92f7e75/12911_2024_2717_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e570/11515610/fe0999ed418e/12911_2024_2717_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e570/11515610/20c78d5f9b30/12911_2024_2717_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e570/11515610/a7b79f5ee11a/12911_2024_2717_Fig4_HTML.jpg

相似文献

1
TECRR: a benchmark dataset of radiological reports for BI-RADS classification with machine learning, deep learning, and large language model baselines.TECRR:一个基于机器学习、深度学习和大语言模型基线的用于 BI-RADS 分类的放射学报告基准数据集。
BMC Med Inform Decis Mak. 2024 Oct 24;24(1):310. doi: 10.1186/s12911-024-02717-7.
2
Automated annotation and classification of BI-RADS assessment from radiology reports.从放射学报告中自动标注和分类乳腺影像报告和数据系统(BI-RADS)评估结果
J Biomed Inform. 2017 May;69:177-187. doi: 10.1016/j.jbi.2017.04.011. Epub 2017 Apr 18.
3
Automatic classification and prioritisation of actionable BI-RADS categories using natural language processing models.使用自然语言处理模型对可操作 BI-RADS 类别进行自动分类和优先级排序。
Clin Radiol. 2024 Jan;79(1):e1-e7. doi: 10.1016/j.crad.2023.09.009. Epub 2023 Sep 27.
4
Extraction of BI-RADS findings from breast ultrasound reports in Chinese using deep learning approaches.使用深度学习方法从中文乳腺超声报告中提取 BI-RADS 结果。
Int J Med Inform. 2018 Nov;119:17-21. doi: 10.1016/j.ijmedinf.2018.08.009. Epub 2018 Aug 18.
5
Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
6
BI-RADS Category Assignments by GPT-3.5, GPT-4, and Google Bard: A Multilanguage Study.BI-RADS 类别分配由 GPT-3.5、GPT-4 和谷歌巴德完成:一项多语言研究。
Radiology. 2024 Apr;311(1):e232133. doi: 10.1148/radiol.232133.
7
Comparison of an Ensemble of Machine Learning Models and the BERT Language Model for Analysis of Text Descriptions of Brain CT Reports to Determine the Presence of Intracranial Hemorrhage.基于机器学习模型集成与 BERT 语言模型的脑 CT 报告文本描述分析用于判断颅内出血的比较研究
Sovrem Tekhnologii Med. 2024;16(1):27-34. doi: 10.17691/stm2024.16.1.03. Epub 2024 Feb 28.
8
Automatic detection of actionable radiology reports using bidirectional encoder representations from transformers.使用来自 Transformer 的双向编码器表示自动检测可操作的放射学报告。
BMC Med Inform Decis Mak. 2021 Sep 11;21(1):262. doi: 10.1186/s12911-021-01623-6.
9
Automated extraction of BI-RADS final assessment categories from radiology reports with natural language processing.基于自然语言处理的放射学报告中 BI-RADS 最终评估类别自动提取
J Digit Imaging. 2013 Oct;26(5):989-94. doi: 10.1007/s10278-013-9616-5.
10
Automatic inference of BI-RADS final assessment categories from narrative mammography report findings.从叙事性乳腺 X 线报告结果自动推断 BI-RADS 最终评估类别。
J Biomed Inform. 2019 Apr;92:103137. doi: 10.1016/j.jbi.2019.103137. Epub 2019 Feb 23.

引用本文的文献

1
Technical requirements and optimization strategies for home-based teleradiology workstations: a review article.居家远程放射学工作站的技术要求与优化策略:一篇综述文章
Insights Imaging. 2025 Sep 18;16(1):198. doi: 10.1186/s13244-025-02081-8.
2
Study on postoperative survival prediction model for non-small cell lung cancer: application of radiomics technology workflow based on multi-organ imaging features and various machine learning algorithms.非小细胞肺癌术后生存预测模型的研究:基于多器官影像特征和多种机器学习算法的放射组学技术流程应用
Front Med (Lausanne). 2025 Feb 5;12:1517765. doi: 10.3389/fmed.2025.1517765. eCollection 2025.

本文引用的文献

1
BioGPT: generative pre-trained transformer for biomedical text generation and mining.BioGPT:用于生物医学文本生成和挖掘的生成式预训练转换器。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac409.
2
VinDr-CXR: An open dataset of chest X-rays with radiologist's annotations.VinDr-CXR:一个带有放射科医生标注的胸部 X 光数据集。
Sci Data. 2022 Jul 20;9(1):429. doi: 10.1038/s41597-022-01498-w.
3
Diagnostic value of artificial intelligence automatic detection systems for breast BI-RADS 4 nodules.人工智能自动检测系统对乳腺影像报告和数据系统(BI-RADS)4类结节的诊断价值
World J Clin Cases. 2022 Jan 14;10(2):518-527. doi: 10.12998/wjcc.v10.i2.518.
4
A dataset of chest X-ray reports annotated with Spatial Role Labeling annotations.一个带有空间角色标注注释的胸部X光报告数据集。
Data Brief. 2020 Jul 25;32:106056. doi: 10.1016/j.dib.2020.106056. eCollection 2020 Oct.
5
PadChest: A large chest x-ray image dataset with multi-label annotated reports.PadChest:一个大型胸部 X 射线图像数据集,带有多标签注释报告。
Med Image Anal. 2020 Dec;66:101797. doi: 10.1016/j.media.2020.101797. Epub 2020 Aug 20.
6
A New Computer-Aided Diagnosis System with Modified Genetic Feature Selection for BI-RADS Classification of Breast Masses in Mammograms.一种基于改进遗传特征选择的计算机辅助诊断系统,用于乳腺钼靶片中乳腺肿块的 BI-RADS 分类。
Biomed Res Int. 2020 May 11;2020:7695207. doi: 10.1155/2020/7695207. eCollection 2020.
7
MIMIC-CXR, a de-identified publicly available database of chest radiographs with free-text reports.MIMIC-CXR,一个去标识化的、公开可用的、包含自由文本报告的胸部 X 光数据库。
Sci Data. 2019 Dec 12;6(1):317. doi: 10.1038/s41597-019-0322-0.
8
Text mining brain imaging reports.文本挖掘脑成像报告。
J Biomed Semantics. 2019 Nov 12;10(Suppl 1):23. doi: 10.1186/s13326-019-0211-7.
9
A validated natural language processing algorithm for brain imaging phenotypes from radiology reports in UK electronic health records.从英国电子健康记录中的放射学报告中提取脑影像学表型的验证自然语言处理算法。
BMC Med Inform Decis Mak. 2019 Sep 9;19(1):184. doi: 10.1186/s12911-019-0908-7.
10
BI-RADS, C-RADS, CAD-RADS, LI-RADS, Lung-RADS, NI-RADS, O-RADS, PI-RADS, TI-RADS: Reporting and Data Systems.乳腺影像报告和数据系统(BI-RADS)、结肠影像报告和数据系统(C-RADS)、冠状动脉CT血管造影报告和数据系统(CAD-RADS)、肝脏影像报告和数据系统(LI-RADS)、肺部影像报告和数据系统(Lung-RADS)、神经影像报告和数据系统(NI-RADS)、卵巢影像报告和数据系统(O-RADS)、前列腺影像报告和数据系统(PI-RADS)、甲状腺影像报告和数据系统(TI-RADS):报告与数据系统
Radiographics. 2019 Sep-Oct;39(5):1435-1436. doi: 10.1148/rg.2019190087.