BUSClean: Open-source software for breast ultrasound image pre-processing and knowledge extraction for medical AI.

Authors

Bunnell Arianna, Hung Kailee, Shepherd John A, Sadowski Peter

Affiliations

Department of Information and Computer Sciences, University of Hawai'i at Mānoa, Honolulu, HI, United States of America.

University of Hawai'i Cancer Center, Honolulu, HI, United States of America.

Publication information

PLoS One. 2024 Dec 11;19(12):e0315434. doi: 10.1371/journal.pone.0315434. eCollection 2024.

DOI:10.1371/journal.pone.0315434

PMID:39661621

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11633980/

Abstract

Development of artificial intelligence (AI) for medical imaging demands curation and cleaning of large-scale clinical datasets comprising hundreds of thousands of images. Some modalities, such as mammography, contain highly standardized imaging. In contrast, breast ultrasound imaging (BUS) can contain many irregularities not indicated by scan metadata, such as enhanced scan modes, sonographer annotations, or additional views. We present an open-source software solution for automatically processing clinical BUS datasets. The algorithm performs BUS scan filtering (flagging of invalid and non-B-mode scans), cleaning (dual-view scan detection, scan area cropping, and caliper detection), and knowledge extraction (BI-RADS Labeling and Measurement fields) from sonographer annotations. Its modular design enables users to adapt it to new settings. Experiments on an internal testing dataset of 430 clinical BUS images achieve >95% sensitivity and >98% specificity in detecting every type of text annotation, and >98% sensitivity and specificity in detecting scans with blood flow highlighting, alternative scan modes, or invalid scans. A case study on a completely external, public dataset of BUS scans found that BUSClean identified text annotations and scans with blood flow highlighting with 88.6% and 90.9% sensitivity and 98.3% and 99.9% specificity, respectively. Adapting the lesion caliper detection method to account for a type of caliper specific to the case study demonstrates the intended use of BUSClean in new data distributions and improved lesion caliper detection from 43.3% sensitivity and 93.3% specificity out-of-the-box to 92.1% sensitivity and 92.3% specificity. Source code, example notebooks, and sample data are available at https://github.com/hawaii-ai/bus-cleaning.
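
To make the filtering and evaluation ideas in the abstract concrete, the following is a minimal sketch in plain Python. It is not BUSClean's implementation and does not use its API: the function names (`color_fraction`, `flag_color_scan`, `sensitivity_specificity`), the pixel representation, and both thresholds are assumptions chosen for illustration. The sketch flags a scan as likely containing blood flow highlighting when enough pixels depart from grayscale, then scores the flags against ground-truth labels using the usual definitions of sensitivity and specificity.

```python
from typing import Sequence, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each channel in 0-255


def color_fraction(pixels: Sequence[Pixel], tolerance: int = 20) -> float:
    """Return the fraction of pixels whose channels differ by more than `tolerance`.

    A B-mode scan is grayscale, so R, G, and B are nearly equal for every pixel.
    The tolerance value is an illustrative assumption, not a published setting.
    """
    if not pixels:
        return 0.0
    colored = sum(1 for r, g, b in pixels if max(r, g, b) - min(r, g, b) > tolerance)
    return colored / len(pixels)


def flag_color_scan(pixels: Sequence[Pixel], tolerance: int = 20,
                    min_fraction: float = 0.01) -> bool:
    """Flag a scan when at least `min_fraction` of its pixels are colored (hypothetical rule)."""
    return color_fraction(pixels, tolerance) >= min_fraction


def sensitivity_specificity(truth: Sequence[bool],
                            predicted: Sequence[bool]) -> Tuple[float, float]:
    """Compute (sensitivity, specificity) from paired boolean labels."""
    tp = sum(1 for t, p in zip(truth, predicted) if t and p)
    fn = sum(1 for t, p in zip(truth, predicted) if t and not p)
    tn = sum(1 for t, p in zip(truth, predicted) if not t and not p)
    fp = sum(1 for t, p in zip(truth, predicted) if not t and p)
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Synthetic toy data: two grayscale scans and two with a small red overlay.
gray_scan = [(90, 90, 90)] * 100
color_scan = [(90, 90, 90)] * 95 + [(200, 30, 30)] * 5
scans = [gray_scan, color_scan, gray_scan, color_scan]
truth = [False, True, False, True]

predicted = [flag_color_scan(scan) for scan in scans]
print(sensitivity_specificity(truth, predicted))  # (1.0, 1.0)
```

The toy data is synthetic and trivially separable; on real clinical scans, colored interface elements outside the scan area would likely need to be cropped before a rule like this is applied.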

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22f8/11633980/22f265460960/pone.0315434.g001.jpg
