• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对乳腺 X 线摄影机器学习数据集的回顾,以及它们对 FAIR 原则的遵守情况和未来展望。

A review of the machine learning datasets in mammography, their adherence to the FAIR principles and the outlook for the future.

机构信息

Alixir Technologies Pty Ltd, Sydney, NSW, Australia.

Australian Artificial Intelligence Institute, University of Technology Sydney, Sydney, NSW, Australia.

出版信息

Sci Data. 2023 Sep 8;10(1):595. doi: 10.1038/s41597-023-02430-6.

DOI:10.1038/s41597-023-02430-6
PMID:37684306
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10491669/
Abstract

The increasing rates of breast cancer, particularly in emerging economies, have led to interest in scalable deep learning-based solutions that improve the accuracy and cost-effectiveness of mammographic screening. However, such tools require large volumes of high-quality training data, which can be challenging to obtain. This paper combines the experience of an AI startup with an analysis of the FAIR principles of the eight available datasets. It demonstrates that the datasets vary considerably, particularly in their interoperability, as each dataset is skewed towards a particular clinical use-case. Additionally, the mix of digital captures and scanned film compounds the problem of variability, along with differences in licensing terms, ease of access, labelling reliability, and file formats. Improving interoperability through adherence to standards such as the BIRADS criteria for labelling and annotation, and a consistent file format, could markedly improve access and use of larger amounts of standardized data. This, in turn, could be increased further by GAN-based synthetic data generation, paving the way towards better health outcomes for breast cancer.

摘要

乳腺癌发病率的不断上升,特别是在新兴经济体中,促使人们对基于深度学习的可扩展解决方案产生了兴趣,这些解决方案可以提高乳房 X 光筛查的准确性和成本效益。然而,此类工具需要大量高质量的训练数据,这在获取方面可能具有挑战性。本文结合了人工智能初创公司的经验以及对八个可用数据集的 FAIR 原则的分析。结果表明,这些数据集差异很大,特别是在互操作性方面,因为每个数据集都偏向于特定的临床用例。此外,数字化采集和扫描胶片的混合增加了可变性的问题,同时还存在许可条款、访问便利性、标签可靠性和文件格式方面的差异。通过遵守标签和注释的 BIRADS 标准以及一致的文件格式等标准来提高互操作性,可以显著改善对更多标准化数据的访问和使用。通过基于 GAN 的合成数据生成,可以进一步增加这种情况,从而为乳腺癌带来更好的健康结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d241/10491669/d8cc49842e63/41597_2023_2430_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d241/10491669/796487d7b2aa/41597_2023_2430_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d241/10491669/d8cc49842e63/41597_2023_2430_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d241/10491669/796487d7b2aa/41597_2023_2430_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d241/10491669/d8cc49842e63/41597_2023_2430_Fig2_HTML.jpg

相似文献

1
A review of the machine learning datasets in mammography, their adherence to the FAIR principles and the outlook for the future.对乳腺 X 线摄影机器学习数据集的回顾,以及它们对 FAIR 原则的遵守情况和未来展望。
Sci Data. 2023 Sep 8;10(1):595. doi: 10.1038/s41597-023-02430-6.
2
3
The effectiveness of internet-based e-learning on clinician behavior and patient outcomes: a systematic review protocol.基于互联网的电子学习对临床医生行为和患者结局的有效性:一项系统评价方案。
JBI Database System Rev Implement Rep. 2015 Jan;13(1):52-64. doi: 10.11124/jbisrir-2015-1919.
4
5
Augmenting Transfer Learning with Feature Extraction Techniques for Limited Breast Imaging Datasets.基于特征提取技术的有限乳腺成像数据集的迁移学习增强。
J Digit Imaging. 2021 Jun;34(3):618-629. doi: 10.1007/s10278-021-00456-z. Epub 2021 May 10.
6
Breast cancer detection using deep learning: Datasets, methods, and challenges ahead.深度学习在乳腺癌检测中的应用:数据集、方法和未来的挑战。
Comput Biol Med. 2022 Oct;149:106073. doi: 10.1016/j.compbiomed.2022.106073. Epub 2022 Aug 31.
7
Deep-Learning-Based Semantic Labeling for 2D Mammography and Comparison of Complexity for Machine Learning Tasks.基于深度学习的 2D 乳腺 X 光图像语义标注及机器学习任务复杂度比较。
J Digit Imaging. 2019 Aug;32(4):565-570. doi: 10.1007/s10278-019-00244-w.
8
Automatic Labeling of Special Diagnostic Mammography Views from Images and DICOM Headers.自动标记图像和 DICOM 头中的特殊诊断乳房 X 光视图。
J Digit Imaging. 2019 Apr;32(2):228-233. doi: 10.1007/s10278-018-0154-z.
9
A deep learning method for classifying mammographic breast density categories.一种用于对乳腺钼靶图像的乳房密度类别进行分类的深度学习方法。
Med Phys. 2018 Jan;45(1):314-321. doi: 10.1002/mp.12683. Epub 2017 Dec 22.
10
Robust breast cancer detection in mammography and digital breast tomosynthesis using an annotation-efficient deep learning approach.利用高效标注的深度学习方法在乳腺 X 线摄影和数字乳腺断层合成术中进行稳健的乳腺癌检测。
Nat Med. 2021 Feb;27(2):244-249. doi: 10.1038/s41591-020-01174-9. Epub 2021 Jan 11.

引用本文的文献

1
Evaluating breast cancer screening performance without registries using medico-administrative data.使用医疗管理数据在无登记处的情况下评估乳腺癌筛查性能。
Sci Rep. 2025 Jul 11;15(1):25096. doi: 10.1038/s41598-025-10115-w.
2
Improving annotation efficiency for fully labeling a breast mass segmentation dataset.提高对乳腺肿块分割数据集进行完全标注的标注效率。
J Med Imaging (Bellingham). 2025 May;12(3):035501. doi: 10.1117/1.JMI.12.3.035501. Epub 2025 May 21.
3
Challenges in Implementing Artificial Intelligence in Breast Cancer Screening Programs: Systematic Review and Framework for Safe Adoption.

本文引用的文献

1
An Online Mammography Database with Biopsy Confirmed Types.在线乳腺 X 线摄影数据库,包含经活检证实的类型。
Sci Data. 2023 Mar 7;10(1):123. doi: 10.1038/s41597-023-02025-1.
2
Automatic mass detection in mammograms using deep convolutional neural networks.使用深度卷积神经网络在乳腺X光片中进行自动肿块检测。
J Med Imaging (Bellingham). 2019 Jul;6(3):031409. doi: 10.1117/1.JMI.6.3.031409. Epub 2019 Feb 20.
3
Multi- class classification of breast cancer abnormalities using Deep Convolutional Neural Network (CNN).使用深度卷积神经网络(CNN)对乳腺癌异常进行多类别分类。
乳腺癌筛查项目中实施人工智能的挑战:系统评价与安全应用框架
J Med Internet Res. 2025 May 15;27:e62941. doi: 10.2196/62941.
4
An open codebase for enhancing transparency in deep learning-based breast cancer diagnosis utilizing CBIS-DDSM data.利用 CBIS-DDSM 数据增强基于深度学习的乳腺癌诊断透明度的开放代码库。
Sci Rep. 2024 Nov 9;14(1):27318. doi: 10.1038/s41598-024-78648-0.
5
Deep Learning in Breast Cancer Imaging: State of the Art and Recent Advancements in Early 2024.乳腺癌成像中的深度学习:2024年初的技术现状与最新进展
Diagnostics (Basel). 2024 Apr 19;14(8):848. doi: 10.3390/diagnostics14080848.
6
An Enhanced LightGBM-Based Breast Cancer Detection Technique Using Mammography Images.一种基于增强型LightGBM的利用乳腺钼靶图像的乳腺癌检测技术。
Diagnostics (Basel). 2024 Jan 22;14(2):227. doi: 10.3390/diagnostics14020227.
PLoS One. 2021 Aug 26;16(8):e0256500. doi: 10.1371/journal.pone.0256500. eCollection 2021.
4
Dataset of breast mammography images with masses.带有肿块的乳腺钼靶图像数据集。
Data Brief. 2020 Jun 25;31:105928. doi: 10.1016/j.dib.2020.105928. eCollection 2020 Aug.
5
Artificial intelligence and convolution neural networks assessing mammographic images: a narrative literature review.人工智能和卷积神经网络评估乳腺 X 光图像:叙事文献综述。
J Med Radiat Sci. 2020 Jun;67(2):134-142. doi: 10.1002/jmrs.385. Epub 2020 Mar 5.
6
International evaluation of an AI system for breast cancer screening.国际乳腺癌筛查人工智能系统评估。
Nature. 2020 Jan;577(7788):89-94. doi: 10.1038/s41586-019-1799-6. Epub 2020 Jan 1.
7
Artificial intelligence in breast imaging.人工智能在乳腺成像中的应用。
Clin Radiol. 2019 May;74(5):357-366. doi: 10.1016/j.crad.2019.02.006. Epub 2019 Mar 18.
8
Deep Convolutional Neural Networks for breast cancer screening.深度学习卷积神经网络在乳腺癌筛查中的应用。
Comput Methods Programs Biomed. 2018 Apr;157:19-30. doi: 10.1016/j.cmpb.2018.01.011. Epub 2018 Jan 11.
9
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
10
The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository.癌症影像档案库(TCIA):维护和运营公共信息知识库。
J Digit Imaging. 2013 Dec;26(6):1045-57. doi: 10.1007/s10278-013-9622-7.