Institute of Fundamental Technological Research, Polish Academy of Sciences, Pawinskiego 5B, 02-106, Warsaw, Poland.
Maria Sklodowska-Curie National Institute of Oncology - National Research Institute Branch in Krakow ul, Garncarska 11, 31-115, Kraków, Poland.
Sci Data. 2024 Jan 31;11(1):148. doi: 10.1038/s41597-024-02984-z.
A new detailed dataset of breast ultrasound scans (BrEaST) containing images of benign and malignant lesions as well as normal tissue examples, is presented. The dataset consists of 256 breast scans collected from 256 patients. Each scan was manually annotated and labeled by a radiologist experienced in breast ultrasound examination. In particular, each tumor was identified in the image using a freehand annotation and labeled according to BIRADS features and lexicon. The histopathological classification of the tumor was also provided for patients who underwent a biopsy. The BrEaST dataset is the first breast ultrasound dataset containing patient-level labels, image-level annotations, and tumor-level labels with all cases confirmed by follow-up care or core needle biopsy result. To enable research into breast disease detection, tumor segmentation and classification, the BrEaST dataset is made publicly available with the CC-BY 4.0 license.
一个新的详细的乳房超声扫描数据集(BrEaST),包含良性和恶性病变以及正常组织的图像,被呈现出来。该数据集由 256 名患者的 256 次乳房扫描组成。每次扫描都由一位经验丰富的乳房超声检查放射科医生进行手动注释和标记。特别是,使用徒手注释在图像中识别每个肿瘤,并根据 BIRADS 特征和词汇进行标记。对接受活检的患者,还提供了肿瘤的组织病理学分类。BrEaST 数据集是第一个包含患者级别标签、图像级别注释和肿瘤级别标签的乳房超声数据集,所有病例都通过随访护理或核心针活检结果得到证实。为了促进乳房疾病检测、肿瘤分割和分类的研究,该数据集以 CC-BY 4.0 许可证公开发布。