Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden.
Institute of Biomedicine, University of Turku, Turku, Finland.
Sci Data. 2023 Aug 24;10(1):562. doi: 10.1038/s41597-023-02422-6.
The analysis of FFPE tissue sections stained with haematoxylin and eosin (H&E) or immunohistochemistry (IHC) is essential for the pathologic assessment of surgically resected breast cancer specimens. IHC staining has been broadly adopted into diagnostic guidelines and routine workflows to assess the status of several established biomarkers, including ER, PGR, HER2 and KI67. Biomarker assessment can also be facilitated by computational pathology image analysis methods, which have made numerous substantial advances recently, often based on publicly available whole slide image (WSI) data sets. However, the field is still considerably limited by the sparsity of public data sets. In particular, there are no large, high quality publicly available data sets with WSIs of matching IHC and H&E-stained tissue sections from the same tumour. Here, we publish the currently largest publicly available data set of WSIs of tissue sections from surgical resection specimens from female primary breast cancer patients with matched WSIs of corresponding H&E and IHC-stained tissue, consisting of 4,212 WSIs from 1,153 patients.
对经苏木精和伊红(H&E)或免疫组织化学(IHC)染色的 FFPE 组织切片进行分析,对于外科切除的乳腺癌标本的病理评估至关重要。IHC 染色已广泛纳入诊断指南和常规工作流程,以评估几个已确立的生物标志物的状态,包括 ER、PGR、HER2 和 KI67。生物标志物评估也可以通过计算病理学图像分析方法来辅助,这些方法最近取得了许多实质性进展,通常基于公开的全切片图像(WSI)数据集。然而,该领域仍然受到公共数据集稀缺的严重限制。特别是,没有来自同一肿瘤的匹配 IHC 和 H&E 染色组织切片的大型、高质量的公共可用 WSI 数据集。在这里,我们发布了目前最大的、公开可用的来自女性原发性乳腺癌患者手术切除标本的组织切片的 WSI 数据集,包含 1153 名患者的 4212 个 WSI。