Lou Shenghan, Ji Jianxin, Li Huiying, Zhang Xuan, Jiang Yang, Hua Menglei, Chen Kexin, Ge Kaiyuan, Zhang Qi, Wang Liuying, Han Peng, Cao Lei
Department of Oncology Surgery, Harbin Medical University Cancer Hospital, No.150 Haping Road, Harbin, Heilongjiang, 150081, China.
Department of Biostatistics, School of Public Health, Harbin Medical University, Harbin, 150081, China.
Sci Data. 2025 Jan 22;12(1):138. doi: 10.1038/s41597-025-04489-9.
Gastric cancer (GC) is the third leading cause of cancer death worldwide. Its clinical course varies considerably due to the highly heterogeneous tumour microenvironment (TME). Decomposing the complex TME from histological images into its constituent parts is crucial for evaluating its patterns and enhancing GC therapies. Although various deep learning methods were developed in medical field, their applications on this task are hindered by the lack of well-annotated histological images of GC. Through this work, we seek to provide a large database of histological images of GC completely annotated for 8 tissue classes in TME. The dataset consists of nearly 31 K histological images from 300 whole slide images. Additionally, we explained two deep learning models used as validation examples using this dataset.
胃癌(GC)是全球癌症死亡的第三大主要原因。由于肿瘤微环境(TME)高度异质性,其临床病程差异很大。将组织学图像中的复杂TME分解为其组成部分对于评估其模式和加强GC治疗至关重要。尽管医学领域开发了各种深度学习方法,但由于缺乏注释良好的GC组织学图像,它们在此任务上的应用受到阻碍。通过这项工作,我们试图提供一个TME中8种组织类型完全注释的GC组织学图像大型数据库。该数据集由来自300张全切片图像的近31000张组织学图像组成。此外,我们使用该数据集解释了用作验证示例的两种深度学习模型。