Zhang Xuan, Ji Jianxin, Zhang Qi, Zheng Xiaohan, Ge Kaiyuan, Hua Menglei, Cao Lei, Wang Liuying
Department of Biostatistics, School of Public Health, Harbin Medical University, Harbin, 150081, China.
Department of Health Management, Harbin Medical University, Harbin, 150081, China.
Sci Data. 2025 Jan 7;12(1):23. doi: 10.1038/s41597-025-04374-5.
Accurate detection of abnormal cervical cells in cervical cancer screening increases the chances of timely treatment. The vigorous development of deep learning methods has established a new ecosystem for cervical cancer screening, which has been proven to effectively improve efficiency and accuracy of cell detection in many studies. Although many contributing studies have been conducted, limited public datasets and time-consuming collection efforts may hinder the generalization performance of those advanced models and restrict further research. Through this work, we seek to provide a large dataset of cervical cytology images with exhaustive annotations of abnormal cervical cells. The dataset consists of 8,037 images derived from 129 scanned Thinprep cytologic test (TCT) slide images. Furthermore, we performed evaluation experiments to demonstrate the performance of representative models trained on our dataset in abnormal cells detection.
在宫颈癌筛查中准确检测异常宫颈细胞可增加及时治疗的机会。深度学习方法的蓬勃发展为宫颈癌筛查建立了新的生态系统,许多研究已证明该系统能有效提高细胞检测的效率和准确性。尽管已经开展了许多有价值的研究,但有限的公共数据集和耗时的数据收集工作可能会阻碍这些先进模型的泛化性能,并限制进一步的研究。通过这项工作,我们旨在提供一个包含异常宫颈细胞详尽注释的宫颈细胞学图像大型数据集。该数据集由来自129张扫描的薄层液基细胞学检测(TCT)玻片图像的8037张图像组成。此外,我们进行了评估实验,以证明在我们的数据集上训练的代表性模型在异常细胞检测中的性能。