Shi Liyu, Li Xiaoyan, Hu Weiming, Chen Haoyuan, Chen Jing, Fan Zizhen, Gao Minghe, Jing Yujie, Lu Guotao, Ma Deguo, Ma Zhiyu, Meng Qingtao, Tang Dechao, Sun Hongzan, Grzegorzek Marcin, Qi Shouliang, Teng Yueyang, Li Chen
Microscopic Image and Medical Image Analysis Group, College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, China.
Department of Pathology, Cancer Hospital of China Medical University, Liaoning Cancer Hospital and Institute, Shengyang, China.
Front Med (Lausanne). 2023 Jan 24;10:1114673. doi: 10.3389/fmed.2023.1114673. eCollection 2023.
Colorectal cancer is a common fatal malignancy, the fourth most common cancer in men, and the third most common cancer in women worldwide. Timely detection of cancer in its early stages is essential for treating the disease. Currently, there is a lack of datasets for histopathological image segmentation of colorectal cancer, which often hampers the assessment accuracy when computer technology is used to aid in diagnosis.
This present study provided a new publicly available (EBHI-Seg). To demonstrate the validity and extensiveness of EBHI-Seg, the experimental results for EBHI-Seg are evaluated using classical machine learning methods and deep learning methods.
The experimental results showed that deep learning methods had a better image segmentation performance when utilizing EBHI-Seg. The maximum accuracy of the Dice evaluation metric for the classical machine learning method is 0.948, while the Dice evaluation metric for the deep learning method is 0.965.
This publicly available dataset contained 4,456 images of six types of tumor differentiation stages and the corresponding ground truth images. The dataset can provide researchers with new segmentation algorithms for medical diagnosis of colorectal cancer, which can be used in the clinical setting to help doctors and patients. EBHI-Seg is publicly available at: https://figshare.com/articles/dataset/EBHI-SEG/21540159/1.
结直肠癌是一种常见的致命恶性肿瘤,在全球男性中是第四大常见癌症,在女性中是第三大常见癌症。早期及时检测癌症对于治疗该疾病至关重要。目前,缺乏用于结直肠癌组织病理学图像分割的数据集,这在使用计算机技术辅助诊断时常常会妨碍评估准确性。
本研究提供了一个新的公开可用数据集(EBHI-Seg)。为了证明EBHI-Seg的有效性和广泛性,使用经典机器学习方法和深度学习方法对EBHI-Seg的实验结果进行评估。
实验结果表明,在利用EBHI-Seg时,深度学习方法具有更好的图像分割性能。经典机器学习方法的Dice评估指标的最大准确率为0.948,而深度学习方法的Dice评估指标为0.965。
这个公开可用的数据集包含六种肿瘤分化阶段的4456张图像以及相应的真值图像。该数据集可为研究人员提供用于结直肠癌医学诊断的新分割算法,可在临床环境中用于帮助医生和患者。EBHI-Seg可在以下网址公开获取:https://figshare.com/articles/dataset/EBHI-SEG/21540159/1 。