Kim Kangsan, Kim Kwang Seok, Jang Won Il, Jang Seongjae, Hwang Gil Tae, Woo Sang-Keun
Division of Applied RI, Korea Institute of Radiological and Medical Sciences, Seoul 01812, Republic of Korea.
Department of Radiation Oncology, Korea Institute of Radiological and Medical Sciences, Seoul 01812, Republic of Korea.
Diagnostics (Basel). 2023 Oct 12;13(20):3191. doi: 10.3390/diagnostics13203191.
Dicentric chromosome assay (DCA) is one of the cytogenetic dosimetry methods where the absorbed dose is estimated by counting the number of dicentric chromosomes, which is a major radiation-induced change in DNA. However, DCA is a time-consuming task and requires technical expertise. In this study, a neural network was applied for automating the DCA. We used YOLOv5, a one-stage detection algorithm, to mitigate these limitations by automating the estimation of the number of dicentric chromosomes in chromosome metaphase images. YOLOv5 was pretrained on common object datasets. For training, 887 augmented chromosome images were used. We evaluated the model using validation and test datasets with 380 and 300 images, respectively. With pretrained parameters, the trained model detected chromosomes in the images with a maximum F1 score of 0.94 and a mean average precision (mAP) of 0.961. Conversely, when the model was randomly initialized, the training performance decreased, with a maximum F1 score and mAP of 0.82 and 0.873%, respectively. These results confirm that the model could effectively detect dicentric chromosomes in an image. Consequently, automatic DCA is expected to be conducted based on deep learning for object detection, requiring a relatively small amount of chromosome data for training using the pretrained network.
双着丝粒染色体分析(DCA)是一种细胞遗传学剂量测定方法,通过计数双着丝粒染色体的数量来估计吸收剂量,双着丝粒染色体是DNA中主要的辐射诱导变化。然而,DCA是一项耗时的任务,需要技术专长。在本研究中,应用神经网络实现DCA的自动化。我们使用单阶段检测算法YOLOv5,通过自动估计染色体中期图像中双着丝粒染色体的数量来减轻这些限制。YOLOv5在常见物体数据集上进行了预训练。为了进行训练,使用了887张增强染色体图像。我们分别使用包含380张和300张图像的验证集和测试集对模型进行评估。利用预训练参数,训练后的模型在图像中检测染色体时,最大F1分数为0.94,平均精度均值(mAP)为0.961。相反,当模型随机初始化时,训练性能下降,最大F1分数和mAP分别为0.82和0.873%。这些结果证实该模型能够有效地检测图像中的双着丝粒染色体。因此,预计基于深度学习的目标检测来进行自动DCA,使用预训练网络进行训练时需要相对少量的染色体数据。