School of Computing and Mathematical Sciences, University of Leicester, Leicester, UK.
Data Sciences and Quantitative Biology, Discovery Sciences, AstraZeneca R&D, Cambridge, UK.
Sci Rep. 2022 May 12;12(1):7894. doi: 10.1038/s41598-022-12099-3.
Cell line authentication is important in the biomedical field to ensure that researchers are not working with misidentified cells. Short tandem repeat is the gold standard method, but has its own limitations, including being expensive and time-consuming. Deep neural networks achieve great success in the analysis of cellular images in a cost-effective way. However, because of the lack of centralized available datasets, whether or not cell line authentication can be replaced or supported by cell image classification is still a question. Moreover, the relationship between the incubation times and cellular images has not been explored in previous studies. In this study, we automated the process of the cell line authentication by using deep learning analysis of brightfield cell line images. We proposed a novel multi-task framework to identify cell lines from cell images and predict the duration of how long cell lines have been incubated simultaneously. Using thirty cell lines' data from the AstraZeneca Cell Bank, we demonstrated that our proposed method can accurately identify cell lines from brightfield images with a 99.8% accuracy and predicts the incubation durations for cell images with the coefficient of determination score of 0.927. Considering that new cell lines are continually added to the AstraZeneca Cell Bank, we integrated the transfer learning technique with the proposed system to deal with data from new cell lines not included in the pre-trained model. Our method achieved excellent performance with a precision of 97.7% and recall of 95.8% in the detection of 14 new cell lines. These results demonstrated that our proposed framework can effectively identify cell lines using brightfield images.
细胞系鉴定在生物医学领域非常重要,可确保研究人员使用的是经过正确鉴定的细胞。短串联重复序列是金标准方法,但也存在自身的局限性,包括昂贵和耗时。深度神经网络以具有成本效益的方式在细胞图像分析中取得了巨大的成功。然而,由于缺乏集中可用的数据集,细胞图像分类是否可以替代或支持细胞系鉴定仍然是一个问题。此外,以前的研究并未探讨孵育时间与细胞图像之间的关系。在这项研究中,我们通过使用深度学习分析明场细胞系图像,实现了细胞系鉴定的自动化过程。我们提出了一种新颖的多任务框架,可以从细胞图像中识别细胞系,并同时预测细胞系孵育的时间。我们使用阿斯利康细胞库的三十种细胞系数据证明,我们提出的方法可以准确地从明场图像中识别细胞系,准确率达到 99.8%,并可以预测细胞图像的孵育时间,决定系数得分为 0.927。考虑到新的细胞系不断添加到阿斯利康细胞库中,我们将迁移学习技术与所提出的系统集成,以处理未包含在预训练模型中的新细胞系的数据。在检测 14 种新细胞系时,我们的方法取得了出色的性能,准确率为 97.7%,召回率为 95.8%。这些结果表明,我们提出的框架可以有效地使用明场图像识别细胞系。