Karasu Benyes Yasmin, Welch E Celeste, Singhal Abhinav, Ou Joyce, Tripathi Anubhav
Center for Biomedical Engineering, School of Engineering, Brown University, Providence, RI 02912, USA.
Department of Computer Science and Engineering, I.I.T. Delhi, Hauz Khas, New Delhi 110016, India.
Diagnostics (Basel). 2022 Jul 29;12(8):1838. doi: 10.3390/diagnostics12081838.
Routine Pap smears can facilitate early detection of cervical cancer and improve patient outcomes. The objective of this work is to develop an automated, clinically viable deep neural network for the multi-class Bethesda System diagnosis of multi-cell images in Liquid Pap smear samples. 8 deep learning models were trained on a publicly available multi-class SurePath preparation dataset. This included the 5 best-performing transfer learning models, an ensemble, a novel convolutional neural network (CNN), and a CNN + autoencoder (AE). Additionally, each model was tested on a novel ThinPrep Pap dataset to determine model generalizability across different liquid Pap preparation methods with and without Deep CORAL domain adaptation. All models achieved accuracies >90% when classifying SurePath images. The AE CNN model, 99.80% smaller than the average transfer model, maintained an accuracy of 96.54%. During consecutive training attempts, individual transfer models had high variability in performance, whereas the CNN, AE CNN, and ensemble did not. ThinPrep Pap classification accuracies were notably lower but increased with domain adaptation, with ResNet101 achieving the highest accuracy at 92.65%. This indicates a potential area for future improvement: development of a globally relevant model that can function across different slide preparation methods.
常规巴氏涂片检查有助于早期发现宫颈癌并改善患者预后。这项工作的目标是开发一种自动化的、临床可行的深度神经网络,用于对液基巴氏涂片样本中的多细胞图像进行多类别贝塞斯达系统诊断。在一个公开可用的多类别SurePath制备数据集上训练了8种深度学习模型。这包括5个性能最佳的迁移学习模型、一个集成模型、一个新型卷积神经网络(CNN)和一个CNN+自动编码器(AE)。此外,每个模型都在一个新型的ThinPrep巴氏数据集上进行测试,以确定在有无深度CORAL域适应的情况下,模型在不同液基巴氏制备方法中的通用性。在对SurePath图像进行分类时,所有模型的准确率均超过90%。AE CNN模型比平均迁移模型小99.80%,其准确率保持在96.54%。在连续的训练尝试中,单个迁移模型的性能具有很高的变异性,而CNN、AE CNN和集成模型则没有。ThinPrep巴氏分类准确率明显较低,但随着域适应而提高,ResNet101的准确率最高,为92.65%。这表明了未来改进的一个潜在领域:开发一种能够在不同玻片制备方法中发挥作用的全球相关模型。