Sanyal Parikshit, Barui Sanghita, Deb Prabal, Sharma Harish Chander
Department of Pathology, Military Hospital Jalandhar Cantt, Punjab, India.
Department of Pathology, Command Hospital, Alipore, Kolkata, West Bengal, India.
J Cytol. 2019 Jul-Sep;36(3):146-151. doi: 10.4103/JOC.JOC_201_18.
Cervical cancer is the second most common cancer in women. The liquid based cervical cytology (LBCC) is a useful tool of choice for screening cervical cancer.
To train a convolutional neural network (CNN) to identify abnormal foci from LBCC smears.
We have chosen retrospective study design from archived smears of patients undergoing screening from cervical cancer by LBCC smears.
2816 images, each of 256 × 256 pixels, were prepared from microphotographs of these LBCC smears, which included 816 "abnormal" foci (low grade or high grade squamous intraepithelial lesion) and 2000 'normal' foci (benign epithelial cells and reactive changes). The images were split into three sets, Training, Testing, and Evaluation. A convolutional neural network (CNN) was developed with the python programming language. The CNN was trained with the Training dataset; performance was assayed concurrently with the Testing dataset. Two CNN models were developed, after 20 and 10 epochs of training, respectively. The models were then run on the Evaluation dataset.
A contingency table was prepared from the original image labels and the labels predicted by the CNN.
Combined assessment of both models yielded a sensitivity of 95.63% in detecting abnormal foci, with 79.85% specificity. The negative predictive value was high (99.19%), suggesting potential utility in screening. False positives due to overlapping cells, neutrophils, and debris was the principal difficulty met during evaluation.
The CNN shows promise as a screening tool; however, for its use in confirmatory diagnosis, further training with a more diverse dataset will be required.
宫颈癌是女性中第二常见的癌症。液基宫颈细胞学检查(LBCC)是筛查宫颈癌的一种有用的首选工具。
训练一个卷积神经网络(CNN)以从LBCC涂片识别异常病灶。
我们从通过LBCC涂片进行宫颈癌筛查的患者存档涂片中选择了回顾性研究设计。
从这些LBCC涂片的显微照片中制备了2816张图像,每张图像为256×256像素,其中包括816个“异常”病灶(低级别或高级别鳞状上皮内病变)和2000个“正常”病灶(良性上皮细胞和反应性改变)。这些图像被分为三组:训练组、测试组和评估组。使用Python编程语言开发了一个卷积神经网络(CNN)。该CNN使用训练数据集进行训练;同时使用测试数据集评估性能。分别经过20个和10个训练轮次后开发了两个CNN模型。然后在评估数据集上运行这些模型。
根据原始图像标签和CNN预测的标签编制了列联表。
两个模型的综合评估在检测异常病灶时的灵敏度为95.63%,特异性为79.85%。阴性预测值很高(99.19%),表明在筛查中具有潜在用途。评估过程中遇到的主要困难是由于细胞重叠、中性粒细胞和碎片导致的假阳性。
CNN显示出作为一种筛查工具的前景;然而,要将其用于确诊诊断,将需要使用更多样化的数据集进行进一步训练。