Ye M H, Chen W Y, Cai B J, Jin C H, He X L
Department of Pathology, Hangzhou Medical College Zhejiang Provincial People's Hospital, Hangzhou 310014, China.
Zhejiang Tonghuashun Intelligent Technology Co., Ltd, Hangzhou 311100, China.
Zhonghua Bing Li Xue Za Zhi. 2021 Apr 8;50(4):358-362. doi: 10.3760/cma.j.cn112151-20200802-00613.
To develop a convolutional neural network based model for assisting pathological diagnoses on thyroid liquid-based cytology specimens. Seven-hundred thyroid TCT slides were collected, scanned for whole slide imaging (WSI), and divided into training and test sets after labeling the correct diagnosis (benign versus malignant). The extracted regions of interest after noise filtering were cropped into pieces of 512 × 512 patch on 10 × and 40 × magnifications, respectively. A classification model was constructed using deeply learning algorithms, and applied to the training set, then automatically tuned in the test set. After data enhancement and parameters optimization, accuracy, sensitivity, specificity, positive predictive value and negative predictive value of the model were calculated. s The training set with 560 WSI contained 4 926 cell clusters (11 164 patches), while the test set with 140 WSI contained 977 cell clusters (1 402 patches). YOLO network was selected to establish a detection model, and ResNet50 was used as a classification model. With 40 epochs training, results from 10× magnifications showed an accuracy of 90.01%, sensitivity of 89.31%, specificity of 92.51%, positive predictive value of 97.70% and negative predictive value of 70.82%. The area under curve was 0.97. The average diagnostic time was less than 1 second. Although the model for data of 40× magnifications was very sensitive (98.72%), but its specificity was poor, suggesting that the model was more reliable at 10× magnification. The performance of a deep-learning based model is equivalent to pathologists' diagnostic performance, but its efficiency is far beyond. The model can greatly improve consistency and efficiency, and reduce the missed diagnosis rate. In the future, larger studies should have more morphology diversity, improve model's accuracy and eventually develop a model for direct clinical use.
开发一种基于卷积神经网络的模型,用于辅助甲状腺液基细胞学标本的病理诊断。收集了700张甲状腺TCT玻片,进行全玻片成像(WSI)扫描,并在标记正确诊断结果(良性与恶性)后分为训练集和测试集。经过噪声过滤后提取的感兴趣区域分别在10倍和40倍放大倍数下裁剪成512×512大小的图像块。使用深度学习算法构建分类模型,并应用于训练集,然后在测试集中进行自动调整。经过数据增强和参数优化后,计算模型的准确率、灵敏度、特异性、阳性预测值和阴性预测值。训练集包含560张WSI,有4926个细胞簇(11164个图像块),而测试集包含140张WSI,有977个细胞簇(1402个图像块)。选择YOLO网络建立检测模型,使用ResNet50作为分类模型。经过40个轮次的训练,10倍放大倍数下的结果显示准确率为90.01%,灵敏度为89.31%,特异性为92.51%,阳性预测值为97.70%,阴性预测值为70.82%。曲线下面积为0.97。平均诊断时间不到1秒。虽然40倍放大倍数数据的模型非常敏感(98.72%),但其特异性较差,这表明该模型在10倍放大倍数下更可靠。基于深度学习的模型性能与病理学家的诊断性能相当,但其效率远高于病理学家。该模型可以大大提高一致性和效率,并降低漏诊率。未来,更大规模的研究应具有更多的形态学多样性,提高模型的准确性,并最终开发出可直接用于临床的模型。