Bu Ran, Xiang Wei, Cao Shitong
Key Laboratory of Electronic and Information Engineering, State Ethnic Affairs Commission (Southwest Minzu University), Chengdu, 610041 China.
J Shanghai Jiaotong Univ Sci. 2022;27(1):81-89. doi: 10.1007/s12204-021-2393-2. Epub 2021 Dec 26.
The COVID-19 medical diagnosis method based on individual's chest X-ray (CXR) is achieved difficultly in the initial research, owing to difficulties in identifying CXR data of COVID-19 individuals. At the beginning of the study, infected individuals' CXRs were scarce. The combination of artificial intelligence and medical diagnosis has been advanced and popular. To solve the difficulties, the interpretability analysis of AI model was used to explore the pathological characteristics of CXR samples infected with COVID-19 and assist medical diagnosis. The dataset was expanded by data augmentation to avoid overfitting. Transfer learning was used to test different pre-trained models and the unique output layers were designed to complete the model training with few samples. In this study, the output results of four pre-trained models were compared in three different output layers, and the results after data augmentation were compared with the results of the original dataset. The control variable method was used to conduct independent tests of 24 groups. Finally, 99.23% accuracy and 98% recall rate were obtained, and the visual results of CXR interpretability analysis were displayed. The network of COVID-19 interpretable diagnosis algorithm has the characteristics of high generalization and lightweight. It can be quickly applied to other urgent tasks with insufficient experimental data. At the same time, interpretability analysis brings new possibilities for medical diagnosis.
基于个体胸部X光(CXR)的新冠病毒(COVID-19)医学诊断方法在初步研究中难以实现,原因在于识别COVID-19个体的CXR数据存在困难。在研究初期,感染个体的CXR稀缺。人工智能与医学诊断的结合已经取得进展并受到欢迎。为了解决这些困难,利用人工智能模型的可解释性分析来探索感染COVID-19的CXR样本的病理特征并辅助医学诊断。通过数据增强来扩充数据集以避免过拟合。使用迁移学习来测试不同的预训练模型,并设计独特的输出层以用少量样本完成模型训练。在本研究中,比较了四个预训练模型在三种不同输出层的输出结果,并将数据增强后的结果与原始数据集的结果进行了比较。采用控制变量法进行了24组独立测试。最终,获得了99.23%的准确率和98%的召回率,并展示了CXR可解释性分析的可视化结果。COVID-19可解释诊断算法网络具有高泛化性和轻量级的特点。它可以快速应用于其他实验数据不足的紧急任务。同时,可解释性分析为医学诊断带来了新的可能性。