Department of Radiology, University of Ulsan College of Medicine, 388-1 Pungnap2-dong, Songpa-gu, Seoul 138-736, South Korea.
Med Phys. 2013 May;40(5):051912. doi: 10.1118/1.4802214.
To investigate the effect of using different computed tomography (CT) scanners on the accuracy of high-resolution CT (HRCT) images in classifying regional disease patterns in patients with diffuse lung disease, support vector machine (SVM) and Bayesian classifiers were applied to multicenter data.
Two experienced radiologists marked sets of 600 rectangular 20 × 20 pixel regions of interest (ROIs) on HRCT images obtained from two scanners (GE and Siemens), including 100 ROIs for each of local patterns of lungs-normal lung and five of regional pulmonary disease patterns (ground-glass opacity, reticular opacity, honeycombing, emphysema, and consolidation). Each ROI was assessed using 22 quantitative features belonging to one of the following descriptors: histogram, gradient, run-length, gray level co-occurrence matrix, low-attenuation area cluster, and top-hat transform. For automatic classification, a Bayesian classifier and a SVM classifier were compared under three different conditions. First, classification accuracies were estimated using data from each scanner. Next, data from the GE and Siemens scanners were used for training and testing, respectively, and vice versa. Finally, all ROI data were integrated regardless of the scanner type and were then trained and tested together. All experiments were performed based on forward feature selection and fivefold cross-validation with 20 repetitions.
For each scanner, better classification accuracies were achieved with the SVM classifier than the Bayesian classifier (92% and 82%, respectively, for the GE scanner; and 92% and 86%, respectively, for the Siemens scanner). The classification accuracies were 82%/72% for training with GE data and testing with Siemens data, and 79%/72% for the reverse. The use of training and test data obtained from the HRCT images of different scanners lowered the classification accuracy compared to the use of HRCT images from the same scanner. For integrated ROI data obtained from both scanners, the classification accuracies with the SVM and Bayesian classifiers were 92% and 77%, respectively. The selected features resulting from the classification process differed by scanner, with more features included for the classification of the integrated HRCT data than for the classification of the HRCT data from each scanner. For the integrated data, consisting of HRCT images of both scanners, the classification accuracy based on the SVM was statistically similar to the accuracy of the data obtained from each scanner. However, the classification accuracy of the integrated data using the Bayesian classifier was significantly lower than the classification accuracy of the ROI data of each scanner.
The use of an integrated dataset along with a SVM classifier rather than a Bayesian classifier has benefits in terms of the classification accuracy of HRCT images acquired with more than one scanner. This finding is of relevance in studies involving large number of images, as is the case in a multicenter trial with different scanners.
应用支持向量机(SVM)和贝叶斯分类器研究使用不同 CT 扫描仪对分类弥漫性肺疾病患者高分辨率 CT(HRCT)图像中区域疾病模式的准确性的影响,对多中心数据进行了研究。
两名有经验的放射科医生在从两台 CT 扫描仪(GE 和西门子)获得的 HRCT 图像上标记了 600 个 20×20 像素的感兴趣区(ROI),包括每台扫描仪 100 个正常肺和 5 个区域性肺疾病模式(磨玻璃影、网状影、蜂窝影、气肿和实变)的 ROI。每个 ROI 使用属于以下描述符之一的 22 个定量特征进行评估:直方图、梯度、游程长度、灰度共生矩阵、低衰减区聚类和顶帽变换。对于自动分类,在三种不同情况下比较了贝叶斯分类器和 SVM 分类器。首先,使用每个扫描仪的数据估计分类精度。接下来,分别使用 GE 和西门子扫描仪的数据进行训练和测试,反之亦然。最后,无论扫描仪类型如何,都集成所有 ROI 数据,然后一起进行训练和测试。所有实验均基于前向特征选择和五重交叉验证,重复 20 次。
对于每个扫描仪,SVM 分类器的分类精度均优于贝叶斯分类器(GE 扫描仪分别为 92%和 82%;西门子扫描仪分别为 92%和 86%)。使用 GE 数据进行训练和使用西门子数据进行测试的分类精度分别为 82%/72%,反之亦然。与使用同一扫描仪的 HRCT 图像相比,使用来自不同扫描仪的 HRCT 图像进行训练和测试会降低分类精度。对于来自两台扫描仪的 HRCT 图像的集成 ROI 数据,SVM 和贝叶斯分类器的分类精度分别为 92%和 77%。分类过程中选择的特征因扫描仪而异,与对每个扫描仪的 HRCT 数据进行分类相比,对集成 HRCT 数据的分类需要更多的特征。对于由两台扫描仪的 HRCT 图像组成的集成数据,基于 SVM 的分类准确性在统计学上与从每个扫描仪获得的数据的准确性相似。然而,使用贝叶斯分类器的集成数据的分类精度明显低于每个扫描仪的 ROI 数据的分类精度。
在使用多台扫描仪的情况下,使用集成数据集和 SVM 分类器而非贝叶斯分类器可以提高 HRCT 图像的分类准确性。在涉及大量图像的研究中,例如在具有不同扫描仪的多中心试验中,这一发现具有相关性。