Wei Guohui, Ma He, Qian Wei, Qiu Min
Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang 110819, China.
Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang 110819, China and Key Laboratory of Medical Image Computing, Ministry of Education, Northeastern University, Shenyang 110819, China.
Med Phys. 2016 Dec;43(12):6259. doi: 10.1118/1.4966030.
To develop a new algorithm to measure the similarity between the query lung mass and reference lung mass data set for content-based medical image retrieval (CBMIR).
A lung mass data set including 746 mass regions of interest (ROIs) was assembled. Among them, 375 ROIs depicted malignant lesions and 371 depicted benign lesions. Each mass ROI is represented by a vector of 26 texture features. A kernel function was employed to map the original data in input space to a feature space. In this space, a semisupervised distance metric was learned, which used differential scatter discriminant criterion to represent the semantic relevance, and the regularization term to represent the visual similarity. The learned distance metric can measure the similarity of the query mass and reference mass data set. The clustering accuracy is used to configure the parameters. The retrieval accuracy and classification accuracy are used as the performance assessment index.
After configuring the parameters, a mean clustering accuracy of 0.87 can be achieved. For retrieval accuracy, our algorithm achieves better performance than other state-of-the-art retrieval algorithms when applying a leave-one-out validation method to the testing data set. For classification accuracy, the area under the ROC curve of our algorithm can be achieved as 0.941 ± 0.006. The running times of 346 query images with the proposed algorithm are 5.399 and 6.0 s, respectively.
The study results demonstrated the proposed algorithm outperforms the compared algorithms, when taking the semantic relevant and visual similarity into account in kernel space. The algorithm can be used in a CBMIR system for a query mass to retrieve similarity masses, which can help doctors make better decisions.
开发一种新算法,用于在基于内容的医学图像检索(CBMIR)中测量查询肺结节与参考肺结节数据集之间的相似度。
组装了一个包含746个感兴趣的结节区域(ROI)的肺结节数据集。其中,375个ROI描绘了恶性病变,371个描绘了良性病变。每个结节ROI由一个包含26个纹理特征的向量表示。采用核函数将输入空间中的原始数据映射到特征空间。在这个空间中,学习了一种半监督距离度量,它使用差分散射判别准则来表示语义相关性,使用正则化项来表示视觉相似性。学习到的距离度量可以测量查询结节与参考结节数据集的相似度。使用聚类准确率来配置参数。使用检索准确率和分类准确率作为性能评估指标。
配置参数后,平均聚类准确率可达到0.87。对于检索准确率,当对测试数据集应用留一法验证方法时,我们的算法比其他现有检索算法具有更好的性能。对于分类准确率,我们算法的ROC曲线下面积可达0.941±0.006。使用所提出的算法对346张查询图像的运行时间分别为5.399秒和6.0秒。
研究结果表明,在核空间中考虑语义相关性和视觉相似性时,所提出的算法优于比较算法。该算法可用于CBMIR系统中对查询结节检索相似结节,有助于医生做出更好的决策。