基于核的半监督距离度量在医学图像检索中肺肿块的相似性测量

Similarity measurement of lung masses for medical image retrieval using kernel based semisupervised distance metric.

作者信息

Wei Guohui, Ma He, Qian Wei, Qiu Min

机构信息

Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang 110819, China.

Sino-Dutch Biomedical and Information Engineering School, Northeastern University, Shenyang 110819, China and Key Laboratory of Medical Image Computing, Ministry of Education, Northeastern University, Shenyang 110819, China.

出版信息

Med Phys. 2016 Dec;43(12):6259. doi: 10.1118/1.4966030.

DOI:10.1118/1.4966030

PMID:27908158

Abstract

PURPOSE

To develop a new algorithm to measure the similarity between the query lung mass and reference lung mass data set for content-based medical image retrieval (CBMIR).

METHODS

A lung mass data set including 746 mass regions of interest (ROIs) was assembled. Among them, 375 ROIs depicted malignant lesions and 371 depicted benign lesions. Each mass ROI is represented by a vector of 26 texture features. A kernel function was employed to map the original data in input space to a feature space. In this space, a semisupervised distance metric was learned, which used differential scatter discriminant criterion to represent the semantic relevance, and the regularization term to represent the visual similarity. The learned distance metric can measure the similarity of the query mass and reference mass data set. The clustering accuracy is used to configure the parameters. The retrieval accuracy and classification accuracy are used as the performance assessment index.

RESULTS

After configuring the parameters, a mean clustering accuracy of 0.87 can be achieved. For retrieval accuracy, our algorithm achieves better performance than other state-of-the-art retrieval algorithms when applying a leave-one-out validation method to the testing data set. For classification accuracy, the area under the ROC curve of our algorithm can be achieved as 0.941 ± 0.006. The running times of 346 query images with the proposed algorithm are 5.399 and 6.0 s, respectively.

CONCLUSIONS

The study results demonstrated the proposed algorithm outperforms the compared algorithms, when taking the semantic relevant and visual similarity into account in kernel space. The algorithm can be used in a CBMIR system for a query mass to retrieve similarity masses, which can help doctors make better decisions.

摘要

目的

开发一种新算法，用于在基于内容的医学图像检索（CBMIR）中测量查询肺结节与参考肺结节数据集之间的相似度。

方法

组装了一个包含746个感兴趣的结节区域（ROI）的肺结节数据集。其中，375个ROI描绘了恶性病变，371个描绘了良性病变。每个结节ROI由一个包含26个纹理特征的向量表示。采用核函数将输入空间中的原始数据映射到特征空间。在这个空间中，学习了一种半监督距离度量，它使用差分散射判别准则来表示语义相关性，使用正则化项来表示视觉相似性。学习到的距离度量可以测量查询结节与参考结节数据集的相似度。使用聚类准确率来配置参数。使用检索准确率和分类准确率作为性能评估指标。