Information Processing Laboratory, Department of Informatics, Athens University of Economics and Business, 76 Patission Str., 104.34 Athens, Greece.
Comput Med Imaging Graph. 2015 Jan;39:27-34. doi: 10.1016/j.compmedimag.2014.05.009. Epub 2014 Jun 2.
Latent Semantic Analysis (LSA) although has been used successfully in text retrieval when applied to CBIR induces scalability issues with large image collections. The method so far has been used with small collections due to the high cost of storage and computational time for solving the SVD problem for a large and dense feature matrix. Here we present an effective and efficient approach of applying LSA skipping the SVD solution of the feature matrix and overcoming in this way the deficiencies of the method with large scale datasets. Early and late fusion techniques are tested and their performance is calculated. The study demonstrates that early fusion of several composite descriptors with visual words increase retrieval effectiveness. It also combines well in a late fusion for mixed (textual and visual) ad hoc and modality classification. The results reported are comparable to state of the art algorithms without including additional knowledge from the medical domain.
潜在语义分析(LSA)虽然在文本检索中得到了成功应用,但应用于 CBIR 时,会导致大规模图像集合的可扩展性问题。由于求解大型密集特征矩阵的奇异值分解(SVD)问题需要较高的存储和计算成本,该方法迄今为止仅适用于小型集合。在这里,我们提出了一种有效且高效的方法,即跳过特征矩阵的 SVD 求解,从而克服了该方法在大规模数据集上的缺陷。本文测试了早期和晚期融合技术,并计算了它们的性能。研究表明,使用视觉词汇对多个复合描述符进行早期融合可以提高检索的有效性。它还可以在混合(文本和视觉)特定和模态分类的晚期融合中很好地结合。报告的结果与最先进的算法相当,而不包括来自医学领域的额外知识。