Sabetghadam Serwah, Lupu Mihai, Bierig Ralf, Rauber Andreas
1Institute of Software Technology and Interactive Systems, Vienna University of Technology, Vienna, Austria.
2Department of Computer Science, Maynooth University, Maynooth, Ireland.
Int J Multimed Inf Retr. 2018;7(3):157-171. doi: 10.1007/s13735-017-0145-8. Epub 2017 Dec 16.
Nowadays, there is a proliferation of available information sources from different modalities-text, images, audio, video and more. Information objects are not isolated anymore. They are frequently connected via metadata, semantic links, etc. This leads to various challenges in graph-based information retrieval. This paper is concerned with the reachability analysis of multimodal graph modelled collections. We use our framework to leverage the combination of features of different modalities through our formulation of faceted search. This study highlights the effect of different facets and link types in improving reachability of relevant information objects. The experiments are performed on the Image CLEF 2011 Wikipedia collection with about 400,000 documents and images. The results demonstrate that the combination of different facets is conductive to obtain higher reachability. We obtain 373% recall gain for very hard topics by using our graph model of the collection. Further, by adding semantic links to the collection, we gain a 10% increase in the overall recall.
如今,来自不同形式(文本、图像、音频、视频等)的可用信息源大量涌现。信息对象不再孤立。它们常常通过元数据、语义链接等相互连接。这给基于图的信息检索带来了各种挑战。本文关注多模态图建模集合的可达性分析。我们通过构建分面搜索,利用我们的框架来整合不同模态的特征。本研究突出了不同分面和链接类型在提高相关信息对象可达性方面的作用。实验是在拥有约40万份文档和图像的2011年图像CLEF维基百科集合上进行的。结果表明,不同分面的组合有助于获得更高的可达性。通过使用我们的集合图模型,对于非常难的主题,我们获得了373%的召回率提升。此外,通过向集合中添加语义链接,我们在总体召回率上提高了10%。