Wichaiwong Tanakorn
Department of Computer Science, Faculty of Science, Kasetsart University, Bangkok 10903, Thailand.
ScientificWorldJournal. 2014 Feb 13;2014:404518. doi: 10.1155/2014/404518. eCollection 2014.
XML document is now widely used for modelling and storing structured documents. The structure is very rich and carries important information about contents and their relationships, for example, e-Commerce. XML data-centric collections require query terms allowing users to specify constraints on the document structure; mapping structure queries and assigning the weight are significant for the set of possibly relevant documents with respect to structural conditions. In this paper, we present an extension to the MEXIR search system that supports the combination of structural and content queries in the form of content-and-structure queries, which we call the Exponentiation function. It has been shown the structural information improve the effectiveness of the search system up to 52.60% over the baseline BM25 at MAP.
XML文档如今被广泛用于对结构化文档进行建模和存储。其结构非常丰富,承载着有关内容及其关系的重要信息,例如电子商务。以XML数据为中心的集合需要查询词,以便用户能够指定对文档结构的约束;映射结构查询并分配权重对于满足结构条件的可能相关文档集来说意义重大。在本文中,我们提出了对MEXIR搜索系统的扩展,该扩展以内容与结构查询的形式支持结构查询和内容查询的组合,我们将其称为指数函数。研究表明,在平均精度均值(MAP)方面,结构信息比基线BM25搜索系统的有效性提高了52.60%。