Xie Haoran, Li Qing, Mao Xudong, Li Xiaodong, Cai Yi, Rao Yanghui
Department of Computer Science, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong Special Administrative Region.
Department of Computer Science, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong Special Administrative Region; Multimedia Software Engineering Research Centre, City University of Hong Kong, Kowloon, Hong Kong Special Administrative Region.
Neural Netw. 2014 Oct;58:111-21. doi: 10.1016/j.neunet.2014.05.009. Epub 2014 May 27.
In the era of big data, collaborative tagging (a.k.a. folksonomy) systems have proliferated as a consequence of the growth of Web 2.0 communities. Constructing user profiles from folksonomy systems is useful for many applications such as personalized search and recommender systems. The identification of latent user communities is one way to better understand and meet user needs. The behavior of users is highly influenced by the behavior of their neighbors or community members, and this can be utilized in constructing user profiles. However, conventional user profiling techniques often encounter data sparsity problems as data from a single user is insufficient to build a powerful profile. Hence, in this paper we propose a method of enriching user profiles based on latent user communities in folksonomy data. Specifically, the proposed approach contains four sub-processes: (i) tag-based user profiles are extracted from a folksonomy tripartite graph; (ii) a multi-faceted folksonomy graph is constructed by integrating tag and image affinity subgraphs with the folksonomy tripartite graph; (iii) random walk distance is used to unify various relationships and measure user similarities; (iv) a novel prototype-based clustering method based on user similarities is used to identify user communities, which are further used to enrich the extracted user profiles. To evaluate the proposed method, we conducted experiments using a public dataset, the results of which show that our approach outperforms previous ones in user profile enrichment.
在大数据时代,随着Web 2.0社区的发展,协作标签(又称大众分类法)系统大量涌现。从大众分类法系统构建用户简档对许多应用(如个性化搜索和推荐系统)很有用。识别潜在用户社区是更好地理解和满足用户需求的一种方式。用户的行为受到其邻居或社区成员行为的高度影响,这可用于构建用户简档。然而,传统的用户简档技术经常遇到数据稀疏问题,因为来自单个用户的数据不足以构建强大的简档。因此,在本文中,我们提出了一种基于大众分类法数据中的潜在用户社区来丰富用户简档的方法。具体而言,所提出的方法包含四个子过程:(i)从大众分类法三方图中提取基于标签的用户简档;(ii)通过将标签和图像亲和性子图与大众分类法三方图集成来构建多方面大众分类法图;(iii)使用随机游走距离来统一各种关系并衡量用户相似度;(iv)使用基于用户相似度的新颖的基于原型的聚类方法来识别用户社区,这些社区进一步用于丰富提取的用户简档。为了评估所提出的方法,我们使用一个公共数据集进行了实验,实验结果表明我们的方法在用户简档丰富方面优于以前的方法。