IEEE Trans Neural Netw Learn Syst. 2021 Mar;32(3):1375-1388. doi: 10.1109/TNNLS.2020.2984665. Epub 2021 Mar 1.
Traditional recommendation methods suffer from limited performance, which can be addressed by incorporating abundant auxiliary/side information. This article focuses on a personalized music recommender system that incorporates rich content and context data in a unified and adaptive way to address the abovementioned problems. The content information includes music textual content, such as metadata, tags, and lyrics, and the context data incorporate users' behaviors, including music listening records, music playing sequences, and sessions. Specifically, a heterogeneous information network (HIN) is first presented to incorporate different kinds of content and context data. Then, a novel method called content- and context-aware music embedding (CAME) is proposed to obtain the low-dimension dense real-valued feature representations (embeddings) of music pieces from HIN. Especially, one music piece generally highlights different aspects when interacting with various neighbors, and it should have different representations separately. CAME seamlessly combines deep learning techniques, including convolutional neural networks and attention mechanisms, with the embedding model to capture the intrinsic features of music pieces as well as their dynamic relevance and interactions adaptively. Finally, we further infer users' general musical preferences as well as their contextual preferences for music and propose a content- and context-aware music recommendation method. Comprehensive experiments as well as quantitative and qualitative evaluations have been performed on real-world music data sets, and the results show that the proposed recommendation approach outperforms state-of-the-art baselines and is able to handle sparse data effectively.
传统的推荐方法受到性能限制,可以通过结合丰富的辅助/侧信息来解决。本文专注于个性化音乐推荐系统,该系统以统一和自适应的方式整合丰富的内容和上下文数据,以解决上述问题。内容信息包括音乐文本内容,如元数据、标签和歌词,上下文数据包括用户行为,如音乐收听记录、音乐播放序列和会话。具体来说,首先提出了一个异构信息网络(HIN),以整合不同类型的内容和上下文数据。然后,提出了一种称为内容和上下文感知音乐嵌入(CAME)的新方法,从 HIN 中获得音乐作品的低维密集实值特征表示(嵌入)。特别是,当与各种邻居交互时,一个音乐作品通常会突出不同的方面,并且应该分别具有不同的表示。CAME 无缝结合了深度学习技术,包括卷积神经网络和注意力机制,以及嵌入模型,以自适应地捕捉音乐作品的内在特征及其动态相关性和交互。最后,我们进一步推断用户的一般音乐偏好以及他们对音乐的上下文偏好,并提出了一种内容和上下文感知的音乐推荐方法。在真实的音乐数据集上进行了全面的实验以及定量和定性评估,结果表明,所提出的推荐方法优于最先进的基线方法,并且能够有效地处理稀疏数据。