Institute of Computational Perception, Johannes Kepler University Linz, Linz, Austria.
PLoS One. 2019 Jun 7;14(6):e0217389. doi: 10.1371/journal.pone.0217389. eCollection 2019.
Popularity-based approaches are widely adopted in music recommendation systems, both in industry and research. These approaches recommend to the target user what is currently popular among all users of the system. However, as the popularity distribution of music items typically is a long-tail distribution, popularity-based approaches to music recommendation fall short in satisfying listeners that have specialized music preferences far away from the global music mainstream. Addressing this gap, the contribution of this article is three-fold.
First, we provide several quantitative measures describing the proximity of a user's music preference to the music mainstream. Assuming that there is a difference between the global music mainstream and a country-specific one, we define the measures at two levels: relating a listener's music preferences to the global music preferences of all users, or relating them to music preferences of the user's country. To quantify such music preferences, we define a music item's popularity in terms of artist playcounts (APC) and artist listener counts (ALC). Moreover, we adopt a distribution-based and a rank-based approach as means to decrease bias towards the head of the long-tail distribution. This eventually results in a framework of 6 measures to quantify music mainstream.
Second, we perform in-depth quantitative and qualitative studies of music mainstream in that we (i) analyze differences between countries in terms of their level of mainstreaminess, (ii) uncover both positive and negative outliers (substantially higher and lower country-specific popularity, respectively, compared to the global mainstream), analyzing these with a mixed-methods approach, and (iii) investigate differences between countries in terms of listening preferences related to popular music artists. We conduct our studies and experiments using the standardized LFM-1b dataset, from which we analyze about 800,000,000 listening events shared by about 53,000 users (from 47 countries) of the music streaming platform Last.fm. We show that there are substantial country-specific differences in listeners' music consumption behavior with respect to the most popular artists listened to.
Third, we demonstrate the applicability of our study results to improve music recommendation systems. To this end, we conduct rating prediction experiments in which we tailor recommendations to a user's level of preference for the music mainstream using the proposed 6 mainstreaminess measures: defined by a distribution-based or rank-based approach, defined on a global level or on a country level (for the user's country), and for APC or ALC. Our approach roughly equals a hybrid recommendation approach in which a demographic filtering strategy is implemented before collaborative filtering is performed. Results suggest that, in terms of rating prediction accuracy, each of the presented mainstreaminess definitions has its merits.
流行度为基础的方法在音乐推荐系统中被广泛采用,无论是在工业界还是学术界。这些方法向目标用户推荐系统中所有用户当前喜欢的音乐。然而,由于音乐项目的流行度分布通常是长尾分布,基于流行度的音乐推荐方法在满足那些音乐偏好远离全球主流音乐的听众方面存在不足。为了解决这个差距,本文的贡献有三个方面。
首先,我们提供了几个定量的衡量标准,用于描述用户音乐偏好与音乐主流的接近程度。假设全球音乐主流与特定国家的音乐主流之间存在差异,我们在两个层面上定义了这些衡量标准:将听众的音乐偏好与系统中所有用户的全球音乐偏好联系起来,或与用户所在国家的音乐偏好联系起来。为了量化这种音乐偏好,我们根据艺术家播放次数(APC)和艺术家听众数量(ALC)来定义音乐项目的流行度。此外,我们采用基于分布的和基于排名的方法来减少对长尾分布头部的偏见。这最终导致了一个包含 6 个衡量音乐主流的指标的框架。
其次,我们深入研究了音乐主流在不同国家之间的差异,我们(i)分析了不同国家在主流度方面的差异,(ii)发现了正的和负的异常值(与全球主流相比,分别是显著更高和更低的国家特定流行度),并采用混合方法对这些异常值进行分析,(iii)研究了不同国家在与流行音乐艺术家相关的音乐偏好方面的差异。我们使用标准化的 LFM-1b 数据集进行研究和实验,该数据集包含大约 8 亿个由大约 53000 名来自 47 个国家的音乐流媒体平台 Last.fm 用户分享的收听事件。我们发现,在听众对最受欢迎的艺术家的音乐消费行为方面,存在着实质性的国家特定差异。
第三,我们展示了我们的研究结果如何应用于改进音乐推荐系统。为此,我们进行了评分预测实验,根据提出的 6 个主流度衡量标准,针对用户对音乐主流的偏好程度,为用户量身定制推荐:基于分布的或基于排名的方法,基于全球水平或国家水平(对于用户所在的国家),以及基于 APC 或 ALC。我们的方法大致相当于一种混合推荐方法,其中在执行协同过滤之前实施了一种人口统计学过滤策略。结果表明,就评分预测准确性而言,每个提出的主流度定义都有其优点。