Flexer Arthur, Stevens Jeff
Austrian Research Institute for Artificial Intelligence (OFAI), Austria.
George Mason University, Virginia, USA.
J New Music Res. 2017 Aug 3;47(1):17-28. doi: 10.1080/09298215.2017.1354891. eCollection 2018.
This paper is concerned with the impact of hubness, a general problem of machine learning in high-dimensional spaces, on a real-world music recommendation system based on visualisation of a k-nearest neighbour (knn) graph. Due to a problem of measuring distances in high dimensions, hub objects are recommended over and over again while anti-hubs are nonexistent in recommendation lists, resulting in poor reachability of the music catalogue. We present mutual proximity graphs, which are an alternative to knn and mutual knn graphs, and are able to avoid hub vertices having abnormally high connectivity. We show that mutual proximity graphs yield much better graph connectivity resulting in improved reachability compared to knn graphs, mutual knn graphs and mutual knn graphs enhanced with minimum spanning trees, while simultaneously reducing the negative effects of hubness.
本文关注的是枢纽性(高维空间中机器学习的一个普遍问题)对基于k近邻(knn)图可视化的真实世界音乐推荐系统的影响。由于高维空间中距离测量的问题,枢纽对象被反复推荐,而反枢纽对象在推荐列表中不存在,导致音乐目录的可达性较差。我们提出了相互邻近图,它是knn图和相互knn图的替代方案,能够避免枢纽顶点具有异常高的连通性。我们表明,与knn图、相互knn图以及用最小生成树增强的相互knn图相比,相互邻近图产生了更好的图连通性,从而提高了可达性,同时减少了枢纽性的负面影响。