Okyay Savas, Aygun Sercan
Computer Engineering, Eskisehir Osmangazi University, Eskisehir, Turkey.
Computer Engineering, Eskisehir Technical University, Eskisehir, Turkey.
PeerJ Comput Sci. 2021 Dec 9;7:e784. doi: 10.7717/peerj-cs.784. eCollection 2021.
Recommender systems include a broad scope of applications and are associated with subjective preferences, indicating variations in recommendations. As a field of data science and machine learning, recommender systems require both statistical perspectives and sufficient performance monitoring. In this paper, we propose diversified similarity measurements by observing recommendation performance using generic metrics. Considering collaborative filtering, the probability of an item being preferred by any user is measured. Having examined the best neighbor counts, we verified the test item bias phenomenon for similarity equations. Because of the statistical parameters used for computing in a global scope, there is implicit information in the literature, whether those parameters comprise the focal point user data statically. Regarding each dynamic prediction, user-wise parameters are expected to be generated at runtime by excluding the item of interest. This yields reliable results and is more compatible with real-time systems. Furthermore, we underline the effect of significance weighting by examining the similarities between a user of interest and its neighbors. Overall, this study uniquely combines significance weighting and test-item bias mitigation by inspecting the fine-tuned neighborhood. Consequently, the results reveal adequate and combinations. The source code of our architecture is available at https://codeocean.com/capsule/1427708/tree/v1.
推荐系统包含广泛的应用范围,并且与主观偏好相关联,这表明推荐存在差异。作为数据科学和机器学习的一个领域,推荐系统既需要统计学视角,也需要充分的性能监测。在本文中,我们通过使用通用指标观察推荐性能来提出多样化的相似性度量。考虑到协同过滤,测量任何用户偏好某一物品的概率。在研究了最佳邻居数量之后,我们验证了相似性方程的测试物品偏差现象。由于在全局范围内用于计算的统计参数,文献中存在隐含信息,无论这些参数是否静态地包含焦点用户数据。对于每个动态预测,预计在运行时通过排除感兴趣的物品来生成用户特定的参数。这会产生可靠的结果,并且与实时系统更兼容。此外,我们通过检查感兴趣用户与其邻居之间的相似性来强调显著性加权的效果。总体而言,本研究通过检查微调后的邻域,独特地结合了显著性加权和测试物品偏差缓解。因此,结果揭示了充分的 和 组合。我们架构的源代码可在https://codeocean.com/capsule/1427708/tree/v1获取。