Human Variability and the Explore-Exploit Trade-Off in Recommendation.

Affiliations

Department of Mathematics and Computer Science, Rutgers University.

Center for Perceptual Systems, University of Texas at Austin.

Publication information

Cogn Sci. 2023 Apr;47(4):e13279. doi: 10.1111/cogs.13279.

DOI:10.1111/cogs.13279
PMID:37052215
Abstract

The enormous scale of the available information and products on the Internet has necessitated the development of algorithms that intermediate between options and human users. These algorithms attempt to provide the user with relevant information. In doing so, the algorithms may incur potential negative consequences stemming from the need to select items about which they are uncertain, in order to obtain information about users, versus the need to select items about which they are certain, in order to secure high ratings. This tension is an instance of the exploration-exploitation trade-off in the context of recommender systems. Because humans are in this interaction loop, the long-term trade-off behavior depends on human variability. Our goal is to characterize the trade-off behavior as a function of human variability fundamental to such human-algorithm interaction. To tackle the characterization, we first introduce a unifying model that smoothly transitions between active learning and recommending relevant information. The unifying model gives us access to a continuum of algorithms along the exploration-exploitation trade-off. We then present two experiments to measure the trade-off behavior under two very different levels of human variability. The experimental results inform a thorough simulation study in which we modeled and varied human variability systematically over a wide range. The main result is that the exploration-exploitation trade-off grows in severity as human variability increases, but there exists a regime of low variability where algorithms balanced in exploration and exploitation can largely overcome the trade-off.
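
The idea of a continuum of algorithms between active learning and recommendation can be illustrated with a minimal sketch. This is not the model from the paper, whose details the abstract does not give. It assumes a hypothetical mixing weight `lam` and hypothetical per-item predictions `pred_mean` (expected rating) and `pred_std` (uncertainty about that rating): `lam = 0` selects the most uncertain item (pure exploration, as in active learning), `lam = 1` selects the item with the highest predicted rating (pure exploitation), and intermediate values blend the two.

```python
from typing import List, Sequence


def blended_scores(
    pred_mean: Sequence[float], pred_std: Sequence[float], lam: float
) -> List[float]:
    """Score each item as a convex mix of predicted rating and uncertainty.

    Assumption (not from the paper): a linear blend controlled by `lam`.
    lam = 0.0 -> score is the uncertainty only (explore).
    lam = 1.0 -> score is the predicted rating only (exploit).
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    if len(pred_mean) != len(pred_std):
        raise ValueError("pred_mean and pred_std must have the same length")
    return [lam * m + (1.0 - lam) * s for m, s in zip(pred_mean, pred_std)]


def select_item(
    pred_mean: Sequence[float], pred_std: Sequence[float], lam: float
) -> int:
    """Return the index of the item with the highest blended score."""
    scores = blended_scores(pred_mean, pred_std, lam)
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical predictions for three items: item 0 is well known and
# well liked, item 2 is poorly known and has a low predicted rating.
means = [0.9, 0.5, 0.2]
stds = [0.05, 0.1, 0.6]

explore_choice = select_item(means, stds, lam=0.0)   # most uncertain item
exploit_choice = select_item(means, stds, lam=1.0)   # highest predicted rating
balanced_choice = select_item(means, stds, lam=0.5)  # equal-weight blend
```

Sweeping `lam` from 0 to 1 traces out one possible family of algorithms along the trade-off. In a simulation along the lines the abstract describes, human variability could be represented as noise added to the ratings that users return, but that step is left out here because the abstract does not specify how it was modeled.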

Similar articles

1
Human Variability and the Explore-Exploit Trade-Off in Recommendation.
Cogn Sci. 2023 Apr;47(4):e13279. doi: 10.1111/cogs.13279.
2
Protection from uncertainty in the exploration/exploitation trade-off.
J Exp Psychol Learn Mem Cogn. 2022 Apr;48(4):547-568. doi: 10.1037/xlm0000883. Epub 2021 Jun 10.
3
Computational mechanisms of curiosity and goal-directed exploration.
Elife. 2019 May 10;8:e41703. doi: 10.7554/eLife.41703.
4
Dopaminergic Control of the Exploration-Exploitation Trade-Off via the Basal Ganglia.
Front Neurosci. 2012 Feb 6;6:9. doi: 10.3389/fnins.2012.00009. eCollection 2012.
5
Boldness predicts an individual's position along an exploration-exploitation foraging trade-off.
J Anim Ecol. 2017 Sep;86(5):1257-1268. doi: 10.1111/1365-2656.12724. Epub 2017 Jul 24.
6
Exploration-exploitation trade-off features a saltatory search behaviour.
J R Soc Interface. 2013 Jun 19;10(85):20130352. doi: 10.1098/rsif.2013.0352. Print 2013 Aug 6.
7
Exploration versus exploitation decisions in the human brain: A systematic review of functional neuroimaging and neuropsychological studies.
Neuropsychologia. 2024 Jan 10;192:108740. doi: 10.1016/j.neuropsychologia.2023.108740. Epub 2023 Nov 29.
8
The role of uncertainty in attentional and choice exploration.
Psychon Bull Rev. 2019 Dec;26(6):1911-1916. doi: 10.3758/s13423-019-01653-2.
9
Dopamine blockade impairs the exploration-exploitation trade-off in rats.
Sci Rep. 2019 May 1;9(1):6770. doi: 10.1038/s41598-019-43245-z.
10
An information-theoretic approach to curiosity-driven reinforcement learning.
Theory Biosci. 2012 Sep;131(3):139-48. doi: 10.1007/s12064-011-0142-z. Epub 2012 Jul 12.