• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

职业篮球运动员的强化学习。

Reinforcement learning in professional basketball players.

机构信息

Department of Neurobiology, The Interdisciplinary Center for Neural Computation and Edmond and Lily Safra Center for Brain Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel.

出版信息

Nat Commun. 2011 Dec 6;2:569. doi: 10.1038/ncomms1580.

DOI:10.1038/ncomms1580
PMID:22146388
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3247813/
Abstract

Reinforcement learning in complex natural environments is a challenging task because the agent should generalize from the outcomes of actions taken in one state of the world to future actions in different states of the world. The extent to which human experts find the proper level of generalization is unclear. Here we show, using the sequences of field goal attempts made by professional basketball players, that the outcome of even a single field goal attempt has a considerable effect on the rate of subsequent 3 point shot attempts, in line with standard models of reinforcement learning. However, this change in behaviour is associated with negative correlations between the outcomes of successive field goal attempts. These results indicate that despite years of experience and high motivation, professional players overgeneralize from the outcomes of their most recent actions, which leads to decreased performance.

摘要

在复杂的自然环境中进行强化学习是一项具有挑战性的任务,因为代理需要将在一个世界状态下采取的行动的结果推广到未来在不同世界状态下的行动。人类专家在多大程度上能够找到适当的泛化程度尚不清楚。在这里,我们使用职业篮球运动员的投篮尝试序列表明,即使是单次投篮尝试的结果也会对随后的三分球尝试率产生相当大的影响,这与强化学习的标准模型一致。然而,这种行为的变化与连续投篮尝试结果之间的负相关有关。这些结果表明,尽管拥有多年的经验和高度的积极性,职业球员还是会从最近的行动结果中过度泛化,从而导致表现下降。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806a/3247813/f133be52d927/ncomms1580-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806a/3247813/2f24de71a122/ncomms1580-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806a/3247813/28a1b853974d/ncomms1580-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806a/3247813/efb14dc248b7/ncomms1580-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806a/3247813/f133be52d927/ncomms1580-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806a/3247813/2f24de71a122/ncomms1580-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806a/3247813/28a1b853974d/ncomms1580-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806a/3247813/efb14dc248b7/ncomms1580-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806a/3247813/f133be52d927/ncomms1580-f4.jpg

相似文献

1
Reinforcement learning in professional basketball players.职业篮球运动员的强化学习。
Nat Commun. 2011 Dec 6;2:569. doi: 10.1038/ncomms1580.
2
Spatial generalization in operant learning: lessons from professional basketball.操作性学习中的空间泛化:来自职业篮球的经验教训。
PLoS Comput Biol. 2014 May 22;10(5):e1003623. doi: 10.1371/journal.pcbi.1003623. eCollection 2014 May.
3
The effects of sport expertise and shot results on basketball players' action anticipation.运动专长和投篮结果对篮球运动员动作预判的影响。
PLoS One. 2020 Jan 6;15(1):e0227521. doi: 10.1371/journal.pone.0227521. eCollection 2020.
4
Growth, functional capacities and motivation for achievement and competitiveness in youth basketball: an interdisciplinary approach.青少年篮球运动中的成长、功能能力、成就动机和竞争力:一种跨学科方法
J Sports Sci. 2018 Apr;36(7):742-748. doi: 10.1080/02640414.2017.1340654. Epub 2017 Jun 12.
5
Left-handedness in professional basketball: prevalence, performance, and survival.职业篮球中的左撇子:流行度、表现和存活率。
Percept Mot Skills. 2011 Dec;113(3):815-24. doi: 10.2466/05.19.25.PMS.113.6.815-824.
6
The effect of perceived streakiness on the shot-taking behaviour of basketball players.感知到的连续性对篮球运动员投篮行为的影响。
Eur J Sport Sci. 2015;15(7):647-54. doi: 10.1080/17461391.2014.982205. Epub 2014 Nov 27.
7
Perceived hotness affects behavior of basketball players and coaches.热感知会影响篮球运动员和教练的行为。
Psychol Sci. 2013 Jul 1;24(7):1151-6. doi: 10.1177/0956797612468452. Epub 2013 Apr 29.
8
Performance, motivation, and enjoyment in young female basketball players: An interdisciplinary approach.年轻女子篮球运动员的表现、动机和享受:跨学科方法。
J Sports Sci. 2020 Apr;38(8):873-885. doi: 10.1080/02640414.2020.1736247. Epub 2020 Mar 5.
9
Sport-specific decision-making in a Go/NoGo reaction task: difference among nonathletes and baseball and basketball players.在“去/不去”反应任务中特定运动项目的决策:非运动员与棒球和篮球运动员之间的差异。
Percept Mot Skills. 2008 Feb;106(1):163-70. doi: 10.2466/pms.106.1.163-170.
10
Motor adaptation in complex sports - the influence of visual context information on the adaptation of the three-point shot to altered task demands in expert basketball players.复杂运动中的动作适应——视觉情境信息对专家篮球运动员改变三点投篮任务要求的适应的影响。
J Sports Sci. 2013;31(7):750-8. doi: 10.1080/02640414.2012.750003. Epub 2012 Dec 10.

引用本文的文献

1
Social post-error adaptations across four NBA basketball seasons.四个NBA篮球赛季中的社会失误后适应情况。
Sci Rep. 2025 May 23;15(1):17919. doi: 10.1038/s41598-025-02006-x.
2
Reward signals in the motor cortex: from biology to neurotechnology.运动皮层中的奖赏信号:从生物学到神经技术
Nat Commun. 2025 Feb 3;16(1):1307. doi: 10.1038/s41467-024-55016-0.
3
Computational and Neural Evidence for Altered Fast and Slow Learning from Losses in Problem Gambling.问题赌博中因损失而导致快速和慢速学习改变的计算和神经学证据。

本文引用的文献

1
Neural signature of fictive learning signals in a sequential investment task.序列投资任务中虚构学习信号的神经特征
Proc Natl Acad Sci U S A. 2007 May 29;104(22):9493-8. doi: 10.1073/pnas.0608842104. Epub 2007 May 22.
2
Is matching innate?匹配是天生的吗?
J Exp Anal Behav. 2007 Mar;87(2):161-99. doi: 10.1901/jeab.2007.92-05.
3
Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans.多巴胺依赖的预测误差是人类寻求奖励行为的基础。
J Neurosci. 2025 Jan 1;45(1):e0080242024. doi: 10.1523/JNEUROSCI.0080-24.2024.
4
Subpopulations of neurons in lOFC encode previous and current rewards at time of choice.腹外侧眶额皮层的神经元亚群在做出选择时编码先前和当前的奖励。
Elife. 2021 Oct 25;10:e70129. doi: 10.7554/eLife.70129.
5
Dichotomous dopaminergic and noradrenergic neural states mediate distinct aspects of exploitative behavioral states.二元多巴胺能和去甲肾上腺素能神经状态介导了剥削性行为状态的不同方面。
Sci Adv. 2021 Jul 23;7(30). doi: 10.1126/sciadv.abh2059. Print 2021 Jul.
6
Competition Rather Than Observation and Cooperation Facilitates Optimal Motor Planning.竞争而非观察与合作有助于实现最佳运动规划。
Front Sports Act Living. 2021 Feb 26;3:637225. doi: 10.3389/fspor.2021.637225. eCollection 2021.
7
Lateral orbitofrontal cortex promotes trial-by-trial learning of risky, but not spatial, biases.外侧眶额皮层促进风险但非空间偏向的逐次学习。
Elife. 2019 Nov 6;8:e49744. doi: 10.7554/eLife.49744.
8
Risk aversion in the adjustment of speed-accuracy tradeoff depending on time constraints.根据时间限制调整速度-准确性权衡时的风险规避。
Sci Rep. 2019 Aug 13;9(1):11732. doi: 10.1038/s41598-019-48052-0.
9
Deviation from the matching law reflects an optimal strategy involving learning over multiple timescales.偏离匹配律反映了一种涉及多个时间尺度的学习的最优策略。
Nat Commun. 2019 Apr 1;10(1):1466. doi: 10.1038/s41467-019-09388-3.
10
Evidence for Sequential Performance Effects in Professional Darts.职业飞镖运动中连续表现效应的证据。
Front Psychol. 2018 Apr 26;9:591. doi: 10.3389/fpsyg.2018.00591. eCollection 2018.
Nature. 2006 Aug 31;442(7106):1042-5. doi: 10.1038/nature05051. Epub 2006 Aug 23.
4
Cortical substrates for exploratory decisions in humans.人类探索性决策的皮质基础。
Nature. 2006 Jun 15;441(7095):876-9. doi: 10.1038/nature04766.
5
The computational neurobiology of learning and reward.学习与奖励的计算神经生物学
Curr Opin Neurobiol. 2006 Apr;16(2):199-204. doi: 10.1016/j.conb.2006.03.006. Epub 2006 Mar 24.
6
Reinforcement learning and decision making in monkeys during a competitive game.猴子在竞争性游戏中的强化学习与决策
Brain Res Cogn Brain Res. 2004 Dec;22(1):45-58. doi: 10.1016/j.cogbrainres.2004.07.007.
7
Activity in posterior parietal cortex is correlated with the relative subjective desirability of action.顶叶后部皮质的活动与行动的相对主观合意性相关。
Neuron. 2004 Oct 14;44(2):365-78. doi: 10.1016/j.neuron.2004.09.009.
8
Matching behavior and the representation of value in the parietal cortex.匹配行为与顶叶皮质中价值的表征
Science. 2004 Jun 18;304(5678):1782-7. doi: 10.1126/science.1094765.
9
Dissociable roles of ventral and dorsal striatum in instrumental conditioning.腹侧和背侧纹状体在工具性条件反射中的不同作用。
Science. 2004 Apr 16;304(5669):452-4. doi: 10.1126/science.1094285.
10
The rat approximates an ideal detector of changes in rates of reward: implications for the law of effect.大鼠近似于奖励率变化的理想探测器:对效果律的启示。
J Exp Psychol Anim Behav Process. 2001 Oct;27(4):354-72. doi: 10.1037//0097-7403.27.4.354.