• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用于投资组合选择的风险感知多臂老虎机问题。

Risk-aware multi-armed bandit problem with application to portfolio selection.

作者信息

Huo Xiaoguang, Fu Feng

机构信息

Department of Mathematics, Cornell University, Ithaca, NY 14850, USA.

Department of Mathematics, Dartmouth College, Hanover, NH 03755, USA.

出版信息

R Soc Open Sci. 2017 Nov 15;4(11):171377. doi: 10.1098/rsos.171377. eCollection 2017 Nov.

DOI:10.1098/rsos.171377
PMID:29291122
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5717697/
Abstract

Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential decision-making under uncertainty, namely the versus dilemma, and therefore provides a natural connection to portfolio selection. In this paper, we incorporate risk awareness into the classic multi-armed bandit setting and introduce an algorithm to construct portfolio. Through filtering assets based on the topological structure of the financial market and combining the optimal multi-armed bandit policy with the minimization of a coherent risk measure, we achieve a balance between risk and return.

摘要

近年来,序贯投资组合选择在机器学习和量化金融领域引起了越来越多的关注。作为强化学习策略的数学框架,随机多臂老虎机问题解决了不确定性下序贯决策中的主要困难,即探索与利用的困境,因此为投资组合选择提供了自然的联系。在本文中,我们将风险意识纳入经典的多臂老虎机框架,并引入一种构建投资组合的算法。通过基于金融市场的拓扑结构对资产进行筛选,并将最优多臂老虎机策略与一致风险度量的最小化相结合,我们实现了风险与回报之间的平衡。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e42/5717697/911414a53985/rsos171377-g2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e42/5717697/79faaf430ef0/rsos171377-g1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e42/5717697/911414a53985/rsos171377-g2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e42/5717697/79faaf430ef0/rsos171377-g1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e42/5717697/911414a53985/rsos171377-g2.jpg

相似文献

1
Risk-aware multi-armed bandit problem with application to portfolio selection.应用于投资组合选择的风险感知多臂老虎机问题。
R Soc Open Sci. 2017 Nov 15;4(11):171377. doi: 10.1098/rsos.171377. eCollection 2017 Nov.
2
Overtaking method based on sand-sifter mechanism: Why do optimistic value functions find optimal solutions in multi-armed bandit problems?基于筛沙机制的超越方法:为何乐观值函数能在多臂老虎机问题中找到最优解?
Biosystems. 2015 Sep;135:55-65. doi: 10.1016/j.biosystems.2015.06.009. Epub 2015 Jul 10.
3
From deterministic to stochastic: an interpretable stochastic model-free reinforcement learning framework for portfolio optimization.从确定性到随机性:一种用于投资组合优化的可解释的无模型随机强化学习框架。
Appl Intell (Dordr). 2023;53(12):15188-15203. doi: 10.1007/s10489-022-04217-5. Epub 2022 Nov 11.
4
An empirical evaluation of active inference in multi-armed bandits.多臂赌博机中主动推理的实证评估。
Neural Netw. 2021 Dec;144:229-246. doi: 10.1016/j.neunet.2021.08.018. Epub 2021 Aug 26.
5
Decision making for large-scale multi-armed bandit problems using bias control of chaotic temporal waveforms in semiconductor lasers.利用半导体激光器中混沌时间波形的偏差控制解决大规模多臂老虎机问题的决策方法。
Sci Rep. 2022 May 16;12(1):8073. doi: 10.1038/s41598-022-12155-y.
6
Non Stationary Multi-Armed Bandit: Empirical Evaluation of a New Concept Drift-Aware Algorithm.非平稳多臂赌博机:一种新概念漂移感知算法的实证评估
Entropy (Basel). 2021 Mar 23;23(3):380. doi: 10.3390/e23030380.
7
Optimism in the face of uncertainty supported by a statistically-designed multi-armed bandit algorithm.面对不确定性时的乐观态度由一种经过统计设计的多臂赌博机算法提供支持。
Biosystems. 2017 Oct;160:25-32. doi: 10.1016/j.biosystems.2017.08.004. Epub 2017 Aug 22.
8
Gateway Selection in Millimeter Wave UAV Wireless Networks Using Multi-Player Multi-Armed Bandit.基于多人多臂老虎机的毫米波无人机无线网络中的网关选择
Sensors (Basel). 2020 Jul 16;20(14):3947. doi: 10.3390/s20143947.
9
Amoeba-inspired Tug-of-War algorithms for exploration-exploitation dilemma in extended Bandit Problem.用于扩展多臂老虎机问题中探索-利用困境的受变形虫启发的拔河算法。
Biosystems. 2014 Mar;117:1-9. doi: 10.1016/j.biosystems.2013.12.007. Epub 2013 Dec 31.
10
Finding structure in multi-armed bandits.在多臂老虎机中寻找结构。
Cogn Psychol. 2020 Jun;119:101261. doi: 10.1016/j.cogpsych.2019.101261. Epub 2020 Feb 12.

引用本文的文献

1
Understanding gambling behaviour and risk attitudes using cryptocurrency-based casino blockchain data.利用基于加密货币的赌场区块链数据理解赌博行为和风险态度。
R Soc Open Sci. 2020 Oct 21;7(10):201446. doi: 10.1098/rsos.201446. eCollection 2020 Oct.

本文引用的文献

1
Dynamic Portfolio Strategy Using Clustering Approach.基于聚类方法的动态投资组合策略
PLoS One. 2017 Jan 27;12(1):e0169299. doi: 10.1371/journal.pone.0169299. eCollection 2017.
2
Antisocial pool rewarding does not deter public cooperation.反社会群体奖励并不能阻止公众合作。
Proc Biol Sci. 2015 Oct 7;282(1816):20151975. doi: 10.1098/rspb.2015.1975.
3
Biological auctions with multiple rewards.具有多重奖励的生物拍卖
Proc Biol Sci. 2015 Aug 7;282(1812):20151041. doi: 10.1098/rspb.2015.1041.
4
Conformity enhances network reciprocity in evolutionary social dilemmas.在进化社会困境中,从众行为增强了网络互惠性。
J R Soc Interface. 2015 Feb 6;12(103). doi: 10.1098/rsif.2014.1299.
5
Solving the collective-risk social dilemma with risky assets in well-mixed and structured populations.在均匀混合和结构化群体中利用风险资产解决集体风险社会困境。
Phys Rev E Stat Nonlin Soft Matter Phys. 2014 Nov;90(5-1):052823. doi: 10.1103/PhysRevE.90.052823. Epub 2014 Nov 24.
6
Global migration can lead to stronger spatial selection than local migration.全球迁移可能导致比本地迁移更强的空间选择。
J Stat Phys. 2013 May 1;151(3-4):637-653. doi: 10.1007/s10955-012-0631-6.
7
Spread of risk across financial markets: better to invest in the peripheries.金融市场风险蔓延:投资外围地区更好。
Sci Rep. 2013;3:1665. doi: 10.1038/srep01665.
8
Cascading failures in bi-partite graphs: model for systemic risk propagation.双分支图中的级联失效:系统风险传播模型。
Sci Rep. 2013;3:1219. doi: 10.1038/srep01219. Epub 2013 Feb 5.
9
Strategy selection in structured populations.结构化群体中的策略选择
J Theor Biol. 2009 Aug 7;259(3):570-81. doi: 10.1016/j.jtbi.2009.03.035. Epub 2009 Apr 7.
10
A simple rule for the evolution of cooperation on graphs and social networks.关于图和社交网络上合作演化的一条简单规则。
Nature. 2006 May 25;441(7092):502-5. doi: 10.1038/nature04605.