• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于套期保值投资组合问题的多智能体强化学习方法。

Multi-agent reinforcement learning approach for hedging portfolio problem.

作者信息

Pham Uyen, Luu Quoc, Tran Hien

机构信息

Economic Mathematics, University of Economics and Law, Ho Chi Minh City, Vietnam.

Quantitative and Computational Finance, John von Neumann Institute, Ho Chi Minh City, Vietnam.

出版信息

Soft comput. 2021;25(12):7877-7885. doi: 10.1007/s00500-021-05801-6. Epub 2021 Apr 19.

DOI:10.1007/s00500-021-05801-6
PMID:33897298
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8054257/
Abstract

Developing a hedging strategy to reduce risk of losses for a given set of stocks in a portfolio is a difficult task due to cost of the hedge. In Vietnam stock market, cross-hedge is involved hedging a long position of a stock because there is no put option for the stock. In addition, only VN30 stock index futures contracts are traded on Hanoi Stock Exchange. Inspired by recently achievement of deep reinforcement learning, we explore feasibility to construct a hedging strategy automatically by leveraging cooperative multi-agent in reinforcement learning techniques without advanced domain knowledge. In this work, we use 10 popular stocks on Ho Chi Minh Stock Exchange, and VN30F1M (VN30 Index Futures contracts within one month settlement) to develop a stock market simulator (including transaction fee, tax, and settlement date of transactions) for reinforcement learning agent training. We use daily return as input data for training process. Results suggest that the agent can learn trading and hedging policy to make profit and reduce losses. Furthermore, we also find that our agent can protect portfolios and make positive profit in case market collapses systematically. In practice, this work can help Vietnam's stock market investors to improve performance and reduce losses in trading, especially when the volatility cannot be controlled.

摘要

由于套期保值成本,为投资组合中的特定股票集制定套期保值策略以降低损失风险是一项艰巨的任务。在越南股票市场,交叉套期保值涉及对股票的多头头寸进行套期保值,因为该股票没有看跌期权。此外,河内证券交易所仅交易VN30股指期货合约。受近期深度强化学习成果的启发,我们探索了在无需高级领域知识的情况下,利用强化学习技术中的合作多智能体自动构建套期保值策略的可行性。在这项工作中,我们使用胡志明证券交易所的10只热门股票以及VN30F1M(一个月内结算的VN30股指期货合约)来开发一个股票市场模拟器(包括交易费用、税费和交易结算日期),用于强化学习智能体训练。我们将日回报率用作训练过程的输入数据。结果表明,智能体可以学习交易和套期保值策略以实现盈利并减少损失。此外,我们还发现,在市场系统性崩溃的情况下,我们的智能体可以保护投资组合并实现正利润。在实践中,这项工作可以帮助越南股票市场投资者提高交易绩效并减少损失,尤其是在波动率无法控制的情况下。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55bf/8054257/de3e7711adb8/500_2021_5801_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55bf/8054257/037176eaa4a6/500_2021_5801_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55bf/8054257/8141da3c64dc/500_2021_5801_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55bf/8054257/75bc079ebaed/500_2021_5801_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55bf/8054257/de3e7711adb8/500_2021_5801_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55bf/8054257/037176eaa4a6/500_2021_5801_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55bf/8054257/8141da3c64dc/500_2021_5801_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55bf/8054257/75bc079ebaed/500_2021_5801_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55bf/8054257/de3e7711adb8/500_2021_5801_Fig4_HTML.jpg

相似文献

1
Multi-agent reinforcement learning approach for hedging portfolio problem.用于套期保值投资组合问题的多智能体强化学习方法。
Soft comput. 2021;25(12):7877-7885. doi: 10.1007/s00500-021-05801-6. Epub 2021 Apr 19.
2
Is It Possible to Earn Abnormal Return in an Inefficient Market? An Approach Based on Machine Learning in Stock Trading.在非有效市场中是否有可能获得异常收益?基于机器学习的股票交易方法。
Comput Intell Neurosci. 2021 Dec 8;2021:2917577. doi: 10.1155/2021/2917577. eCollection 2021.
3
Dynamic stock-decision ensemble strategy based on deep reinforcement learning.基于深度强化学习的动态库存决策集成策略
Appl Intell (Dordr). 2023;53(2):2452-2470. doi: 10.1007/s10489-022-03606-0. Epub 2022 May 9.
4
The impact of oil and gold price fluctuations on the South African equity market: Volatility spillovers and financial policy implications.石油和黄金价格波动对南非股票市场的影响:波动溢出效应及金融政策含义。
Resour Policy. 2020 Oct;68:101740. doi: 10.1016/j.resourpol.2020.101740. Epub 2020 Jun 17.
5
The dynamics of volatility spillovers between oil prices and stock market returns at the sector level and hedging strategies: evidence from Pakistan.油价与股票市场回报率在部门层面的波动溢出动态及对冲策略:来自巴基斯坦的证据。
Environ Sci Pollut Res Int. 2020 Aug;27(24):30706-30715. doi: 10.1007/s11356-020-09351-6. Epub 2020 May 29.
6
Gold and US sectoral stocks during COVID-19 pandemic.新冠疫情期间的黄金与美国行业股票。
Res Int Bus Finance. 2021 Oct;57:101424. doi: 10.1016/j.ribaf.2021.101424. Epub 2021 May 5.
7
An LSTM and GRU based trading strategy adapted to the Moroccan market.一种基于长短期记忆网络(LSTM)和门控循环单元(GRU)的、适用于摩洛哥市场的交易策略。
J Big Data. 2021;8(1):126. doi: 10.1186/s40537-021-00512-z. Epub 2021 Sep 24.
8
Hedging the Risks of MENA Stock Markets with Gold: Evidence from the Spectral Approach.用黄金对冲中东和北非地区股票市场风险:来自频谱方法的证据。
Comput Econ. 2023;61(1):197-231. doi: 10.1007/s10614-021-10204-8. Epub 2021 Oct 2.
9
From deterministic to stochastic: an interpretable stochastic model-free reinforcement learning framework for portfolio optimization.从确定性到随机性:一种用于投资组合优化的可解释的无模型随机强化学习框架。
Appl Intell (Dordr). 2023;53(12):15188-15203. doi: 10.1007/s10489-022-04217-5. Epub 2022 Nov 11.
10
Does bitcoin provide hedge to Islamic stock markets for pre- and during COVID-19 outbreak? A comparative analysis with gold.比特币在新冠疫情爆发前及期间是否为伊斯兰股票市场提供了避险作用?与黄金的比较分析。
Resour Policy. 2021 Dec;74:102407. doi: 10.1016/j.resourpol.2021.102407. Epub 2021 Oct 11.

引用本文的文献

1
Stock market trading actor-critic reinforcement learning and adaptable data structure.股票市场交易的演员-评论家强化学习与适应性数据结构。
PeerJ Comput Sci. 2025 Feb 18;11:e2690. doi: 10.7717/peerj-cs.2690. eCollection 2025.
2
Explainable post hoc portfolio management financial policy of a Deep Reinforcement Learning agent.深度强化学习智能体的可解释事后投资组合管理财务策略。
PLoS One. 2025 Jan 16;20(1):e0315528. doi: 10.1371/journal.pone.0315528. eCollection 2025.