Qiu Yifu, Qiu Yitao, Yuan Yicong, Chen Zheng, Lee Raymond
Department of Computer Science and Technology, Division of Science and Technology, BNU-HKBU United International College, Zhuhai, China.
Front Artif Intell. 2021 Oct 29;4:749878. doi: 10.3389/frai.2021.749878. eCollection 2021.
Reinforcement Learning (RL)-based machine trading has attracted substantial interest. However, in existing research, RL applied to the day-trading task suffers from noisy financial movements on short time scales, difficulty in order settlement, and expensive action search in a continuous-value space. This paper introduces QF-TraderNet, an end-to-end RL intraday trading agent based on quantum finance theory (QFT) and deep reinforcement learning. We propose a novel design for the intraday RL trader's action space, inspired by Quantum Price Levels (QPLs). This action space design also gives the model a learnable profit-and-loss control strategy. QF-TraderNet comprises two neural networks: 1) a long short-term memory (LSTM) network for feature learning on financial time series; 2) a policy generator network (PGN) for generating the distribution over actions. The profitability and robustness of QF-TraderNet have been verified on multiple types of financial datasets, including FOREX, metals, crude oil, and financial indices. The experimental results demonstrate that QF-TraderNet outperforms other baselines in terms of cumulative price return and Sharpe ratio, and in robustness under accidental market shifts.
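To make the action-space idea concrete, here is a minimal, hypothetical sketch (names, dimensions, and the number of QPLs are illustrative assumptions, not details from the paper): instead of searching a continuous price space, the agent picks from a discrete set of actions anchored at K Quantum Price Levels above and below the current price, with a softmax policy head mapping learned time-series features to a distribution over those actions.

```python
import numpy as np

rng = np.random.default_rng(0)

K = 5                      # QPLs above/below current price (assumed)
n_actions = 2 * K + 1      # open-long/short at each QPL, plus hold

def softmax(z):
    """Numerically stable softmax over action logits."""
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def policy(features, W, b):
    """Map LSTM-style feature vector to a distribution over QPL-anchored actions."""
    return softmax(W @ features + b)

d = 8                                  # feature dimension (assumed)
W = rng.normal(size=(n_actions, d))    # stand-in for learned PGN weights
b = np.zeros(n_actions)
x = rng.normal(size=d)                 # stand-in for LSTM-learned features

probs = policy(x, W, b)                # action distribution, sums to 1
action = int(np.argmax(probs))         # greedy action at evaluation time
```

In training, the action would instead be sampled from `probs` and the policy updated with a policy-gradient method; because each non-hold action is tied to a specific QPL, choosing an action implicitly sets the entry/exit level, which is one way the learnable profit-and-loss control described above could be realized.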