Tan Zheng, Yan Ziqin, Zhu Guangwei
Xiyuan Hedge Fund, 388 Yizhou Road, Chengdu, Sichuan, PR China.
Institute of Chinese Financial Studies, Southwestern University of Finance and Economics, 555 Liutai Avenue, Wenjiang District, Chengdu, Sichuan, PR China.
Heliyon. 2019 Aug 17;5(8):e02310. doi: 10.1016/j.heliyon.2019.e02310. eCollection 2019 Aug.
In recent years, a variety of research fields, including finance, have begun to place great emphasis on machine learning techniques because they exhibit broad abilities to simulate more complicated problems. In contrast to the traditional linear regression scheme that is usually used to describe the relationship between the stock forward return and company characteristics, the field of finance has experienced the rapid development of tree-based algorithms and neural network paradigms when illustrating complex stock dynamics. These nonlinear methods have proved to be effective in predicting stock prices and selecting stocks that can outperform the general market. This article implements and evaluates the robustness of the random forest (RF) model in the context of the stock selection strategy. The model is trained for stocks in the Chinese stock market, and two types of feature spaces, fundamental/technical feature space and pure momentum feature space, are adopted to forecast the price trend in the long run and the short run, respectively. It is evidenced that both feature paradigms have led to remarkable excess returns during the past five out-of-sample period years, with the Sharpe ratios calculated to be 2.75 and 5 for the portfolio net value of the multi-factor space strategy and momentum space strategy, respectively. Although the excess return has weakened in recent years with respect to the multi-factor strategy, our findings point to a less efficient market that is far from equilibrium.
近年来,包括金融在内的各种研究领域开始高度重视机器学习技术,因为它们展现出模拟更复杂问题的广泛能力。与通常用于描述股票远期回报与公司特征之间关系的传统线性回归方法不同,在阐释复杂的股票动态时,金融领域经历了基于树的算法和神经网络范式的快速发展。这些非线性方法已被证明在预测股票价格和挑选能够超越整体市场表现的股票方面是有效的。本文在选股策略的背景下实现并评估了随机森林(RF)模型的稳健性。该模型针对中国股票市场的股票进行训练,并且分别采用两种类型的特征空间,即基本面/技术面特征空间和纯动量特征空间,来预测长期和短期的价格趋势。事实证明,在过去五年的样本外期间,这两种特征范式均带来了显著的超额回报,多因素空间策略和动量空间策略的投资组合净值的夏普比率分别计算为2.75和5。尽管近年来多因素策略的超额回报有所减弱,但我们的研究结果表明市场效率较低且远未达到均衡状态。