Institute of Engineering of Porto (ISEP/P.PORTO), Polytechnic of Porto, Rua Dr. António Bernardino de Almeida n\({^\underline{\circ}}\) 431, 4200-072 Porto, Portugal.
Interdisciplinary Studies Research Center (ISRC), ISEP/P.PORTO, 4249-015 Porto, Portugal.
Sensors (Basel). 2022 Jun 10;22(12):4409. doi: 10.3390/s22124409.
The volume of data is growing exponentially and becoming more valuable to organizations that collect it, from e-commerce data, shipping, audio and video logs, text messages, internet search queries, stock market activity, financial transactions, the Internet of Things, and various other sources. The major challenges are related with the way to extract insights from such a rich data environment and whether Deep Learning can be successful with Big Data. To get some insight on these topics, social network data are employed as a case study on how sentiments can affect decisions in stock market environments. In this paper, we propose a generalized Deep Learning-based classification framework for Stock Market Sentiment Analysis. This work comprises the study, the development, and implementation of an automatic classification system based on Deep Learning and the validation of its adequacy and efficiency in any scenario, particularly Stock Market Sentiment Analysis. Distinct datasets and several Deep Learning approaches with different layers and embedded techniques are used, and their performances are evaluated. These developments show how Deep Learning reacts to distinct contexts. The results also give context on how different techniques with different parameter combinations react to certain types of data. Convolution obtained the best results when dealing with complex data inputs, and long short-term layers kept a memory of data, allowing inputs which are not as common to still be considered for decisions. The models that resulted from Stock Market Sentiment Analysis datasets were applied with some success to real-life problems. The best models reached accuracies of 73% in training and 69% in certain test datasets. In a simulation, a model was able to provide a Return on Investment of 4.4%. The results contribute to understanding how to process Big Data efficiently using Deep Learning and specialized hardware techniques.
数据量呈指数级增长,对于收集数据的组织来说变得越来越有价值,这些数据来自电子商务数据、航运、音频和视频日志、文本消息、互联网搜索查询、股票市场活动、金融交易、物联网以及其他各种来源。主要挑战涉及从如此丰富的数据环境中提取见解的方法,以及深度学习是否可以成功处理大数据。为了深入了解这些主题,本文将社交网络数据用作案例研究,探讨情绪如何影响股票市场环境中的决策。在本文中,我们提出了一种基于深度学习的股票市场情绪分析的通用分类框架。这项工作包括研究、开发和实施基于深度学习的自动分类系统,并验证其在任何场景(特别是股票市场情绪分析)中的充分性和效率。使用了不同的数据集和具有不同层和嵌入式技术的几种深度学习方法,并对其性能进行了评估。这些发展展示了深度学习如何应对不同的环境。结果还说明了具有不同参数组合的不同技术如何对某些类型的数据做出反应。在处理复杂数据输入时,卷积获得了最佳结果,而长短时记忆层保留了数据的记忆,允许考虑不太常见的输入进行决策。从股票市场情绪分析数据集得出的模型在解决实际问题方面取得了一定的成功。最佳模型在训练中的准确率达到 73%,在某些测试数据集中达到 69%。在模拟中,一个模型能够提供 4.4%的投资回报率。这些结果有助于理解如何使用深度学习和专门的硬件技术高效地处理大数据。