Pan Wenhao, Han Yingying, Li Jinjin, Zhang Emily, He Bikai
School of Public Administration, South China University of Technology, Guangzhou, China.
School of Psychology, Guizhou Normal University, Guiyang, China.
Curr Psychol. 2022 Nov 3:1-18. doi: 10.1007/s12144-022-03876-4.
The outbreak of COVID-19 has led to a global health crisis and caused huge emotional swings. However, the positive emotional expressions, like self-confidence, optimism, and praise, that appear in Chinese social networks are rarely explored by researchers. This study aims to analyze the characteristics of netizens' positive energy expressions and the impact of node events on public emotional expression during the COVID-19 pandemic. First, a total of 6,525,249 Chinese texts posted by Sina Weibo users were randomly selected through textual data cleaning and word segmentation for corpus construction. A fine-grained sentiment lexicon that contained was built using Word2Vec technology; this lexicon was later used to conduct sentiment category analysis on original posts. Next, through manual labeling and multi-classification machine learning model construction, four mainstream machine learning algorithms were selected to train the emotional intensity model. Finally, the lexicon and optimized emotional intensity model were used to analyze the emotional expressions of Chinese netizens. The results show that expression accounted for 40.97% during the COVID-19 pandemic. Over the course of time, emotions were displayed at the highest levels and the lowest. The analysis results of the node events showed after the outbreak was confirmed officially, the expressions of and increased simultaneously. After the initial victory in pandemic prevention and control, the expression of and reached a peak, while the increase of was the most prominent. The fine-grained sentiment lexicon, which includes a category, demonstrated reliable algorithm performance and can be used for sentiment classification of Chinese Internet context. We also found many expressions in Chinese online social platforms which are proven to be significantly affected by nod events of different nature.
新型冠状病毒肺炎(COVID-19)疫情引发了全球健康危机,并造成了巨大的情绪波动。然而,中国社交网络中出现的自信、乐观和赞扬等积极情绪表达却很少被研究人员探讨。本研究旨在分析新冠疫情期间网民正能量表达的特征以及节点事件对公众情绪表达的影响。首先,通过文本数据清理和分词,从新浪微博用户发布的6525249条中文文本中随机选取,用于构建语料库。利用Word2Vec技术构建了一个包含[具体内容缺失]的细粒度情感词典;该词典随后用于对原始帖子进行情感类别分析。接下来,通过人工标注和多分类机器学习模型构建,选择四种主流机器学习算法来训练情感强度模型。最后,利用该词典和优化后的情感强度模型对中国网民的情绪表达进行分析。结果表明,在新冠疫情期间,[具体情绪缺失]表达占40.97%。随着时间的推移,[具体情绪缺失]情绪出现的频率最高,[具体情绪缺失]最低。节点事件的分析结果显示,疫情正式确诊后,[具体情绪缺失]和[具体情绪缺失]的表达同时增加。在疫情防控取得初步胜利后,[具体情绪缺失]和[具体情绪缺失]的表达达到峰值,而[具体情绪缺失]的增加最为显著。包含[具体类别缺失]类别的细粒度情感词典表现出可靠的算法性能,可用于中国互联网语境下的情感分类。我们还在中国在线社交平台上发现了许多[具体情绪缺失]表达,事实证明这些表达受到不同性质节点事件的显著影响。