Narynov Sergazy, Mukhtarkhanuly Daniyar, Omarov Batyrkhan
Alem Research, Almaty, Kazakhstan.
Suleyman Demirel University, Almaty, Kazakhstan.
Data Brief. 2020 Feb 4;29:105195. doi: 10.1016/j.dib.2020.105195. eCollection 2020 Apr.
This paper presents dataset collected from social networks that are mostly used by youth of Commonwealth of Independent States (CIS) countries. The data was collected from public accounts of VKontakte social network by using VK.api and applying the most used keywords that would signify depressive mood. The collected data was classified by psychologists into two types: depressive and non-depressive. The dataset consists of 32 018 depressive posts and 32 021 non-depressive posts. Since the most common language that is spoken in CIS countries is Russian, the posts are written in Russian, consequently the collected data is in Russian language as well. The data can mostly be useful for researchers who explore tendencies to depression in CIS countries. The dataset is important for the research community, as it was not only collected from open sources, but also marked by our psychiatrists from the republican scientific and practical center of mental health. Since the dataset has very high validity, it can be used for further research in the field of mental health.
本文展示了从社交网络收集的数据集,这些社交网络主要被独联体国家的年轻人使用。数据是通过VK.api从VKontakte社交网络的公共账户中收集的,并应用了最常用的表示抑郁情绪的关键词。收集到的数据由心理学家分为两类:抑郁类和非抑郁类。该数据集包括32018条抑郁帖子和32021条非抑郁帖子。由于独联体国家最常用的语言是俄语,帖子是用俄语写的,因此收集到的数据也是俄语的。这些数据对研究独联体国家抑郁倾向的研究人员可能非常有用。该数据集对研究界很重要,因为它不仅从公开来源收集,还由我们共和国心理健康科学与实践中心的精神科医生进行了标注。由于该数据集具有很高的有效性,它可用于心理健康领域的进一步研究。