Suppr超能文献

印度尼西亚总统选举舆情:2024年之前公众回应数据集。

Indonesian presidential election sentiment: Dataset of response public before 2024.

作者信息

Firdaus Asno Azzawagama, Yudhana Anton, Riadi Imam

机构信息

Master Program of Informatics, Universitas Ahmad Dahlan, Yogyakarta 55166, Indonesia.

Department of Electrical Engineering, Universitas Ahmad Dahlan, Yogyakarta 55166, Indonesia.

出版信息

Data Brief. 2023 Dec 19;52:109993. doi: 10.1016/j.dib.2023.109993. eCollection 2024 Feb.

Abstract

Indonesia is one of the countries that is currently entering the political year for the election of President, Regional Heads, and Members of the Legislative in 2024. This has become a hot topic on social media, especially about the Presidential Election. Twitter is one of the platforms with the largest users in Indonesia. It is interesting to see the alignment of Twitter users towards presidential candidates who already have a carrying party, namely Ganjar Pranowo, Prabowo Subianto, and Anies Baswedan based on a sentiment analysis approach. User feedback data about Indonesian Presidential candidates are obtained from the Twitter platform using Twitter API with Python programming language. The data obtained was 30,000 data with each candidate as many as 10,000 data. Data is pulled in April 2023 with specific keywords. The time for data withdrawal is chosen based on the announcement of Presidential Candidates carried by political parties before the schedule for determining or campaigning for Presidential candidates. Current data can potentially be used again as a comparison of analysis of presidential candidates on campaign time spans and after campaigns or actual calculation results. The data that can be accessed is in CSV format and has gone through several stages such as labelling using Language experts, removing spam Tweets & empty cells and preprocessing.

摘要

印度尼西亚是目前正进入2024年总统、地区首长和立法机构成员选举政治年的国家之一。这已成为社交媒体上的热门话题,尤其是关于总统选举。推特是印度尼西亚用户最多的平台之一。基于情感分析方法,观察推特用户对已有支持政党的总统候选人,即甘贾尔·普拉诺沃、普拉博沃·苏比安托和安妮丝·巴斯韦丹的支持情况很有意思。关于印度尼西亚总统候选人的用户反馈数据是使用Python编程语言通过推特应用程序编程接口从推特平台获取的。获得的数据有30000条,每个候选人各有10000条数据。数据于2023年4月通过特定关键词提取。数据提取时间是根据政党公布总统候选人的时间确定的,在确定或竞选总统候选人的时间表之前。当前数据有可能再次用作竞选时间段以及竞选后或实际计算结果的总统候选人分析比较。可访问的数据为CSV格式,并且已经历了几个阶段,如由语言专家进行标注、去除垃圾推文和空单元格以及预处理。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47ad/10788203/948b361f624e/gr1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验