Manzoor Muhammad Faraz, Abid Adnan, Nawaz Naeem A, Alvi Atif
Department of Computer Science, University of Management and Technology, Lahore, Pakistan.
Department of Computer Science, Virtual University of Pakistan, Lahore, Pakistan.
Data Brief. 2022 May 17;42:108293. doi: 10.1016/j.dib.2022.108293. eCollection 2022 Jun.
The dataset presented in this paper was obtained from the top online automobile selling and purchasing websites. A total of 1000 text reviews of hybrid cars were extracted with the Web Scraper tool. The dataset captures customers' sentiments about hybrid cars as expressed in these reviews. Several aspects were considered while annotating the reviews: driving, performance, comfort, safety features, interior, exterior, and accessories. Three annotators annotated the data at three levels: (1) the overall polarity of a review, (2) segregation of the sentence term in which an aspect is discussed, and (3) the polarity of the discussed aspect. A Cohen's Kappa score of 0.90 was achieved among the authors while annotating the reviews. The dataset can be used for sentiment analysis, information retrieval, lexicon analysis, and grammatical and morphological analysis.
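For readers unfamiliar with the agreement measure quoted above, the following is a minimal sketch of how Cohen's Kappa can be computed for one pair of annotators. It is an illustration only, not the authors' procedure: the function name `cohen_kappa` and the example labels are hypothetical, and the abstract does not say how the pairwise scores of the three annotators were combined into the reported 0.90.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's Kappa for two annotators labelling the same items.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement
    and p_e is the agreement expected by chance from each annotator's
    label frequencies.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")

    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement: sum over labels of the product of the two
    # annotators' marginal frequencies for that label.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)

    # Both annotators used a single identical label throughout.
    if expected == 1.0:
        return 1.0

    return (observed - expected) / (1 - expected)


# Hypothetical polarity labels for four reviews (not taken from the dataset).
annotator_1 = ["positive", "positive", "negative", "negative"]
annotator_2 = ["positive", "positive", "negative", "positive"]
print(cohen_kappa(annotator_1, annotator_2))
```

Because Cohen's Kappa is defined for two raters, a study with three annotators would typically report it per pair or as an average over pairs; which of these applies here is not stated in the abstract.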