Aspect based sentence segregated dataset of hybrid car's consumers online reviews.

Author information

Manzoor Muhammad Faraz, Abid Adnan, Nawaz Naeem A, Alvi Atif

Affiliations

Department of Computer Science, University of Management and Technology, Lahore, Pakistan.

Department of Computer Science, Virtual University of Pakistan, Lahore, Pakistan.

Publication information

Data Brief. 2022 May 17;42:108293. doi: 10.1016/j.dib.2022.108293. eCollection 2022 Jun.

Abstract

The dataset presented in this paper was obtained from leading online automobile selling and purchasing websites. A total of 1000 text reviews of hybrid cars were extracted with the Web Scraper tool. The reviews capture customers' sentiments about hybrid cars. Several aspects were considered while annotating the reviews: driving, performance, comfort, safety features, interior, exterior, and accessories. Three annotators annotated the data at three levels: (1) the overall polarity of a review, (2) segregation of the sentence in which an aspect is discussed, and (3) the polarity of the discussed aspect. A Cohen's Kappa score of 0.90 was achieved among the authors while annotating the reviews. The dataset can be used for sentiment analysis, information retrieval, lexicon analysis, and grammatical and morphological analysis.
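For readers unfamiliar with the agreement measure mentioned above, the following is a minimal sketch of Cohen's Kappa for one pair of annotators. Cohen's Kappa is defined for two raters; the abstract does not say how the score was combined across three annotators, so a pairwise computation is only an assumption here. The function name `cohen_kappa` and the polarity labels are hypothetical and are not taken from the dataset.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two equal-length, non-empty label lists")
    n = len(labels_a)
    # Observed agreement: fraction of items given identical labels.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement from each annotator's marginal label frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if p_e == 1.0:
        return 1.0  # both annotators used one and the same label throughout
    return (p_o - p_e) / (1 - p_e)


# Hypothetical review-level polarity labels (illustration only).
annotator_1 = ["pos", "pos", "neg", "neg", "pos", "neg", "pos", "neg", "pos", "pos"]
annotator_2 = ["pos", "pos", "neg", "neg", "pos", "neg", "pos", "pos", "pos", "pos"]

kappa = cohen_kappa(annotator_1, annotator_2)
print(round(kappa, 3))
```

With three annotators, one option would be to compute this value for each of the three annotator pairs and report the mean; whether the authors did so is not stated.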

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9343/9142620/6b1e868a2418/gr1.jpg
