Building and evaluating resources for sentiment analysis in the Greek language.

Authors

Tsakalidis Adam, Papadopoulos Symeon, Voskaki Rania, Ioannidou Kyriaki, Boididou Christina, Cristea Alexandra I, Liakata Maria, Kompatsiaris Yiannis

Affiliations

Department of Computer Science, University of Warwick, Coventry, UK.

The Alan Turing Institute, London, UK.

Publication information

Lang Resour Eval. 2018;52(4):1021-1044. doi: 10.1007/s10579-018-9420-4. Epub 2018 Jul 14.

DOI: 10.1007/s10579-018-9420-4
PMID: 30930705
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6411313/
Abstract

Sentiment lexicons and word embeddings constitute well-established sources of information for sentiment analysis in online social media. Although their effectiveness has been demonstrated in state-of-the-art sentiment analysis and related tasks in the English language, such publicly available resources are much less developed and evaluated for the Greek language. In this paper, we tackle the problems arising when analyzing text in such an under-resourced language. We present and make publicly available a rich set of such resources, ranging from a manually annotated lexicon, to semi-supervised word embedding vectors and annotated datasets for different tasks. Our experiments using different algorithms and parameters on our resources show promising results over standard baselines; on average, we achieve a 24.9% relative improvement in F-score on the cross-domain sentiment analysis task when training the same algorithms with our resources, compared to training them on more traditional feature sources, such as n-grams. Importantly, while our resources were built with the primary focus on the cross-domain sentiment analysis task, they also show promising results in related tasks, such as emotion analysis and sarcasm detection.
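
The "relative improvement in F-score" reported in the abstract can be read as the change in F-score divided by the baseline F-score. The following is a minimal sketch of that arithmetic, not the authors' evaluation code; the function names and the precision/recall values are hypothetical and are not taken from the paper.

```python
def f_score(precision: float, recall: float, beta: float = 1.0) -> float:
    """F-beta score from precision and recall (F1 when beta == 1)."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def relative_improvement(baseline: float, new: float) -> float:
    """Relative change of `new` over `baseline`, returned as a fraction."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (new - baseline) / baseline


# Hypothetical values for illustration only (NOT results from the paper).
baseline_f = f_score(precision=0.50, recall=0.50)  # e.g. an n-gram baseline
resource_f = f_score(precision=0.60, recall=0.60)  # e.g. lexicon/embedding features
gain = relative_improvement(baseline_f, resource_f)
print(f"relative improvement: {gain:.1%}")
```

Under this reading, a relative improvement is distinct from an absolute one: moving from 0.50 to 0.60 is a 0.10 absolute gain but a 20% relative gain.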

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db2d/6411313/f0998e6b66f5/10579_2018_9420_Fig1_HTML.jpg

Similar articles

1. Building and evaluating resources for sentiment analysis in the Greek language.
Lang Resour Eval. 2018;52(4):1021-1044. doi: 10.1007/s10579-018-9420-4. Epub 2018 Jul 14.
2. BengSentiLex and BengSwearLex: creating lexicons for sentiment analysis and profanity detection in low-resource Bengali language.
PeerJ Comput Sci. 2021 Nov 16;7:e681. doi: 10.7717/peerj-cs.681. eCollection 2021.
3. A new word embedding model integrated with medical knowledge for deep learning-based sentiment classification.
Artif Intell Med. 2024 Feb;148:102758. doi: 10.1016/j.artmed.2023.102758. Epub 2024 Jan 8.
4. Automatic Construction and Global Optimization of a Multisentiment Lexicon.
Comput Intell Neurosci. 2016;2016:2093406. doi: 10.1155/2016/2093406. Epub 2016 Nov 29.
5. Building lexicon-based sentiment analysis model for low-resource languages.
MethodsX. 2023 Oct 22;11:102460. doi: 10.1016/j.mex.2023.102460. eCollection 2023 Dec.
6. Semi-supervised distributed representations of documents for sentiment analysis.
Neural Netw. 2019 Nov;119:139-150. doi: 10.1016/j.neunet.2019.08.001. Epub 2019 Aug 6.
7. Multi-class sentiment analysis of urdu text using multilingual BERT.
Sci Rep. 2022 Mar 31;12(1):5436. doi: 10.1038/s41598-022-09381-9.
8. Neural Networks with Emotion Associations, Topic Modeling and Supervised Term Weighting for Sentiment Analysis.
Int J Neural Syst. 2021 Oct;31(10):2150013. doi: 10.1142/S0129065721500131. Epub 2021 Feb 10.
9. An Improved BERT and Syntactic Dependency Representation Model for Sentiment Analysis.
Comput Intell Neurosci. 2022 May 5;2022:5754151. doi: 10.1155/2022/5754151. eCollection 2022.
10. A comparison of word embeddings for the biomedical natural language processing.
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.

Cited by

1. Reliability Analysis of Psychological Concept Extraction and Classification in User-penned Text.
Proc Int AAAI Conf Weblogs Soc Media. 2024 May 31;18:422-434. doi: 10.1609/icwsm.v18i1.31324. Epub 2024 May 28.
2. Sentiment analysis of COVID-19 cases in Greece using Twitter data.
Expert Syst Appl. 2023 Nov 15;230:120577. doi: 10.1016/j.eswa.2023.120577. Epub 2023 Jun 7.