• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

情感编码:多语言情感分析中一次性训练和全局预测的新范式。

SentiCode: A new paradigm for one-time training and global prediction in multilingual sentiment analysis.

作者信息

Kanfoud Mohamed Raouf, Bouramoul Abdelkrim

机构信息

MISC Laboratory, Constantine 2 University Abdelhamid Mehri, Constantine, 25000 Algeria.

出版信息

J Intell Inf Syst. 2022;59(2):501-522. doi: 10.1007/s10844-022-00714-8. Epub 2022 May 25.

DOI:10.1007/s10844-022-00714-8
PMID:35645462
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9130974/
Abstract

The main objective of multilingual sentiment analysis is to analyze reviews regardless of the original language in which they are written. Switching from one language to another is very common on social media platforms. Analyzing these multilingual reviews is a challenge since each language is different in terms of syntax, grammar, etc. This paper presents a new language-independent representation approach for sentiment analysis, SentiCode. Unlike previous work in multilingual sentiment analysis, the proposed approach does not rely on machine translation to bridge the gap between different languages. Instead, it exploits common features of languages, such as part-of-speech tags used in Universal Dependencies. Equally important, SentiCode enables sentiment analysis in multi-language and multi-domain environments simultaneously. Several experiments were conducted using machine/deep learning techniques to evaluate the performance of SentiCode in multilingual (English, French, German, Arabic, and Russian) and multi-domain environments. In addition, the vocabulary proposed by SentiCode and the effect of each token were evaluated by the ablation method. The results highlight the 70% accuracy of SentiCode, with the best trade-off between efficiency and computing time (training and testing) in a total of about 0.67 seconds, which is very convenient for real-time applications.

摘要

多语言情感分析的主要目标是分析评论,而不考虑其原始语言。在社交媒体平台上,从一种语言切换到另一种语言是很常见的。分析这些多语言评论是一项挑战,因为每种语言在句法、语法等方面都有所不同。本文提出了一种用于情感分析的新的独立于语言的表示方法——SentiCode。与之前在多语言情感分析方面的工作不同,该方法不依赖机器翻译来弥合不同语言之间的差距。相反,它利用语言的共同特征,如通用依存关系中使用的词性标注。同样重要的是,SentiCode能够同时在多语言和多领域环境中进行情感分析。使用机器学习/深度学习技术进行了多项实验,以评估SentiCode在多语言(英语、法语、德语、阿拉伯语和俄语)和多领域环境中的性能。此外,还通过消融方法评估了SentiCode提出的词汇表以及每个词元的效果。结果显示SentiCode的准确率为70%,在效率和计算时间(训练和测试)之间达到了最佳平衡,总共约0.67秒,这对于实时应用非常方便。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/4ecad4de6434/10844_2022_714_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/ea0af461d0b5/10844_2022_714_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/522479f57daa/10844_2022_714_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/743d728385d1/10844_2022_714_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/b1a11eafaace/10844_2022_714_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/00b0802d7cc2/10844_2022_714_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/09c3721b0811/10844_2022_714_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/4ecad4de6434/10844_2022_714_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/ea0af461d0b5/10844_2022_714_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/522479f57daa/10844_2022_714_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/743d728385d1/10844_2022_714_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/b1a11eafaace/10844_2022_714_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/00b0802d7cc2/10844_2022_714_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/09c3721b0811/10844_2022_714_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e64d/9130974/4ecad4de6434/10844_2022_714_Fig7_HTML.jpg

相似文献

1
SentiCode: A new paradigm for one-time training and global prediction in multilingual sentiment analysis.情感编码:多语言情感分析中一次性训练和全局预测的新范式。
J Intell Inf Syst. 2022;59(2):501-522. doi: 10.1007/s10844-022-00714-8. Epub 2022 May 25.
2
Multi-class sentiment analysis of urdu text using multilingual BERT.使用多语言 BERT 进行乌尔都语文本的多类情感分析。
Sci Rep. 2022 Mar 31;12(1):5436. doi: 10.1038/s41598-022-09381-9.
3
A multimodal approach to cross-lingual sentiment analysis with ensemble of transformer and LLM.一种结合Transformer和大语言模型集成的跨语言情感分析多模态方法。
Sci Rep. 2024 Apr 26;14(1):9603. doi: 10.1038/s41598-024-60210-7.
4
Heterogeneous Ensemble Deep Learning Model for Enhanced Arabic Sentiment Analysis.用于增强阿拉伯语情感分析的异质集成深度学习模型。
Sensors (Basel). 2022 May 12;22(10):3707. doi: 10.3390/s22103707.
5
Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models.多语言环境下的情感分析:机器学习与混合深度学习模型的比较分析
Heliyon. 2023 Sep 19;9(9):e20281. doi: 10.1016/j.heliyon.2023.e20281. eCollection 2023 Sep.
6
Heterogeneous text graph for comprehensive multilingual sentiment analysis: capturing short- and long-distance semantics.用于综合多语言情感分析的异构文本图:捕捉短距离和长距离语义
PeerJ Comput Sci. 2024 Feb 23;10:e1876. doi: 10.7717/peerj-cs.1876. eCollection 2024.
7
Deep learning based sentiment analysis and offensive language identification on multilingual code-mixed data.基于深度学习的多语言混合数据情感分析和攻击性语言识别。
Sci Rep. 2022 Dec 13;12(1):21557. doi: 10.1038/s41598-022-26092-3.
8
An ensemble deep learning classifier for sentiment analysis on code-mix Hindi-English data.一种用于印地语-英语代码混合数据情感分析的集成深度学习分类器。
Soft comput. 2022 Apr 23:1-18. doi: 10.1007/s00500-022-07091-y.
9
Quantum computing and machine learning for Arabic language sentiment classification in social media.量子计算和机器学习在社交媒体中对阿拉伯语情感分类的应用。
Sci Rep. 2023 Oct 12;13(1):17305. doi: 10.1038/s41598-023-44113-7.
10
Multilingual text categorization and sentiment analysis: a comparative analysis of the utilization of multilingual approaches for classifying twitter data.多语言文本分类与情感分析:对用于推特数据分类的多语言方法利用情况的比较分析。
Neural Comput Appl. 2023 May 8:1-17. doi: 10.1007/s00521-023-08629-3.

引用本文的文献

1
Multilingual deep learning framework for fake news detection using capsule neural network.使用胶囊神经网络的多语言假新闻检测深度学习框架。
J Intell Inf Syst. 2023 May 9:1-17. doi: 10.1007/s10844-023-00788-y.