• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多语言环境下的情感分析:机器学习与混合深度学习模型的比较分析

Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models.

作者信息

Das Rajesh Kumar, Islam Mirajul, Hasan Md Mahmudul, Razia Sultana, Hassan Mocksidul, Khushbu Sharun Akter

机构信息

Department of Computer Science and Engineering, Daffodil International University, Dhaka 1341, Bangladesh.

Faculty of Graduate Studies, Daffodil International University, Dhaka 1341, Bangladesh.

出版信息

Heliyon. 2023 Sep 19;9(9):e20281. doi: 10.1016/j.heliyon.2023.e20281. eCollection 2023 Sep.

DOI:10.1016/j.heliyon.2023.e20281
PMID:37809397
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10560063/
Abstract

This research paper investigates the efficacy of various machine learning models, including deep learning and hybrid models, for text classification in the English and Bangla languages. The study focuses on sentiment analysis of comments from a popular Bengali e-commerce site, "DARAZ," which comprises both Bangla and translated English reviews. The primary objective of this study is to conduct a comparative analysis of various models, evaluating their efficacy in the domain of sentiment analysis. The research methodology includes implementing seven machine learning models and deep learning models, such as Long Short-Term Memory (LSTM), Bidirectional LSTM (Bi-LSTM), Convolutional 1D (Conv1D), and a combined Conv1D-LSTM. Preprocessing techniques are applied to a modified text set to enhance model accuracy. The major conclusion of the study is that Support Vector Machine (SVM) models exhibit superior performance compared to other models, achieving an accuracy of 82.56% for English text sentiment analysis and 86.43% for Bangla text sentiment analysis using the porter stemming algorithm. Additionally, the Bi-LSTM Based Model demonstrates the best performance among the deep learning models, achieving an accuracy of 78.10% for English text and 83.72% for Bangla text using porter stemming. This study signifies significant progress in natural language processing research, particularly for Bangla, by enhancing improved text classification models and methodologies. The results of this research make a significant contribution to the field of sentiment analysis and offer valuable insights for future research and practical applications.

摘要

本研究论文探讨了包括深度学习模型和混合模型在内的各种机器学习模型在英文和孟加拉语文本分类中的有效性。该研究聚焦于对一家热门孟加拉语电子商务网站“Daraz”上的评论进行情感分析,该网站既有孟加拉语评论,也有英文翻译评论。本研究的主要目的是对各种模型进行比较分析,评估它们在情感分析领域的有效性。研究方法包括实现七种机器学习模型和深度学习模型,如长短期记忆网络(LSTM)、双向长短期记忆网络(Bi-LSTM)、一维卷积网络(Conv1D)以及组合的Conv1D-LSTM。预处理技术应用于经过修改的文本集以提高模型准确性。该研究的主要结论是,支持向量机(SVM)模型相比其他模型表现更优,使用波特词干提取算法时,英文文本情感分析的准确率达到82.56%,孟加拉语文本情感分析的准确率达到86.43%。此外,基于Bi-LSTM的模型在深度学习模型中表现最佳,使用波特词干提取算法时,英文文本的准确率达到78.10%,孟加拉语文本的准确率达到83.72%。本研究通过改进文本分类模型和方法,在自然语言处理研究中,尤其是在孟加拉语方面取得了显著进展。本研究结果对情感分析领域做出了重大贡献,并为未来研究和实际应用提供了有价值的见解。

相似文献

1
Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models.多语言环境下的情感分析:机器学习与混合深度学习模型的比较分析
Heliyon. 2023 Sep 19;9(9):e20281. doi: 10.1016/j.heliyon.2023.e20281. eCollection 2023 Sep.
2
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
3
Predicting patients' sentiments about medications using artificial intelligence techniques.使用人工智能技术预测患者对药物的看法。
Sci Rep. 2024 Dec 30;14(1):31928. doi: 10.1038/s41598-024-83222-9.
4
Wood Waste Valorization and Classification Approaches: A systematic review.木材废料的增值与分类方法:一项系统综述
Open Res Eur. 2025 May 6;5:5. doi: 10.12688/openreseurope.18862.1. eCollection 2025.
5
Introducing the dataset for measuring centrality for sustainability-A case study of Pecinci municipality, Serbia.介绍用于衡量可持续性中心性的数据集——以塞尔维亚佩钦奇市为例
Data Brief. 2025 May 27;61:111714. doi: 10.1016/j.dib.2025.111714. eCollection 2025 Aug.
6
Molecular feature-based classification of retroperitoneal liposarcoma: a prospective cohort study.基于分子特征的腹膜后脂肪肉瘤分类:一项前瞻性队列研究。
Elife. 2025 May 23;14:RP100887. doi: 10.7554/eLife.100887.
7
Cauliflower leaf diseases: A computer vision dataset for smart agriculture.花椰菜叶部病害:一个用于智慧农业的计算机视觉数据集。
Data Brief. 2025 Apr 28;60:111594. doi: 10.1016/j.dib.2025.111594. eCollection 2025 Jun.
8
Detecting Redundant Health Survey Questions by Using Language-Agnostic Bidirectional Encoder Representations From Transformers Sentence Embedding: Algorithm Development Study.使用来自Transformer句子嵌入的语言无关双向编码器表示法检测冗余健康调查问题:算法开发研究
JMIR Med Inform. 2025 Jun 10;13:e71687. doi: 10.2196/71687.
9
Learning hemodynamic scalar fields on coronary artery meshes: A benchmark of geometric deep learning models.在冠状动脉网格上学习血流动力学标量场:几何深度学习模型的基准测试
Comput Biol Med. 2025 Jun 17;195:110477. doi: 10.1016/j.compbiomed.2025.110477.
10
Surveillance for Violent Deaths - National Violent Death Reporting System, 50 States, the District of Columbia, and Puerto Rico, 2022.暴力死亡监测——2022年全国暴力死亡报告系统,50个州、哥伦比亚特区和波多黎各
MMWR Surveill Summ. 2025 Jun 12;74(5):1-42. doi: 10.15585/mmwr.ss7405a1.

引用本文的文献

1
EA-CNN: Enhanced attention-CNN with explainable AI for fruit and vegetable classification.EA-CNN:用于果蔬分类的具有可解释人工智能的增强注意力卷积神经网络。
Heliyon. 2024 Nov 30;10(23):e40820. doi: 10.1016/j.heliyon.2024.e40820. eCollection 2024 Dec 15.
2
MuTCELM: An optimal multi-TextCNN-based ensemble learning for text classification.MuTCELM:一种基于最优多文本卷积神经网络的文本分类集成学习方法
Heliyon. 2024 Sep 30;10(19):e38515. doi: 10.1016/j.heliyon.2024.e38515. eCollection 2024 Oct 15.
3
An improved deep convolutional neural network-based YouTube video classification using textual features.

本文引用的文献

1
Determination of the lactose content in low-lactose milk using Fourier-transform infrared spectroscopy (FTIR) and convolutional neural network.使用傅里叶变换红外光谱(FTIR)和卷积神经网络测定低乳糖牛奶中的乳糖含量。
Heliyon. 2023 Jan 10;9(1):e12898. doi: 10.1016/j.heliyon.2023.e12898. eCollection 2023 Jan.
2
Handling consumer vulnerability in e-commerce product images using machine learning.利用机器学习处理电子商务产品图片中的消费者易受影响因素。
Heliyon. 2022 Sep 24;8(9):e10743. doi: 10.1016/j.heliyon.2022.e10743. eCollection 2022 Sep.
3
A deep learning approach in predicting products' sentiment ratings: a comparative analysis.
一种基于改进的深度卷积神经网络并利用文本特征的YouTube视频分类方法。
Heliyon. 2024 Aug 10;10(16):e35812. doi: 10.1016/j.heliyon.2024.e35812. eCollection 2024 Aug 30.
一种用于预测产品情感评分的深度学习方法:比较分析。
J Supercomput. 2022;78(5):7206-7226. doi: 10.1007/s11227-021-04169-6. Epub 2021 Nov 5.
4
Sentiment analysis and its applications in fighting COVID-19 and infectious diseases: A systematic review.情感分析及其在抗击新冠疫情和传染病中的应用:一项系统综述
Expert Syst Appl. 2021 Apr 1;167:114155. doi: 10.1016/j.eswa.2020.114155. Epub 2020 Oct 28.
5
Improving the accuracy of medical diagnosis with causal machine learning.利用因果机器学习提高医学诊断的准确性。
Nat Commun. 2020 Aug 11;11(1):3923. doi: 10.1038/s41467-020-17419-7.
6
Flexible Piezoelectric Acoustic Sensors and Machine Learning for Speech Processing.用于语音处理的柔性压电声学传感器和机器学习。
Adv Mater. 2020 Sep;32(35):e1904020. doi: 10.1002/adma.201904020. Epub 2019 Oct 16.
7
Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models.机器学习技术在个体化乳腺癌风险预测中的应用:与 BCRAT 和 BOADICEA 模型的比较。
Breast Cancer Res. 2019 Jun 20;21(1):75. doi: 10.1186/s13058-019-1158-4.
8
Data augmentation in dermatology image recognition using machine learning.基于机器学习的皮肤病学图像识别中的数据增强。
Skin Res Technol. 2019 Nov;25(6):815-820. doi: 10.1111/srt.12726. Epub 2019 May 29.
9
Deep learning for cellular image analysis.深度学习在细胞图像分析中的应用。
Nat Methods. 2019 Dec;16(12):1233-1246. doi: 10.1038/s41592-019-0403-1. Epub 2019 May 27.
10
A systematic review of natural language processing and text mining of symptoms from electronic patient-authored text data.基于电子患者自报告文本数据的症状自然语言处理和文本挖掘的系统评价。
Int J Med Inform. 2019 May;125:37-46. doi: 10.1016/j.ijmedinf.2019.02.008. Epub 2019 Feb 20.