• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多模态仇恨言论检测:一种用于多语言文本和图像的新型深度学习框架。

Multimodal hate speech detection: a novel deep learning framework for multilingual text and images.

作者信息

Saddozai Furqan Khan, Badri Sahar K, Alghazzawi Daniyal, Khattak Asad, Asghar Muhammad Zubair

机构信息

Gomal Research Institute of Computing, Faculty of Computing, Gomal University, D.I.Khan, KP, Pakistan.

Information Systems Department, Faculty of Computing and Information Technology, King Abdul Aziz University, Jeddah, Saudi Arabia.

出版信息

PeerJ Comput Sci. 2025 Apr 16;11:e2801. doi: 10.7717/peerj-cs.2801. eCollection 2025.

DOI:10.7717/peerj-cs.2801
PMID:40567705
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12190340/
Abstract

The rapid proliferation of social media platforms has facilitated the expression of opinions but also enabled the spread of hate speech. Detecting multimodal hate speech in low-resource multilingual contexts poses significant challenges. This study presents a deep learning framework that integrates bidirectional long short-term memory (BiLSTM) and EfficientNetB1 to classify hate speech in Urdu-English tweets, leveraging both text and image modalities. We introduce multimodal multilingual hate speech (MMHS11K), a manually annotated dataset comprising 11,000 multimodal tweets. Using an early fusion strategy, text and image features were combined for classification. Experimental results demonstrate that the BiLSTM+EfficientNetB1 model outperforms unimodal and baseline multimodal approaches, achieving an F1-score of 81.2% for Urdu tweets and 75.5% for English tweets. This research addresses critical gaps in multilingual and multimodal hate speech detection, offering a foundation for future advancements.

摘要

社交媒体平台的迅速扩散既促进了观点的表达,但也使得仇恨言论得以传播。在资源匮乏的多语言环境中检测多模态仇恨言论面临着重大挑战。本研究提出了一个深度学习框架,该框架整合了双向长短期记忆(BiLSTM)和高效神经网络B1(EfficientNetB1),以利用文本和图像模态对乌尔都语-英语推文中的仇恨言论进行分类。我们引入了多模态多语言仇恨言论(MMHS11K),这是一个包含11000条多模态推文的人工标注数据集。使用早期融合策略,将文本和图像特征结合起来进行分类。实验结果表明,BiLSTM+EfficientNetB1模型优于单模态和基线多模态方法,乌尔都语推文的F1分数达到81.2%,英语推文的F1分数达到75.5%。本研究解决了多语言和多模态仇恨言论检测中的关键空白,为未来的进展奠定了基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/f32399107042/peerj-cs-11-2801-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/2af441400293/peerj-cs-11-2801-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/5dbbcf36d8d1/peerj-cs-11-2801-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/d0ed8cc03e05/peerj-cs-11-2801-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/cbd5d5ff908b/peerj-cs-11-2801-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/44e5bc52974c/peerj-cs-11-2801-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/982f6abd0751/peerj-cs-11-2801-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/f32399107042/peerj-cs-11-2801-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/2af441400293/peerj-cs-11-2801-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/5dbbcf36d8d1/peerj-cs-11-2801-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/d0ed8cc03e05/peerj-cs-11-2801-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/cbd5d5ff908b/peerj-cs-11-2801-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/44e5bc52974c/peerj-cs-11-2801-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/982f6abd0751/peerj-cs-11-2801-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0b7/12190340/f32399107042/peerj-cs-11-2801-g007.jpg

相似文献

1
Multimodal hate speech detection: a novel deep learning framework for multilingual text and images.多模态仇恨言论检测:一种用于多语言文本和图像的新型深度学习框架。
PeerJ Comput Sci. 2025 Apr 16;11:e2801. doi: 10.7717/peerj-cs.2801. eCollection 2025.
2
UMEDNet: a multimodal approach for emotion detection in the Urdu language.UMEDNet:一种用于乌尔都语情感检测的多模态方法。
PeerJ Comput Sci. 2025 May 1;11:e2861. doi: 10.7717/peerj-cs.2861. eCollection 2025.
3
TARGE: large language model-powered explainable hate speech detection.TARGE:由大语言模型驱动的可解释仇恨言论检测
PeerJ Comput Sci. 2025 May 30;11:e2911. doi: 10.7717/peerj-cs.2911. eCollection 2025.
4
Decoding Digital Discourse Through Multimodal Text and Image Machine Learning Models to Classify Sentiment and Detect Hate Speech in Race- and Lesbian, Gay, Bisexual, Transgender, Queer, Intersex, and Asexual Community-Related Posts on Social Media: Quantitative Study.通过多模态文本和图像机器学习模型解码数字话语,以对社交媒体上与种族以及女同性恋、男同性恋、双性恋、跨性别者、酷儿、双性人及无性恋者群体相关帖子中的情感进行分类并检测仇恨言论:定量研究
J Med Internet Res. 2025 May 12;27:e72822. doi: 10.2196/72822.
5
Enhancing Pulmonary Disease Prediction Using Large Language Models With Feature Summarization and Hybrid Retrieval-Augmented Generation: Multicenter Methodological Study Based on Radiology Report.使用具有特征总结和混合检索增强生成功能的大语言模型增强肺部疾病预测:基于放射学报告的多中心方法学研究
J Med Internet Res. 2025 Jun 11;27:e72638. doi: 10.2196/72638.
6
Roman urdu hate speech detection using hybrid machine learning models and hyperparameter optimization.基于混合机器学习模型和超参数优化的罗马 Urdu 仇恨言论检测
Sci Rep. 2024 Nov 19;14(1):28590. doi: 10.1038/s41598-024-79106-7.
7
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
8
Exploring the Potential of Electroencephalography Signal-Based Image Generation Using Diffusion Models: Integrative Framework Combining Mixed Methods and Multimodal Analysis.利用扩散模型探索基于脑电图信号的图像生成潜力:结合混合方法和多模态分析的综合框架
JMIR Med Inform. 2025 Jun 25;13:e72027. doi: 10.2196/72027.
9
A Deep Learning Model for Identifying the Risk of Mesenteric Malperfusion in Acute Aortic Dissection Using Initial Diagnostic Data: Algorithm Development and Validation.一种利用初始诊断数据识别急性主动脉夹层中肠系膜灌注不良风险的深度学习模型:算法开发与验证
J Med Internet Res. 2025 Jun 10;27:e72649. doi: 10.2196/72649.
10
Class-weighted Dempster-Shafer in dual-level fusion for multimodal fake real estate listings detection.用于多模态虚假房地产列表检测的双层融合中的类加权邓普斯特-谢弗方法
PeerJ Comput Sci. 2025 May 27;11:e2797. doi: 10.7717/peerj-cs.2797. eCollection 2025.

本文引用的文献

1
EffUnet-SpaGen: An Efficient and Spatial Generative Approach to Glaucoma Detection.EffUnet-SpaGen:一种用于青光眼检测的高效空间生成方法。
J Imaging. 2021 May 30;7(6):92. doi: 10.3390/jimaging7060092.
2
Roman Urdu Hate Speech Detection Using Transformer-Based Model for Cyber Security Applications.基于转换器模型的罗曼 Urdu 仇恨言论检测在网络安全应用中的研究
Sensors (Basel). 2023 Apr 12;23(8):3909. doi: 10.3390/s23083909.