一种通过阿拉伯语推文进行自杀检测的优化深度学习方法。

An optimized deep learning approach for suicide detection through Arabic tweets.

作者信息

Baghdadi Nadiah A, Malki Amer, Magdy Balaha Hossam, AbdulAzeem Yousry, Badawy Mahmoud, Elhosseini Mostafa

机构信息

Nursing Management and Education Department, College of Nursing, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

College of Computer Science and Engineering, Taibah University, Yanbu, Saudi Arabia.

出版信息

PeerJ Comput Sci. 2022 Aug 23;8:e1070. doi: 10.7717/peerj-cs.1070. eCollection 2022.

DOI:10.7717/peerj-cs.1070

PMID:36092010

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9455273/

Abstract

Many people worldwide suffer from mental illnesses such as major depressive disorder (MDD), which affect their thoughts, behavior, and quality of life. Suicide is regarded as the second leading cause of death among teenagers when treatment is not received. Twitter is a platform for expressing their emotions and thoughts about many subjects. Many studies, including this one, suggest using social media data to track depression and other mental illnesses. Even though Arabic is widely spoken and has a complex syntax, depressive detection methods have not been applied to the language. The Arabic tweets dataset should be scraped and annotated first. Then, a complete framework for categorizing tweet inputs into two classes (such as Normal or Suicide) is suggested in this study. The article also proposes an Arabic tweet preprocessing algorithm that contrasts lemmatization, stemming, and various lexical analysis methods. Experiments are conducted using Twitter data scraped from the Internet. Five different annotators have annotated the data. Performance metrics are reported on the suggested dataset using the latest Bidirectional Encoder Representations from Transformers (BERT) and Universal Sentence Encoder (USE) models. The measured performance metrics are balanced accuracy, specificity, F1-score, IoU, ROC, Youden Index, NPV, and weighted sum metric (WSM). Regarding USE models, the best-weighted sum metric (WSM) is 80.2%, and with regards to Arabic BERT models, the best WSM is 95.26%.

摘要

全球许多人患有精神疾病，如重度抑郁症（MDD），这些疾病会影响他们的思想、行为和生活质量。在未接受治疗的情况下，自杀被视为青少年的第二大死因。推特是一个表达他们对许多主题的情感和想法的平台。包括本研究在内的许多研究都建议利用社交媒体数据来追踪抑郁症和其他精神疾病。尽管阿拉伯语广泛使用且语法复杂，但抑郁检测方法尚未应用于该语言。首先应抓取并标注阿拉伯语推文数据集。然后，本研究提出了一个将推文输入分类为两类（如正常或自杀）的完整框架。文章还提出了一种阿拉伯语推文预处理算法，该算法对比了词形还原、词干提取和各种词汇分析方法。使用从互联网上抓取的推特数据进行实验。五名不同的注释者对数据进行了标注。使用最新的来自变换器的双向编码器表示（BERT）和通用句子编码器（USE）模型，在建议的数据集上报告性能指标。测量的性能指标包括平衡准确率、特异性、F1分数、交并比（IoU）、ROC、约登指数、阴性预测值和加权和指标（WSM）。关于USE模型，最佳加权和指标（WSM）为80.2%，关于阿拉伯语BERT模型，最佳WSM为95.26%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd65/9455273/d73d3fbae2be/peerj-cs-08-1070-g001.jpg

相似文献

An optimized deep learning approach for suicide detection through Arabic tweets.一种通过阿拉伯语推文进行自杀检测的优化深度学习方法。

PeerJ Comput Sci. 2022 Aug 23;8:e1070. doi: 10.7717/peerj-cs.1070. eCollection 2022.

Social Media Monitoring of the COVID-19 Pandemic and Influenza Epidemic With Adaptation for Informal Language in Arabic Twitter Data: Qualitative Study.对新冠疫情和流感流行进行社交媒体监测，并针对阿拉伯语推特数据中的非正式语言进行调整：定性研究。

JMIR Med Inform. 2021 Sep 17;9(9):e27670. doi: 10.2196/27670.

Pretrained Transformer Language Models Versus Pretrained Word Embeddings for the Detection of Accurate Health Information on Arabic Social Media: Comparative Study.用于在阿拉伯社交媒体上检测准确健康信息的预训练Transformer语言模型与预训练词嵌入：比较研究

JMIR Form Res. 2022 Jun 29;6(6):e34834. doi: 10.2196/34834.

Traditional Machine Learning Models and Bidirectional Encoder Representations From Transformer (BERT)-Based Automatic Classification of Tweets About Eating Disorders: Algorithm Development and Validation Study.传统机器学习模型与基于双向编码器表征变换器（BERT）的饮食失调推文自动分类：算法开发与验证研究

JMIR Med Inform. 2022 Feb 24;10(2):e34492. doi: 10.2196/34492.

Asian hate speech detection on Twitter during COVID-19.新冠疫情期间推特上的反亚裔仇恨言论检测

Front Artif Intell. 2022 Aug 15;5:932381. doi: 10.3389/frai.2022.932381. eCollection 2022.

Comparison of pretrained transformer-based models for influenza and COVID-19 detection using social media text data in Saskatchewan, Canada.加拿大萨斯喀彻温省使用社交媒体文本数据对基于预训练变压器的流感和新冠病毒检测模型的比较

Front Digit Health. 2023 Jun 28;5:1203874. doi: 10.3389/fdgth.2023.1203874. eCollection 2023.

Identifying Potential Lyme Disease Cases Using Self-Reported Worldwide Tweets: Deep Learning Modeling Approach Enhanced With Sentimental Words Through Emojis.利用自我报告的全球推文识别潜在莱姆病病例：通过表情符号增强带有情感词汇的深度学习模型。

J Med Internet Res. 2023 Oct 16;25:e47014. doi: 10.2196/47014.

Detecting Potentially Harmful and Protective Suicide-Related Content on Twitter: Machine Learning Approach.在 Twitter 上检测潜在有害和保护自杀相关内容：机器学习方法。

J Med Internet Res. 2022 Aug 17;24(8):e34705. doi: 10.2196/34705.

Momentary Depressive Feeling Detection Using X (Formerly Twitter) Data: Contextual Language Approach.使用X（原推特）数据检测瞬间抑郁情绪：上下文语言方法。

JMIR AI. 2023 Nov 27;2:e49531. doi: 10.2196/49531.

Development of a COVID-19-Related Anti-Asian Tweet Data Set: Quantitative Study.与新冠疫情相关的反亚裔推文数据集的开发：定量研究。

JMIR Form Res. 2023 Feb 28;7:e40403. doi: 10.2196/40403.

引用本文的文献

A Scoping Review of Arabic Natural Language Processing for Mental Health.阿拉伯语心理健康自然语言处理的范围综述

Healthcare (Basel). 2025 Apr 22;13(9):963. doi: 10.3390/healthcare13090963.

Evaluating of BERT-based and Large Language Mod for Suicide Detection, Prevention, and Risk Assessment: A Systematic Review.基于BERT和大语言模型的自杀检测、预防及风险评估研究：一项系统综述

J Med Syst. 2024 Dec 30;48(1):113. doi: 10.1007/s10916-024-02134-3.

Mental illness detection through harvesting social media: a comprehensive literature review.通过挖掘社交媒体进行精神疾病检测：一项全面的文献综述

PeerJ Comput Sci. 2024 Oct 7;10:e2296. doi: 10.7717/peerj-cs.2296. eCollection 2024.

Precise Prostate Cancer Assessment Using IVIM-Based Parametric Estimation of Blood Diffusion from DW-MRI.基于扩散加权磁共振成像（DW-MRI）的体素内不相干运动（IVIM）参数估计对前列腺癌进行精准评估

Bioengineering (Basel). 2024 Jun 19;11(6):629. doi: 10.3390/bioengineering11060629.

Special issue on analysis and mining of social media data.社交媒体数据分析与挖掘特刊。

PeerJ Comput Sci. 2024 Feb 29;10:e1909. doi: 10.7717/peerj-cs.1909. eCollection 2024.

A concentrated machine learning-based classification system for age-related macular degeneration (AMD) diagnosis using fundus images.基于机器学习的眼底图像年龄相关性黄斑变性（AMD）诊断集中分类系统。

Sci Rep. 2024 Jan 29;14(1):2434. doi: 10.1038/s41598-024-52131-2.

Semi-supervised learning and bidirectional decoding for effective grammar correction in low-resource scenarios.低资源场景下用于有效语法纠正的半监督学习与双向解码

PeerJ Comput Sci. 2023 Oct 24;9:e1639. doi: 10.7717/peerj-cs.1639. eCollection 2023.

本文引用的文献

AC-TL-GTO: Alzheimer Automatic Accurate Classification Using Transfer Learning and Artificial Gorilla Troops Optimizer.AC-TL-GTO：基于迁移学习和人工大猩猩群优化的阿尔茨海默病自动精确分类。

Sensors (Basel). 2022 Jun 2;22(11):4250. doi: 10.3390/s22114250.

Natural language processing applied to mental illness detection: a narrative review.应用于精神疾病检测的自然语言处理：一篇叙述性综述。

NPJ Digit Med. 2022 Apr 8;5(1):46. doi: 10.1038/s41746-022-00589-7.

An hybrid deep learning approach for depression prediction from user tweets using feature-rich CNN and bi-directional LSTM.一种使用特征丰富的卷积神经网络（CNN）和双向长短期记忆网络（Bi-LSTM）从用户推文预测抑郁症的混合深度学习方法。

Multimed Tools Appl. 2022;81(17):23649-23685. doi: 10.1007/s11042-022-12648-y. Epub 2022 Mar 18.

An automated diagnosis and classification of COVID-19 from chest CT images using a transfer learning-based convolutional neural network.利用基于迁移学习的卷积神经网络对 chest CT 图像进行 COVID-19 的自动诊断和分类。

Comput Biol Med. 2022 May;144:105383. doi: 10.1016/j.compbiomed.2022.105383. Epub 2022 Mar 10.

Detecting and Measuring Depression on Social Media Using a Machine Learning Approach: Systematic Review.使用机器学习方法在社交媒体上检测和测量抑郁症：系统评价

JMIR Ment Health. 2022 Mar 1;9(3):e27244. doi: 10.2196/27244.

Detecting Depression Signs on Social Media: A Systematic Literature Review.在社交媒体上检测抑郁症迹象：一项系统的文献综述。

Healthcare (Basel). 2022 Feb 1;10(2):291. doi: 10.3390/healthcare10020291.

Explainable depression detection with multi-aspect features using a hybrid deep learning model on social media.基于社交媒体，使用混合深度学习模型通过多方面特征进行可解释的抑郁症检测。

World Wide Web. 2022;25(1):281-304. doi: 10.1007/s11280-021-00992-2. Epub 2022 Jan 28.

Automatic detection of depression symptoms in twitter using multimodal analysis.使用多模态分析自动检测推特上的抑郁症状。

J Supercomput. 2022;78(4):4709-4744. doi: 10.1007/s11227-021-04040-8. Epub 2021 Sep 9.

A deep learning model for detecting mental illness from user content on social media.基于社交媒体用户内容的精神疾病深度学习模型

Sci Rep. 2020 Jul 16;10(1):11846. doi: 10.1038/s41598-020-68764-y.

Detecting depression using a framework combining deep multimodal neural networks with a purpose-built automated evaluation.使用结合深度多模态神经网络和专门构建的自动化评估的框架来检测抑郁症。

Psychol Assess. 2019 Aug;31(8):1019-1027. doi: 10.1037/pas0000724. Epub 2019 May 2.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种通过阿拉伯语推文进行自杀检测的优化深度学习方法。

An optimized deep learning approach for suicide detection through Arabic tweets.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献