Shah Shahid Munir, Aljawarneh Mahmoud Mohammad, Saleem Muhammad Aamer, Jawarneh Mahmoud Saleh
Faculty of Engineering Sciences and Technology, Hamdard University, Karachi, Pakistan.
Faculty of Information Technology, Applied Science Private University, Amman, Jordan.
PeerJ Comput Sci. 2024 Oct 7;10:e2296. doi: 10.7717/peerj-cs.2296. eCollection 2024.
Mental illness is a common disease that at its extremes leads to personal and societal suffering. A complicated multi-factorial disease, mental illness is influenced by a number of socioeconomic and clinical factors, including individual risk factors. Traditionally, approaches relying on personal interviews and filling out questionnaires have been employed to diagnose mental illness; however, these manual procedures have been found to be frequently prone to errors and unable to reliably identify individuals with mental illness. Fortunately, people with mental illnesses frequently express their ailments on social media, making it possible to more precisely identify mental disease by harvesting their social media posts. This study offers a thorough analysis of how to identify mental illnesses (more specifically, depression) from users' social media data. Along with the explanation of data acquisition, preprocessing, feature extraction, and classification techniques, the most recent published literature is presented to give the readers a thorough understanding of the subject. Since, in the recent past, the majority of the relevant scientific community has focused on using machine learning (ML) and deep learning (DL) models to identify mental illness, so the review also focuses on these techniques and along with their detail, their critical analysis is presented. More than 100 DL, ML, and natural language processing (NLP) based models developed for mental illness in the recent past have been reviewed, and their technical contributions and strengths are discussed. There exist multiple review studies, however, discussing extensive recent literature along with the complete road map on how to design a mental illness detection system using social media data and ML and DL classification methods is limited. The review also includes detail on how a dataset may be acquired from social media platforms, how it is preprocessed, and features are extracted from it to employ for mental illness detection. Hence, we anticipate that this review will help readers learn more and give them a comprehensive road map for identifying mental illnesses using users' social media data.
精神疾病是一种常见疾病,在极端情况下会导致个人痛苦和社会痛苦。作为一种复杂的多因素疾病,精神疾病受到许多社会经济和临床因素的影响,包括个体风险因素。传统上,诊断精神疾病采用依赖个人访谈和填写问卷的方法;然而,这些人工程序经常容易出错,无法可靠地识别患有精神疾病的个体。幸运的是,患有精神疾病的人经常在社交媒体上表达他们的疾病,这使得通过收集他们的社交媒体帖子更精确地识别精神疾病成为可能。本研究全面分析了如何从用户的社交媒体数据中识别精神疾病(更具体地说是抑郁症)。除了解释数据采集、预处理、特征提取和分类技术外,还展示了最新发表的文献,以使读者对该主题有全面的了解。由于最近大多数相关科学界都专注于使用机器学习(ML)和深度学习(DL)模型来识别精神疾病,因此本综述也聚焦于这些技术,并详细介绍了它们,并对其进行了批判性分析。对最近开发的100多个基于DL、ML和自然语言处理(NLP)的精神疾病模型进行了综述,并讨论了它们的技术贡献和优势。然而,虽然存在多项综述研究,但全面讨论近期文献以及关于如何使用社交媒体数据以及ML和DL分类方法设计精神疾病检测系统的完整路线图的研究却很有限。该综述还包括关于如何从社交媒体平台获取数据集、如何对其进行预处理以及如何从中提取特征以用于精神疾病检测的详细信息。因此,我们预计本综述将帮助读者了解更多内容,并为他们提供使用用户社交媒体数据识别精神疾病的全面路线图。