• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于 Bi-LSTM 和 BERT 模型的心理健康预测新型文本挖掘方法。

A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model.

机构信息

Department of Artificial Intelligence, Ajou University, Suwon, Republic of Korea.

Department of Software, Sejong University, Republic of Korea.

出版信息

Comput Intell Neurosci. 2022 Mar 3;2022:7893775. doi: 10.1155/2022/7893775. eCollection 2022.

DOI:10.1155/2022/7893775
PMID:35281185
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8913054/
Abstract

With the current advancement in the Internet, there has been a growing demand for building intelligent and smart systems that can efficiently address the detection of health-related problems on social media, such as the detection of depression and anxiety. These types of systems, which are mainly dependent on machine learning techniques, must be able to deal with obtaining the semantic and syntactic meaning of texts posted by users on social media. The data generated by users on social media contains unstructured and unpredictable content. Several systems based on machine learning and social media platforms have recently been introduced to identify health-related problems. However, the text representation and deep learning techniques employed provide only limited information and knowledge about the different texts posted by users. This is owing to a lack of long-term dependencies between each word in the entire text and a lack of proper exploitation of recent deep learning schemes. In this paper, we propose a novel framework to efficiently and effectively identify depression and anxiety-related posts while maintaining the contextual and semantic meaning of the words used in the whole corpus when applying bidirectional encoder representations from transformers (BERT). In addition, we propose a knowledge distillation technique, which is a recent technique for transferring knowledge from a large pretrained model (BERT) to a smaller model to boost performance and accuracy. We also devised our own data collection framework from Reddit and Twitter, which are the most common social media sites. Finally, we employed word2vec and BERT with Bi-LSTM to effectively analyze and detect depression and anxiety signs from social media posts. Our system surpasses other state-of-the-art methods and achieves an accuracy of 98% using the knowledge distillation technique.

摘要

随着互联网的当前发展,对于构建能够有效解决社交媒体中与健康相关问题(例如抑郁和焦虑的检测)的智能和智能系统的需求不断增长。这些系统主要依赖于机器学习技术,必须能够处理获取用户在社交媒体上发布的文本的语义和句法意义。用户在社交媒体上生成的数据包含非结构化和不可预测的内容。最近已经引入了几个基于机器学习和社交媒体平台的系统来识别与健康相关的问题。但是,所采用的文本表示和深度学习技术仅提供了有关用户发布的不同文本的有限信息和知识。这是由于缺乏整个文本中每个单词之间的长期依赖性,以及缺乏对最近深度学习方案的适当利用。在本文中,我们提出了一种新颖的框架,该框架可以有效地识别抑郁和焦虑相关的帖子,同时在应用来自变压器的双向编码器表示(BERT)时保持整个语料库中使用的单词的上下文和语义意义。此外,我们提出了一种知识提炼技术,这是一种从大型预训练模型(BERT)转移知识到较小模型以提高性能和准确性的新技术。我们还设计了自己的从 Reddit 和 Twitter 收集数据的框架,这是最常见的社交媒体网站。最后,我们使用 word2vec 和 BERT 与 Bi-LSTM 有效地分析和检测社交媒体帖子中的抑郁和焦虑迹象。我们的系统超越了其他最先进的方法,并在使用知识提炼技术时达到了 98%的准确率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/75447c122bb7/CIN2022-7893775.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/e842c4de1f7b/CIN2022-7893775.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/e757c20d2f81/CIN2022-7893775.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/b7011a3ab0b7/CIN2022-7893775.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/3cf89dc30e81/CIN2022-7893775.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/63aae59095e1/CIN2022-7893775.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/23adcbe39a0d/CIN2022-7893775.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/3cfc7067e15c/CIN2022-7893775.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/01ba55da106f/CIN2022-7893775.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/d33edc6702c3/CIN2022-7893775.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/8b7b0dc74805/CIN2022-7893775.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/75447c122bb7/CIN2022-7893775.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/e842c4de1f7b/CIN2022-7893775.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/e757c20d2f81/CIN2022-7893775.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/b7011a3ab0b7/CIN2022-7893775.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/3cf89dc30e81/CIN2022-7893775.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/63aae59095e1/CIN2022-7893775.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/23adcbe39a0d/CIN2022-7893775.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/3cfc7067e15c/CIN2022-7893775.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/01ba55da106f/CIN2022-7893775.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/d33edc6702c3/CIN2022-7893775.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/8b7b0dc74805/CIN2022-7893775.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c24/8913054/75447c122bb7/CIN2022-7893775.011.jpg

相似文献

1
A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model.一种基于 Bi-LSTM 和 BERT 模型的心理健康预测新型文本挖掘方法。
Comput Intell Neurosci. 2022 Mar 3;2022:7893775. doi: 10.1155/2022/7893775. eCollection 2022.
2
Detecting Potentially Harmful and Protective Suicide-Related Content on Twitter: Machine Learning Approach.在 Twitter 上检测潜在有害和保护自杀相关内容:机器学习方法。
J Med Internet Res. 2022 Aug 17;24(8):e34705. doi: 10.2196/34705.
3
Vaccine sentiment analysis using BERT + NBSVM and geo-spatial approaches.使用BERT + NBSVM和地理空间方法的疫苗情绪分析。
J Supercomput. 2023 May 7:1-31. doi: 10.1007/s11227-023-05319-8.
4
Identifying health related occupations of Twitter users through word embedding and deep neural networks.通过词嵌入和深度神经网络识别 Twitter 用户的健康相关职业。
BMC Bioinformatics. 2022 Sep 28;22(Suppl 10):630. doi: 10.1186/s12859-022-04933-2.
5
An Automated Toxicity Classification on Social Media Using LSTM and Word Embedding.基于 LSTM 和词嵌入的社交媒体自动毒性分类。
Comput Intell Neurosci. 2022 Feb 15;2022:8467349. doi: 10.1155/2022/8467349. eCollection 2022.
6
Understanding Mental Health Issues in Different Subdomains of Social Networking Services: Computational Analysis of Text-Based Reddit Posts.理解不同社交网络服务领域的心理健康问题:基于文本的 Reddit 帖子的计算分析。
J Med Internet Res. 2023 Nov 30;25:e49074. doi: 10.2196/49074.
7
Cancer Prevention and Treatment on Chinese Social Media: Machine Learning-Based Content Analysis Study.中文社交媒体中的癌症预防与治疗:基于机器学习的内容分析研究。
J Med Internet Res. 2024 Aug 14;26:e55937. doi: 10.2196/55937.
8
Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features.社交媒体中的药物警戒:使用带有词嵌入聚类特征的序列标注挖掘药物不良反应提及信息。
J Am Med Inform Assoc. 2015 May;22(3):671-81. doi: 10.1093/jamia/ocu041. Epub 2015 Mar 9.
9
Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
10
Mining e-cigarette adverse events in social media using Bi-LSTM recurrent neural network with word embedding representation.利用带有词嵌入表示的 Bi-LSTM 递归神经网络挖掘社交媒体中的电子烟不良事件。
J Am Med Inform Assoc. 2018 Jan 1;25(1):72-80. doi: 10.1093/jamia/ocx045.

引用本文的文献

1
Enhancing medical text classification with GAN-based data augmentation and multi-task learning in BERT.利用基于生成对抗网络(GAN)的数据增强和BERT中的多任务学习来增强医学文本分类
Sci Rep. 2025 Apr 22;15(1):13854. doi: 10.1038/s41598-025-98281-9.
2
Online comments of tourist attractions combining artificial intelligence text mining model and attention mechanism.结合人工智能文本挖掘模型和注意力机制的旅游景点在线评论
Sci Rep. 2025 Jan 7;15(1):1121. doi: 10.1038/s41598-025-85139-3.
3
Task-Specific Transformer-Based Language Models in Health Care: Scoping Review.

本文引用的文献

1
Explainable depression detection with multi-aspect features using a hybrid deep learning model on social media.基于社交媒体,使用混合深度学习模型通过多方面特征进行可解释的抑郁症检测。
World Wide Web. 2022;25(1):281-304. doi: 10.1007/s11280-021-00992-2. Epub 2022 Jan 28.
2
Traffic accident detection and condition analysis based on social networking data.基于社交网络数据的交通事故检测与状态分析。
Accid Anal Prev. 2021 Mar;151:105973. doi: 10.1016/j.aap.2021.105973. Epub 2021 Jan 15.
3
A deep learning model for detecting mental illness from user content on social media.
基于任务特定的转换器的语言模型在医疗保健中的应用:范围综述。
JMIR Med Inform. 2024 Nov 18;12:e49724. doi: 10.2196/49724.
4
Debate and Dilemmas Regarding Generative AI in Mental Health Care: Scoping Review.心理健康护理中生成式人工智能的争论与困境:范围综述
Interact J Med Res. 2024 Aug 12;13:e53672. doi: 10.2196/53672.
5
Predicting which patients with cancer will see a psychiatrist or counsellor from their initial oncology consultation document using natural language processing.使用自然语言处理技术,从最初的肿瘤学咨询文档中预测哪些癌症患者将会咨询精神科医生或咨询师。
Commun Med (Lond). 2024 Apr 8;4(1):69. doi: 10.1038/s43856-024-00495-x.
6
Optimizing classification of diseases through language model analysis of symptoms.通过对症状进行语言模型分析来优化疾病分类。
Sci Rep. 2024 Jan 17;14(1):1507. doi: 10.1038/s41598-024-51615-5.
7
Retracted: A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model.撤回:一种使用双向长短期记忆网络(Bi-LSTM)和BERT模型进行心理健康预测的新型文本挖掘方法。
Comput Intell Neurosci. 2023 Oct 11;2023:9872487. doi: 10.1155/2023/9872487. eCollection 2023.
8
Mapping loneliness through social intelligence analysis: a step towards creating global loneliness map.通过社会智能分析绘制孤独图:迈向创建全球孤独图的一步。
BMJ Health Care Inform. 2023 Oct;30(1). doi: 10.1136/bmjhci-2022-100728.
9
An automated materials and processes identification tool for material informatics using deep learning approach.一种使用深度学习方法的用于材料信息学的自动化材料与工艺识别工具。
Heliyon. 2023 Sep 14;9(9):e20003. doi: 10.1016/j.heliyon.2023.e20003. eCollection 2023 Sep.
10
Fake news stance detection using selective features and FakeNET.基于选择性特征和 FakeNET 的虚假新闻立场检测
PLoS One. 2023 Jul 31;18(7):e0287298. doi: 10.1371/journal.pone.0287298. eCollection 2023.
基于社交媒体用户内容的精神疾病深度学习模型
Sci Rep. 2020 Jul 16;10(1):11846. doi: 10.1038/s41598-020-68764-y.
4
Mobile Phone and Wearable Sensor-Based mHealth Approaches for Psychiatric Disorders and Symptoms: Systematic Review.基于手机和可穿戴传感器的精神疾病及症状移动健康方法:系统评价
JMIR Ment Health. 2019 Feb 20;6(2):e9819. doi: 10.2196/mental.9819.
5
Revealing real-time emotional responses: a personalized assessment based on heartbeat dynamics.揭示实时情绪反应:基于心跳动态的个性化评估。
Sci Rep. 2014 May 21;4:4998. doi: 10.1038/srep04998.
6
From mobile mental health to mobile wellbeing: opportunities and challenges.从移动心理健康到移动健康:机遇与挑战。
Stud Health Technol Inform. 2013;184:141-7.
7
Rethinking Communication in the E-health Era.重新思考电子健康时代的沟通。
J Health Psychol. 2003 Jan;8(1):7-23. doi: 10.1177/1359105303008001426.
8
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.