• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过语言模型检测基于网络论坛用户的抑郁症实现早期医疗干预:纵向分析与评估

Enabling Early Health Care Intervention by Detecting Depression in Users of Web-Based Forums using Language Models: Longitudinal Analysis and Evaluation.

作者信息

Owen David, Antypas Dimosthenis, Hassoulas Athanasios, Pardiñas Antonio F, Espinosa-Anke Luis, Collados Jose Camacho

机构信息

School of Computer Science and Informatics, Cardiff University, Cardiff, United Kingdom.

Centre for Medical Education, School of Medicine, Cardiff University, Cardiff, United Kingdom.

出版信息

JMIR AI. 2023 Mar 24;2:e41205. doi: 10.2196/41205.

DOI:10.2196/41205
PMID:37525646
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7614849/
Abstract

BACKGROUND

Major depressive disorder is a common mental disorder affecting 5% of adults worldwide. Early contact with health care services is critical for achieving accurate diagnosis and improving patient outcomes. Key symptoms of major depressive disorder (depression hereafter) such as cognitive distortions are observed in verbal communication, which can also manifest in the structure of written language. Thus, the automatic analysis of text outputs may provide opportunities for early intervention in settings where written communication is rich and regular, such as social media and web-based forums.

OBJECTIVE

The objective of this study was 2-fold. We sought to gauge the effectiveness of different machine learning approaches to identify users of the mass web-based forum Reddit, who eventually disclose a diagnosis of depression. We then aimed to determine whether the time between a forum post and a depression diagnosis date was a relevant factor in performing this detection.

METHODS

A total of 2 Reddit data sets containing posts belonging to users with and without a history of depression diagnosis were obtained. The intersection of these data sets provided users with an estimated date of depression diagnosis. This derived data set was used as an input for several machine learning classifiers, including transformer-based language models (LMs).

RESULTS

Bidirectional Encoder Representations from Transformers (BERT) and MentalBERT transformer-based LMs proved the most effective in distinguishing forum users with a known depression diagnosis from those without. They each obtained a mean -score of 0.64 across the experimental setups used for binary classification. The results also suggested that the final 12 to 16 weeks (about 3-4 months) of posts before a depressed user's estimated diagnosis date are the most indicative of their illness, with data before that period not helping the models detect more accurately. Furthermore, in the 4- to 8-week period before the user's estimated diagnosis date, their posts exhibited more negative sentiment than any other 4-week period in their post history.

CONCLUSIONS

Transformer-based LMs may be used on data from web-based social media forums to identify users at risk for psychiatric conditions such as depression. Language features picked up by these classifiers might predate depression onset by weeks to months, enabling proactive mental health care interventions to support those at risk for this condition.

摘要

背景

重度抑郁症是一种常见的精神障碍,全球5%的成年人受其影响。尽早联系医疗服务对于实现准确诊断和改善患者预后至关重要。重度抑郁症(以下简称抑郁症)的关键症状,如认知扭曲,在言语交流中可见,也可能体现在书面语言结构中。因此,在书面交流丰富且频繁的环境中,如社交媒体和网络论坛,对文本输出进行自动分析可能为早期干预提供机会。

目的

本研究有两个目标。我们试图评估不同机器学习方法识别大规模网络论坛Reddit用户的有效性,这些用户最终被诊断为患有抑郁症。然后,我们旨在确定论坛帖子发布时间与抑郁症诊断日期之间的时间间隔是否是进行这种检测的一个相关因素。

方法

总共获得了2个Reddit数据集,其中包含有抑郁症诊断史和无抑郁症诊断史用户的帖子。这些数据集的交集为用户提供了抑郁症诊断的估计日期。这个派生数据集被用作几个机器学习分类器的输入,包括基于Transformer的语言模型(LMs)。

结果

基于Transformer的双向编码器表征(BERT)和MentalBERT语言模型在区分已知患有抑郁症的论坛用户和未患抑郁症的用户方面最为有效。在用于二元分类的实验设置中,它们各自的平均得分均为0.64。结果还表明,在抑郁症患者估计诊断日期前的最后12至16周(约3 - 4个月)的帖子最能表明其病情,在此之前的数据无助于模型更准确地检测。此外,在用户估计诊断日期前的4至8周内,他们的帖子比其发帖历史中的任何其他4周时间段表现出更多的负面情绪。

结论

基于Transformer的语言模型可用于基于网络的社交媒体论坛数据,以识别有患抑郁症等精神疾病风险的用户。这些分类器提取的语言特征可能在抑郁症发作前数周甚至数月出现,从而能够进行积极的精神卫生保健干预,以支持有患此病风险的人群。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c413/11334377/a54ca179fb44/ai_v2i1e41205_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c413/11334377/62ea4ec79510/ai_v2i1e41205_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c413/11334377/6afb4259a5df/ai_v2i1e41205_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c413/11334377/728c5057a580/ai_v2i1e41205_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c413/11334377/a54ca179fb44/ai_v2i1e41205_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c413/11334377/62ea4ec79510/ai_v2i1e41205_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c413/11334377/6afb4259a5df/ai_v2i1e41205_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c413/11334377/728c5057a580/ai_v2i1e41205_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c413/11334377/a54ca179fb44/ai_v2i1e41205_fig4.jpg

相似文献

1
Enabling Early Health Care Intervention by Detecting Depression in Users of Web-Based Forums using Language Models: Longitudinal Analysis and Evaluation.通过语言模型检测基于网络论坛用户的抑郁症实现早期医疗干预:纵向分析与评估
JMIR AI. 2023 Mar 24;2:e41205. doi: 10.2196/41205.
2
Understanding Mental Health Issues in Different Subdomains of Social Networking Services: Computational Analysis of Text-Based Reddit Posts.理解不同社交网络服务领域的心理健康问题:基于文本的 Reddit 帖子的计算分析。
J Med Internet Res. 2023 Nov 30;25:e49074. doi: 10.2196/49074.
3
User Dynamics and Thematic Exploration in r/Depression During the COVID-19 Pandemic: Insights From Overlapping r/SuicideWatch Users.新冠疫情期间 r/Depression 中的用户动态和主题探索:来自重叠 r/SuicideWatch 用户的洞察。
J Med Internet Res. 2024 May 20;26:e53968. doi: 10.2196/53968.
4
Momentary Depressive Feeling Detection Using X (Formerly Twitter) Data: Contextual Language Approach.使用X(原推特)数据检测瞬间抑郁情绪:上下文语言方法。
JMIR AI. 2023 Nov 27;2:e49531. doi: 10.2196/49531.
5
Understanding user communication around loneliness on online forums.理解用户在在线论坛上围绕孤独感进行的交流。
PLoS One. 2021 Sep 23;16(9):e0257791. doi: 10.1371/journal.pone.0257791. eCollection 2021.
6
Emotional Expression on Social Media Support Forums for Substance Cessation: Observational Study of Text-Based Reddit Posts.社交媒体戒瘾支持论坛上的情绪表达:基于文本的 Reddit 帖子的观察性研究。
J Med Internet Res. 2023 Jul 19;25:e45267. doi: 10.2196/45267.
7
Determination of Patient Sentiment and Emotion in Ophthalmology: Infoveillance Tutorial on Web-Based Health Forum Discussions.眼科患者情绪的测定:基于网络健康论坛讨论的 Infoveillance 教程。
J Med Internet Res. 2021 May 17;23(5):e20803. doi: 10.2196/20803.
8
Using Social Media to Help Understand Patient-Reported Health Outcomes of Post-COVID-19 Condition: Natural Language Processing Approach.利用社交媒体帮助了解新冠后症状患者报告的健康结果:自然语言处理方法。
J Med Internet Res. 2023 Sep 19;25:e45767. doi: 10.2196/45767.
9
Detection of Suicidality Among Opioid Users on Reddit: Machine Learning-Based Approach.在 Reddit 上检测阿片类药物使用者的自杀倾向:基于机器学习的方法。
J Med Internet Res. 2020 Nov 27;22(11):e15293. doi: 10.2196/15293.
10
Natural Language Processing for Depression Prediction on Sina Weibo: Method Study and Analysis.基于新浪微博的抑郁预测的自然语言处理方法研究与分析
JMIR Ment Health. 2024 Sep 4;11:e58259. doi: 10.2196/58259.

引用本文的文献

1
The Applications of Large Language Models in Mental Health: Scoping Review.大语言模型在心理健康领域的应用:范围综述
J Med Internet Res. 2025 May 5;27:e69284. doi: 10.2196/69284.
2
Profiling Generalized Anxiety Disorder on Social Networks: Content and Behavior Analysis.社交网络上广泛性焦虑症的剖析:内容与行为分析
J Med Internet Res. 2025 Mar 20;27:e53399. doi: 10.2196/53399.
3
Development and Feasibility Study of HOPE Model for Prediction of Depression Among Older Adults Using Wi-Fi-based Motion Sensor Data: Machine Learning Study.

本文引用的文献

1
Natural language processing applied to mental illness detection: a narrative review.应用于精神疾病检测的自然语言处理:一篇叙述性综述。
NPJ Digit Med. 2022 Apr 8;5(1):46. doi: 10.1038/s41746-022-00589-7.
2
The relationship between text message sentiment and self-reported depression.短信情绪与自我报告抑郁的关系。
J Affect Disord. 2022 Apr 1;302:7-14. doi: 10.1016/j.jad.2021.12.048. Epub 2021 Dec 25.
3
Depression Detection on Reddit With an Emotion-Based Attention Network: Algorithm Development and Validation.基于情感注意力网络的Reddit上的抑郁症检测:算法开发与验证
基于Wi-Fi的运动传感器数据预测老年人抑郁症的HOPE模型的开发与可行性研究:机器学习研究
JMIR Aging. 2025 Mar 3;8:e67715. doi: 10.2196/67715.
4
Toward molecular diagnosis of major depressive disorder by plasma peptides using a deep learning approach.采用深度学习方法通过血浆肽实现对重度抑郁症的分子诊断。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae554.
5
AI for Analyzing Mental Health Disorders Among Social Media Users: Quarter-Century Narrative Review of Progress and Challenges.人工智能分析社交媒体用户心理健康障碍:进展和挑战的四分之一世纪叙事综述。
J Med Internet Res. 2024 Nov 15;26:e59225. doi: 10.2196/59225.
6
Identifying Marijuana Use Behaviors Among Youth Experiencing Homelessness Using a Machine Learning-Based Framework: Development and Evaluation Study.使用基于机器学习的框架识别无家可归青少年中的大麻使用行为:开发与评估研究
JMIR AI. 2024 Oct 17;3:e53488. doi: 10.2196/53488.
JMIR Med Inform. 2021 Jul 16;9(7):e28754. doi: 10.2196/28754.
4
An Effective BERT-Based Pipeline for Twitter Sentiment Analysis: A Case Study in Italian.基于 BERT 的 Twitter 情感分析有效流水线:意大利语案例研究。
Sensors (Basel). 2020 Dec 28;21(1):133. doi: 10.3390/s21010133.
5
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
6
Facebook language predicts depression in medical records.脸书的语言可预测病历中的抑郁症状。
Proc Natl Acad Sci U S A. 2018 Oct 30;115(44):11203-11208. doi: 10.1073/pnas.1802331115. Epub 2018 Oct 15.
7
Forecasting the onset and course of mental illness with Twitter data.利用 Twitter 数据预测精神疾病的发病和病程。
Sci Rep. 2017 Oct 11;7(1):13006. doi: 10.1038/s41598-017-12961-9.
8
A randomised controlled trial of the effectiveness of a program for early detection and treatment of depression in primary care.一项关于初级保健中抑郁早期发现和治疗计划有效性的随机对照试验。
J Affect Disord. 2016 Jul 1;198:96-101. doi: 10.1016/j.jad.2016.03.025. Epub 2016 Mar 15.
9
Imbalanced Learning Based on Logistic Discrimination.基于逻辑判别法的不均衡学习
Comput Intell Neurosci. 2016;2016:5423204. doi: 10.1155/2016/5423204. Epub 2016 Jan 4.
10
Critical slowing down as early warning for the onset and termination of depression.临界减速可作为抑郁发作和终止的早期预警。
Proc Natl Acad Sci U S A. 2014 Jan 7;111(1):87-92. doi: 10.1073/pnas.1312114110. Epub 2013 Dec 9.