Automatic Construction of a Depression-Domain Lexicon Based on Microblogs: Text Mining Study.

Authors

Li Genghao, Li Bing, Huang Langlin, Hou Sibing

Affiliations

School of Information Technology & Management, University of International Business and Economics, Beijing, China.

Graduate School of Art and Science, Columbia University, New York, NY, United States.

Publication information

JMIR Med Inform. 2020 Jun 23;8(6):e17650. doi: 10.2196/17650.

DOI: 10.2196/17650
PMID: 32574151
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7381008/

Abstract

BACKGROUND

According to a 2017 World Health Organization report, almost 1 in every 20 people in China had depression. However, depression is usually difficult to diagnose clinically because observation is slow, costs are high, and patients are often resistant. Meanwhile, with the rapid emergence of social networking sites, people frequently share their daily lives and disclose their inner feelings online, which makes it possible to identify mental conditions from this rich text information. Much has been achieved with English web-based corpora, but in research in China the extraction of language features from web-related depression signals is still at a relatively early stage.

OBJECTIVE

The purpose of this study was to propose an effective approach for constructing a depression-domain lexicon. The lexicon contains language features that could help identify social media users who potentially have depression. The study also compared depression detection performance with and without the lexicon.

METHODS

We automatically constructed a depression-domain lexicon using Word2Vec, a semantic relationship graph, and the label propagation algorithm. In combination, these methods performed well on a specific corpus during construction. The lexicon was built from 111,052 Weibo microblogs posted by 1868 users who were depressed or nondepressed. For depression detection, we considered six features and used five classification methods to test detection performance.
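
The general idea of growing a lexicon by label propagation over a word-similarity graph can be illustrated with a minimal sketch. It is not the authors' implementation: the two-dimensional vectors stand in for Word2Vec embeddings, and the seed words, the 0.9 similarity threshold, the 0.5 cutoff, and all function names are assumptions chosen for illustration.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def build_graph(vectors, threshold):
    """Connect two words when their cosine similarity exceeds the threshold."""
    words = list(vectors)
    graph = {w: {} for w in words}
    for i, a in enumerate(words):
        for b in words[i + 1:]:
            sim = cosine(vectors[a], vectors[b])
            if sim > threshold:
                graph[a][b] = sim
                graph[b][a] = sim
    return graph

def propagate(graph, seeds, iterations=20):
    """Spread seed scores (1.0 = depression-related, 0.0 = unrelated) over the graph."""
    scores = {w: seeds.get(w, 0.5) for w in graph}
    for _ in range(iterations):
        updated = {}
        for word, neighbours in graph.items():
            if word in seeds or not neighbours:
                # Seed labels stay clamped; isolated words keep their score.
                updated[word] = scores[word]
                continue
            total = sum(neighbours.values())
            updated[word] = sum(w * scores[n] for n, w in neighbours.items()) / total
        scores = updated
    return scores

# Hypothetical 2-D "embeddings"; real ones would come from a trained Word2Vec model.
toy_vectors = {
    "hopeless": [0.9, 0.1],
    "insomnia": [0.8, 0.3],
    "tired": [0.7, 0.4],
    "picnic": [0.1, 0.9],
    "sunny": [0.2, 0.8],
}
seeds = {"hopeless": 1.0, "sunny": 0.0}  # assumed seed words, not from the study

scores = propagate(build_graph(toy_vectors, threshold=0.9), seeds)
lexicon = sorted(w for w, s in scores.items() if s > 0.5)
print(lexicon)  # ['hopeless', 'insomnia', 'tired']
```

In this toy graph, words close to the depression seed inherit a high score and enter the lexicon, while words connected only to the neutral seed do not.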

RESULTS

The experimental results showed that, in terms of the F1 value, our autoconstruction method performed 1% to 6% better than the baseline approaches and was more effective and more stable. When applied to detection models such as logistic regression and support vector machine, our lexicon improved model performance by 2% to 9% and improved the final accuracy of potential depression detection.
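
For readers unfamiliar with the metric, the F1 value is the harmonic mean of precision and recall. The sketch below shows how two classifiers could be compared on it; the labels are made up for illustration and are not data or results from the study, and the function name is an assumption.

```python
def f1_score(true_labels, predicted_labels, positive=1):
    """F1 = 2 * precision * recall / (precision + recall) for the positive class."""
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Hypothetical labels: 1 = depressed user, 0 = nondepressed user.
truth = [1, 1, 1, 0, 0, 0]
baseline_predictions = [1, 0, 0, 0, 0, 1]  # made-up output of a model without the lexicon
lexicon_predictions = [1, 1, 0, 0, 0, 1]   # made-up output of a model with the lexicon

f1_baseline = f1_score(truth, baseline_predictions)
f1_lexicon = f1_score(truth, lexicon_predictions)
print(round(f1_baseline, 3), round(f1_lexicon, 3))  # 0.4 0.667
```

Because F1 penalizes both missed cases and false alarms, it is often preferred over plain accuracy when the two classes are unbalanced.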

CONCLUSIONS

Our depression-domain lexicon proved to be a meaningful input for classification algorithms, providing linguistic insights into the depressive status of test subjects. We believe that this lexicon will enhance early detection of depression among social media users. Future work should use a larger corpus and more complex methods.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c38/7381008/f05a27eef34e/medinform_v8i6e17650_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c38/7381008/5cd16c37460b/medinform_v8i6e17650_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c38/7381008/f1ea4fe19be2/medinform_v8i6e17650_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c38/7381008/256efcfe8ae3/medinform_v8i6e17650_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c38/7381008/47a3f91f5ff2/medinform_v8i6e17650_fig5.jpg
