• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估中国社交媒体中的自杀风险和情绪困扰:一项文本挖掘与机器学习研究。

Assessing Suicide Risk and Emotional Distress in Chinese Social Media: A Text Mining and Machine Learning Study.

作者信息

Cheng Qijin, Li Tim Mh, Kwok Chi-Leung, Zhu Tingshao, Yip Paul Sf

机构信息

HKJC Center for Suicide Research and Prevention, The University of Hong Kong, Hong Kong, China (Hong Kong).

Department of Paediatrics & Adolescent Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China (Hong Kong).

出版信息

J Med Internet Res. 2017 Jul 10;19(7):e243. doi: 10.2196/jmir.7276.

DOI:10.2196/jmir.7276
PMID:28694239
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5525005/
Abstract

BACKGROUND

Early identification and intervention are imperative for suicide prevention. However, at-risk people often neither seek help nor take professional assessment. A tool to automatically assess their risk levels in natural settings can increase the opportunity for early intervention.

OBJECTIVE

The aim of this study was to explore whether computerized language analysis methods can be utilized to assess one's suicide risk and emotional distress in Chinese social media.

METHODS

A Web-based survey of Chinese social media (ie, Weibo) users was conducted to measure their suicide risk factors including suicide probability, Weibo suicide communication (WSC), depression, anxiety, and stress levels. Participants' Weibo posts published in the public domain were also downloaded with their consent. The Weibo posts were parsed and fitted into Simplified Chinese-Linguistic Inquiry and Word Count (SC-LIWC) categories. The associations between SC-LIWC features and the 5 suicide risk factors were examined by logistic regression. Furthermore, the support vector machine (SVM) model was applied based on the language features to automatically classify whether a Weibo user exhibited any of the 5 risk factors.

RESULTS

A total of 974 Weibo users participated in the survey. Those with high suicide probability were marked by a higher usage of pronoun (odds ratio, OR=1.18, P=.001), prepend words (OR=1.49, P=.02), multifunction words (OR=1.12, P=.04), a lower usage of verb (OR=0.78, P<.001), and a greater total word count (OR=1.007, P=.008). Second-person plural was positively associated with severe depression (OR=8.36, P=.01) and stress (OR=11, P=.005), whereas work-related words were negatively associated with WSC (OR=0.71, P=.008), severe depression (OR=0.56, P=.005), and anxiety (OR=0.77, P=.02). Inconsistently, third-person plural was found to be negatively associated with WSC (OR=0.02, P=.047) but positively with severe stress (OR=41.3, P=.04). Achievement-related words were positively associated with depression (OR=1.68, P=.003), whereas health- (OR=2.36, P=.004) and death-related (OR=2.60, P=.01) words positively associated with stress. The machine classifiers did not achieve satisfying performance in the full sample set but could classify high suicide probability (area under the curve, AUC=0.61, P=.04) and severe anxiety (AUC=0.75, P<.001) among those who have exhibited WSC.

CONCLUSIONS

SC-LIWC is useful to examine language markers of suicide risk and emotional distress in Chinese social media and can identify characteristics different from previous findings in the English literature. Some findings are leading to new hypotheses for future verification. Machine classifiers based on SC-LIWC features are promising but still require further optimization for application in real life.

摘要

背景

早期识别和干预对于预防自杀至关重要。然而,有自杀风险的人往往既不寻求帮助,也不接受专业评估。一种能在自然环境中自动评估其风险水平的工具可以增加早期干预的机会。

目的

本研究旨在探讨计算机化语言分析方法是否可用于评估中国社交媒体用户的自杀风险和情绪困扰。

方法

对中国社交媒体(即微博)用户进行了一项基于网络的调查,以测量他们的自杀风险因素,包括自杀可能性、微博自杀交流(WSC)、抑郁、焦虑和压力水平。在参与者同意的情况下,还下载了他们在公共领域发布的微博帖子。对微博帖子进行解析,并将其归入简体中文语言查询与字数统计(SC-LIWC)类别。通过逻辑回归分析SC-LIWC特征与5种自杀风险因素之间的关联。此外,基于语言特征应用支持向量机(SVM)模型自动分类微博用户是否表现出5种风险因素中的任何一种。

结果

共有974名微博用户参与了调查。自杀可能性高的用户具有较高的代词使用率(优势比,OR=1.18,P=0.001)、前置词使用率(OR=1.49,P=0.02)、多功能词使用率(OR=1.12,P=0.04)、较低的动词使用率(OR=0.78,P<0.001)以及较多的总字数(OR=1.007,P=0.008)。第二人称复数与严重抑郁(OR=8.36,P=0.01)和压力(OR=11,P=0.005)呈正相关,而与工作相关的词汇与WSC(OR=0.71,P=0.008)、严重抑郁(OR=0.56,P=0.005)和焦虑(OR=0.77,P=0.02)呈负相关。不一致的是,第三人称复数与WSC呈负相关(OR=0.02,P=0.047),但与严重压力呈正相关(OR=41.3,P=0.04)。与成就相关的词汇与抑郁呈正相关(OR=1.68,P=0.003),而与健康相关(OR=2.36,P=0.004)和与死亡相关(OR=2.60,P=0.01)的词汇与压力呈正相关。机器分类器在全样本集中未取得令人满意的性能,但在表现出WSC的人群中能够对高自杀可能性(曲线下面积,AUC=0.61,P=0.04)和严重焦虑(AUC=0.75,P<0.001)进行分类。

结论

SC-LIWC有助于检测中国社交媒体中自杀风险和情绪困扰的语言标记,并能识别出与英文文献中先前发现不同的特征。一些发现为未来的验证提出了新的假设。基于SC-LIWC特征的机器分类器前景广阔,但仍需进一步优化以应用于现实生活。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f5f/5525005/430234f2fd75/jmir_v19i7e243_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f5f/5525005/430234f2fd75/jmir_v19i7e243_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f5f/5525005/430234f2fd75/jmir_v19i7e243_fig1.jpg

相似文献

1
Assessing Suicide Risk and Emotional Distress in Chinese Social Media: A Text Mining and Machine Learning Study.评估中国社交媒体中的自杀风险和情绪困扰:一项文本挖掘与机器学习研究。
J Med Internet Res. 2017 Jul 10;19(7):e243. doi: 10.2196/jmir.7276.
2
Proactive Suicide Prevention Online (PSPO): Machine Identification and Crisis Management for Chinese Social Media Users With Suicidal Thoughts and Behaviors.在线主动预防自杀(PSPO):针对有自杀想法和行为的中国社交媒体用户的机器识别与危机管理
J Med Internet Res. 2019 May 8;21(5):e11705. doi: 10.2196/11705.
3
Data Mining and Content Analysis of the Chinese Social Media Platform Weibo During the Early COVID-19 Outbreak: Retrospective Observational Infoveillance Study.新冠疫情早期的中文社交媒体平台微博数据挖掘和内容分析:回顾性观察性信息监测研究。
JMIR Public Health Surveill. 2020 Apr 21;6(2):e18700. doi: 10.2196/18700.
4
Suicide Communication on Social Media and Its Psychological Mechanisms: An Examination of Chinese Microblog Users.社交媒体上的自杀相关交流及其心理机制:对中国微博用户的考察
Int J Environ Res Public Health. 2015 Sep 11;12(9):11506-27. doi: 10.3390/ijerph120911506.
5
Linguistic Analysis for Identifying Depression and Subsequent Suicidal Ideation on Weibo: Machine Learning Approaches.基于微博的语言分析识别抑郁及随后的自杀意念:机器学习方法。
Int J Environ Res Public Health. 2023 Feb 2;20(3):2688. doi: 10.3390/ijerph20032688.
6
Detecting depression of Chinese microblog users text analysis: Combining Linguistic Inquiry Word Count (LIWC) with culture and suicide related lexicons.检测中国微博用户的抑郁情绪:文本分析——结合语言调查词频统计(LIWC)与文化及自杀相关词汇
Front Psychiatry. 2023 Feb 9;14:1121583. doi: 10.3389/fpsyt.2023.1121583. eCollection 2023.
7
Automatic anxiety recognition method based on microblog text analysis.基于微博文本分析的自动焦虑识别方法。
Front Public Health. 2023 Mar 20;11:1080013. doi: 10.3389/fpubh.2023.1080013. eCollection 2023.
8
Public Attitudes Toward Anxiety Disorder on Sina Weibo: Content Analysis.公众对新浪微博上焦虑障碍的态度:内容分析。
J Med Internet Res. 2023 Apr 4;25:e45777. doi: 10.2196/45777.
9
Public Discourse and Sentiment Toward Dementia on Chinese Social Media: Machine Learning Analysis of Weibo Posts.中文社交媒体中公众对痴呆症的话语和情绪:微博帖子的机器学习分析。
J Med Internet Res. 2022 Sep 2;24(9):e39805. doi: 10.2196/39805.
10
Natural Language Processing for Depression Prediction on Sina Weibo: Method Study and Analysis.基于新浪微博的抑郁预测的自然语言处理方法研究与分析
JMIR Ment Health. 2024 Sep 4;11:e58259. doi: 10.2196/58259.

引用本文的文献

1
Large-Scale Deep Learning-Enabled Infodemiological Analysis of Substance Use Patterns on Social Media: Insights From the COVID-19 Pandemic.基于大规模深度学习的社交媒体上物质使用模式的信息流行病学分析:来自新冠疫情的见解
JMIR Infodemiology. 2025 Apr 17;5:e59076. doi: 10.2196/59076.
2
Text-Based Depression Prediction on Social Media Using Machine Learning: Systematic Review and Meta-Analysis.基于文本的社交媒体抑郁症预测:机器学习的系统评价与荟萃分析
J Med Internet Res. 2025 Apr 11;27:e59002. doi: 10.2196/59002.
3
Explainable AI-based suicidal and non-suicidal ideations detection from social media text with enhanced ensemble technique.

本文引用的文献

1
Detecting Changes in Suicide Content Manifested in Social Media Following Celebrity Suicides.名人自杀后社交媒体上自杀内容的变化检测
HT ACM Conf Hypertext Soc Media. 2015 Sep;2015:85-94. doi: 10.1145/2700171.2791026.
2
A Linguistic Analysis of Suicide-Related Twitter Posts.自杀相关推特帖的语言分析。
Crisis. 2017 Sep;38(5):319-329. doi: 10.1027/0227-5910/a000443. Epub 2017 Feb 23.
3
Dying online: live broadcasts of Chinese emerging adult suicides and crisis response behaviors.线上死亡:中国新兴成年人自杀的直播及危机应对行为
基于可解释人工智能的社交媒体文本自杀和非自杀意念检测及增强集成技术
Sci Rep. 2025 Jan 7;15(1):1111. doi: 10.1038/s41598-024-84275-6.
4
Automatic screening for posttraumatic stress disorder in early adolescents following the Ya'an earthquake using text mining techniques.运用文本挖掘技术对雅安地震后青少年早期创伤后应激障碍进行自动筛查。
Front Psychiatry. 2024 Dec 11;15:1439720. doi: 10.3389/fpsyt.2024.1439720. eCollection 2024.
5
Prediction techniques of movie box office using neural networks and emotional mining.基于神经网络与情感挖掘的电影票房预测技术
Sci Rep. 2024 Sep 11;14(1):21209. doi: 10.1038/s41598-024-72340-z.
6
Identifying Reddit Users at a High Risk of Suicide and Their Linguistic Features During the COVID-19 Pandemic: Growth-Based Trajectory Model.识别新冠疫情期间有高自杀风险的Reddit用户及其语言特征:基于增长的轨迹模型
J Med Internet Res. 2024 Aug 8;26:e48907. doi: 10.2196/48907.
7
Social Service Workers' Use of Social Media to Obtain Client Information: Current Practices and Perspectives on a Potential Informatics Platform.社会服务工作者利用社交媒体获取客户信息:当前实践及对潜在信息平台的看法。
J Soc Serv Res. 2022;48(6):739-752. doi: 10.1080/01488376.2022.2148037. Epub 2022 Nov 23.
8
Detection of Suicidal Ideation in Clinical Interviews for Depression Using Natural Language Processing and Machine Learning: Cross-Sectional Study.使用自然语言处理和机器学习在抑郁症临床访谈中检测自杀意念:横断面研究
JMIR Med Inform. 2023 Dec 1;11:e50221. doi: 10.2196/50221.
9
Digital Phenotyping: Data-Driven Psychiatry to Redefine Mental Health.数字表型:以数据为驱动的精神病学重新定义精神健康。
J Med Internet Res. 2023 Oct 4;25:e44502. doi: 10.2196/44502.
10
Assessing Vulnerability to Surges in Suicide-Related Tweets Using Japan Census Data: Case-Only Study.利用日本人口普查数据评估自杀相关推文激增的易感性:仅病例研究
JMIR Form Res. 2023 Aug 10;7:e47798. doi: 10.2196/47798.
BMC Public Health. 2016 Aug 11;16(1):774. doi: 10.1186/s12889-016-3415-0.
4
Validating Machine Learning Algorithms for Twitter Data Against Established Measures of Suicidality.验证针对自杀倾向的已确立衡量标准的 Twitter 数据的机器学习算法。
JMIR Ment Health. 2016 May 16;3(2):e21. doi: 10.2196/mental.4822.
5
Is suicide assessment harmful to participants? Findings from a randomized controlled trial.自杀评估对参与者有害吗?一项随机对照试验的结果。
Int J Ment Health Nurs. 2017 Apr;26(2):181-190. doi: 10.1111/inm.12223. Epub 2016 Apr 16.
6
Creating a Chinese suicide dictionary for identifying suicide risk on social media.创建一个用于识别社交媒体上自杀风险的中文自杀词典。
PeerJ. 2015 Dec 15;3:e1455. doi: 10.7717/peerj.1455. eCollection 2015.
7
Identifying Chinese Microblog Users With High Suicide Probability Using Internet-Based Profile and Linguistic Features: Classification Model.基于互联网个人资料和语言特征识别有高自杀倾向的中国微博用户:分类模型。
JMIR Ment Health. 2015 May 12;2(2):e17. doi: 10.2196/mental.4227. eCollection 2015 Apr-Jun.
8
Suicide Communication on Social Media and Its Psychological Mechanisms: An Examination of Chinese Microblog Users.社交媒体上的自杀相关交流及其心理机制:对中国微博用户的考察
Int J Environ Res Public Health. 2015 Sep 11;12(9):11506-27. doi: 10.3390/ijerph120911506.
9
The ABC's of Suicide Risk Assessment: Applying a Tripartite Approach to Individual Evaluations.自杀风险评估基础:将三方评估方法应用于个体评估
PLoS One. 2015 Jun 1;10(6):e0127442. doi: 10.1371/journal.pone.0127442. eCollection 2015.
10
Suicide-related Internet use: A review.与自杀相关的互联网使用:综述
Aust N Z J Psychiatry. 2015 Aug;49(8):697-705. doi: 10.1177/0004867415569797. Epub 2015 Feb 19.