• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用短信自动构建词汇表以改善抑郁症筛查

Automated Construction of Lexicons to Improve Depression Screening With Text Messages.

出版信息

IEEE J Biomed Health Inform. 2023 Jun;27(6):2751-2759. doi: 10.1109/JBHI.2022.3203345. Epub 2023 Jun 6.

DOI:10.1109/JBHI.2022.3203345
PMID:36044503
Abstract

Given that depression is one of the most prevalent mental illnesses, developing effective and unobtrusive diagnosis tools is of great importance. Recent work that screens for depression with text messages leverage models relying on lexical category features. Given the colloquial nature of text messages, the performance of these models may be limited by formal lexicons. We thus propose a strategy to automatically construct alternative lexicons that contain more relevant and colloquial terms. Specifically, we generate 36 lexicons from fiction, forum, and news corpuses. These lexicons are then used to extract lexical category features from the text messages. We utilize machine learning models to compare the depression screening capabilities of these lexical category features. Out of our 36 constructed lexicons, 14 achieved statistically significantly higher average F1 scores over the pre-existing formal lexicon and basic bag-of-words approach. In comparison to the pre-existing lexicon, our best performing lexicon increased the average F1 scores by 10%. We thus confirm our hypothesis that less formal lexicons can improve the performance of classification models that screen for depression with text messages. By providing our automatically constructed lexicons, we aid future machine learning research that leverages less formal text.

摘要

鉴于抑郁症是最常见的精神疾病之一,开发有效且不引人注目的诊断工具非常重要。最近使用短信筛查抑郁症的工作依赖于基于词汇类别特征的模型。考虑到短信的口语化性质,这些模型的性能可能受到正式词汇的限制。因此,我们提出了一种策略,自动构建包含更相关和口语化术语的替代词汇。具体来说,我们从小说、论坛和新闻语料库中生成了 36 个词汇。然后,我们使用这些词汇从短信中提取词汇类别特征。我们利用机器学习模型来比较这些词汇类别特征对抑郁症筛查的能力。在我们构建的 36 个词汇中,有 14 个在平均 F1 分数上明显高于现有正式词汇和基本词袋方法。与现有词汇相比,我们表现最好的词汇将平均 F1 分数提高了 10%。因此,我们证实了我们的假设,即不那么正式的词汇可以提高使用短信筛查抑郁症的分类模型的性能。通过提供我们自动构建的词汇,我们为利用不那么正式的文本的未来机器学习研究提供了帮助。

相似文献

1
Automated Construction of Lexicons to Improve Depression Screening With Text Messages.利用短信自动构建词汇表以改善抑郁症筛查
IEEE J Biomed Health Inform. 2023 Jun;27(6):2751-2759. doi: 10.1109/JBHI.2022.3203345. Epub 2023 Jun 6.
2
Screening For Depression With Retrospectively Harvested Private Versus Public Text.使用回溯采集的私人文本与公开文本筛查抑郁
IEEE J Biomed Health Inform. 2020 Nov;24(11):3326-3332. doi: 10.1109/JBHI.2020.2983035. Epub 2020 Nov 4.
3
Text Message Analysis Using Machine Learning to Assess Predictors of Engagement With Mobile Health Chronic Disease Prevention Programs: Content Analysis.基于机器学习的短信分析用于评估移动健康慢性病预防计划参与度的预测因素:内容分析。
JMIR Mhealth Uhealth. 2021 Nov 10;9(11):e27779. doi: 10.2196/27779.
4
A comparison of rule-based and machine learning approaches for classifying patient portal messages.基于规则和机器学习方法在患者门户消息分类中的比较。
Int J Med Inform. 2017 Sep;105:110-120. doi: 10.1016/j.ijmedinf.2017.06.004. Epub 2017 Jun 23.
5
Identifying Medication-Related Intents From a Bidirectional Text Messaging Platform for Hypertension Management Using an Unsupervised Learning Approach: Retrospective Observational Pilot Study.利用无监督学习方法从双向短信平台识别高血压管理相关药物意向:回顾性观察性试点研究。
J Med Internet Res. 2022 Jun 29;24(6):e36151. doi: 10.2196/36151.
6
Extracting semantic lexicons from discharge summaries using machine learning and the C-Value method.使用机器学习和C值方法从出院小结中提取语义词典。
AMIA Annu Symp Proc. 2012;2012:409-16. Epub 2012 Nov 3.
7
Using empirically constructed lexical resources for named entity recognition.使用经验构建的词汇资源进行命名实体识别。
Biomed Inform Insights. 2013 Jun 24;6(Suppl 1):17-27. doi: 10.4137/BII.S11664. Print 2013.
8
Automated Detection of Cognitive Distortions in Text Exchanges Between Clinicians and People With Serious Mental Illness.临床医生与重度精神疾病患者文本交流中认知扭曲的自动检测
Psychiatr Serv. 2023 Apr 1;74(4):407-410. doi: 10.1176/appi.ps.202100692. Epub 2022 Sep 27.
9
Text Messages for Depression, Anxiety and Alcohol Abuse Therapy-Are Construction Guidelines Needed?短信用于抑郁、焦虑和酒精滥用治疗——需要构建指南吗?
Int J Environ Res Public Health. 2022 Nov 25;19(23):15701. doi: 10.3390/ijerph192315701.
10
Detecting depression of Chinese microblog users text analysis: Combining Linguistic Inquiry Word Count (LIWC) with culture and suicide related lexicons.检测中国微博用户的抑郁情绪:文本分析——结合语言调查词频统计(LIWC)与文化及自杀相关词汇
Front Psychiatry. 2023 Feb 9;14:1121583. doi: 10.3389/fpsyt.2023.1121583. eCollection 2023.

引用本文的文献

1
Mental Health and Mobile Communication Profiles of Crowdsourced Participants.众包参与者的心理健康与移动通信概况。
IEEE J Biomed Health Inform. 2024 Dec;28(12):7683-7692. doi: 10.1109/JBHI.2024.3436654. Epub 2024 Dec 5.
2
Analyzing text message linguistic features: Do people with depression communicate differently with their close and non-close contacts?分析短信语言特征:患有抑郁症的人与亲密和非亲密联系人的沟通方式是否不同?
Behav Res Ther. 2023 Jul;166:104342. doi: 10.1016/j.brat.2023.104342. Epub 2023 May 27.
3
Classification and analysis of text transcription from Thai depression assessment tasks among patients with depression.
从抑郁症患者的泰国抑郁评估任务中对文本转录进行分类和分析。
PLoS One. 2023 Mar 30;18(3):e0283095. doi: 10.1371/journal.pone.0283095. eCollection 2023.