• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用文本挖掘技术从电子健康记录中提取抑郁症状并验证重性抑郁障碍的诊断。

Using text mining to extract depressive symptoms and to validate the diagnosis of major depressive disorder from electronic health records.

机构信息

Department of Psychiatry, National Taiwan University Hospital, Taipei, Taiwan R.O.C; College of Medicine, National Taiwan University, Taipei, Taiwan R.O.C.

Taipei City Psychiatric Center, Taipei City Hospital, Taipei, Taiwan R.O.C; Department of Psychiatry, School of Medicine, College of Medicine, Taipei Medical University, Taiwan R.O.C.

出版信息

J Affect Disord. 2020 Jan 1;260:617-623. doi: 10.1016/j.jad.2019.09.044. Epub 2019 Sep 11.

DOI:10.1016/j.jad.2019.09.044
PMID:31541973
Abstract

BACKGROUND

Many studies have used Taiwan's National Health Insurance Research database (NHIRD) to conduct psychiatric research. However, the accuracy of the diagnostic codes for psychiatric disorders in NHIRD is not validated, and the symptom profiles are not available either. This study aimed to evaluate the accuracy of diagnostic codes and use text mining to extract symptom profile and functional impairment from electronic health records (EHRs) to overcome the above research limitations.

METHODS

A total of 500 discharge notes were randomly selected from a medical center's database. Three annotators reviewed the notes to establish gold standards. The accuracy of diagnostic codes for major psychiatric illness was evaluated. Text mining approaches were applied to extract depressive symptoms and function profiles and to identify patients with major depressive disorder.

RESULTS

The accuracy of the diagnostic code for major depressive disorder, schizophrenia, and dementia was acceptable but that of bipolar disorder and minor depression was less satisfactory. The performance of text mining approach to recognize depressive symptoms is satisfactory; however, the recall for functional impairment is lower resulting in lower F-scores of 0.774-0.753. Using the text mining approach to identify major depressive disorder, the recall was 0.85 but precision was only 0.69.

CONCLUSIONS

The accuracy of the diagnostic code for major depressive disorder in discharge notes was generally acceptable. This finding supports the utilization of psychiatric diagnoses in claims databases. The application of text mining to EHRs might help in overcoming current limitations in research using claims databases.

摘要

背景

许多研究利用台湾全民健康保险研究数据库(NHIRD)进行精神科研究。然而,NHIRD 中精神障碍诊断代码的准确性未经验证,症状特征也不可用。本研究旨在评估诊断代码的准确性,并利用文本挖掘从电子健康记录(EHR)中提取症状特征和功能障碍,以克服上述研究限制。

方法

从一家医学中心的数据库中随机抽取 500 份出院记录。三名注释员对记录进行审查,以建立黄金标准。评估了主要精神疾病诊断代码的准确性。应用文本挖掘方法提取抑郁症状和功能特征,并识别出患有重度抑郁症的患者。

结果

重度抑郁症、精神分裂症和痴呆症的诊断代码准确性可以接受,但双相情感障碍和轻度抑郁症的准确性则不太令人满意。文本挖掘方法识别抑郁症状的性能令人满意;然而,对功能障碍的召回率较低,导致 F 分数较低,为 0.774-0.753。使用文本挖掘方法识别重度抑郁症,召回率为 0.85,但精度仅为 0.69。

结论

出院记录中重度抑郁症诊断代码的准确性总体上可以接受。这一发现支持在理赔数据库中使用精神科诊断。将文本挖掘应用于 EHR 可能有助于克服使用理赔数据库进行研究的当前限制。

相似文献

1
Using text mining to extract depressive symptoms and to validate the diagnosis of major depressive disorder from electronic health records.利用文本挖掘技术从电子健康记录中提取抑郁症状并验证重性抑郁障碍的诊断。
J Affect Disord. 2020 Jan 1;260:617-623. doi: 10.1016/j.jad.2019.09.044. Epub 2019 Sep 11.
2
Accuracy of claim data in the identification and classification of adults with congenital heart diseases in electronic medical records.电子病历中索赔数据对成人先天性心脏病的识别和分类的准确性。
Arch Cardiovasc Dis. 2019 Jan;112(1):31-43. doi: 10.1016/j.acvd.2018.07.002. Epub 2019 Jan 3.
3
Identifying phenotypic signatures of neuropsychiatric disorders from electronic medical records.从电子病历中识别神经精神障碍的表型特征。
J Am Med Inform Assoc. 2013 Dec;20(e2):e297-305. doi: 10.1136/amiajnl-2013-001933. Epub 2013 Aug 16.
4
Improvement of the quality of medical databases: data-mining-based prediction of diagnostic codes from previous patient codes.医学数据库质量的提升:基于数据挖掘从既往患者编码预测诊断编码
Stud Health Technol Inform. 2015;210:419-23.
5
Auto-selection of DRG codes from discharge summaries by text mining in several hospitals: analysis of difference of discharge summaries.多家医院通过文本挖掘从出院小结中自动选择疾病诊断相关分组(DRG)编码:出院小结差异分析
Stud Health Technol Inform. 2010;160(Pt 2):1020-4.
6
Identifying Patients With Relapsing-Remitting Multiple Sclerosis Using Algorithms Applied to US Integrated Delivery Network Healthcare Data.使用应用于美国综合交付网络医疗保健数据的算法识别复发缓解型多发性硬化症患者。
Value Health. 2019 Jan;22(1):77-84. doi: 10.1016/j.jval.2018.06.014. Epub 2018 Aug 9.
7
Prevalence and humanistic impact of potential misdiagnosis of bipolar disorder among patients with major depressive disorder in a commercially insured population.商业保险人群中重度抑郁症患者双相情感障碍潜在误诊的患病率及人文影响
J Manag Care Pharm. 2008 Sep;14(7):631-42.
8
Symptom presence versus symptom intensity in understanding the severity of depression: Implications for documentation in electronic medical records.在理解抑郁症严重程度方面,症状出现与症状强度:对电子病历记录的启示。
J Affect Disord. 2019 Sep 1;256:344-347. doi: 10.1016/j.jad.2019.05.073. Epub 2019 May 31.
9
Text mining applied to electronic cardiovascular procedure reports to identify patients with trileaflet aortic stenosis and coronary artery disease.文本挖掘应用于电子心血管手术报告,以识别患有三叶瓣主动脉狭窄和冠状动脉疾病的患者。
J Biomed Inform. 2017 Aug;72:77-84. doi: 10.1016/j.jbi.2017.06.016. Epub 2017 Jun 15.
10
Augmenting DSM-5 diagnostic criteria with self-attention-based BiLSTM models for psychiatric diagnosis.使用基于自注意力的双向长短期记忆模型增强DSM-5诊断标准以进行精神疾病诊断。
Artif Intell Med. 2023 Feb;136:102488. doi: 10.1016/j.artmed.2023.102488. Epub 2023 Jan 11.

引用本文的文献

1
The increased risk of exposure to fine particulate matter for depression incidence is mediated by elevated TNF-R1: the Healthy Aging Longitudinal Study.抑郁发病率与细颗粒物接触风险增加之间的关联是由肿瘤坏死因子受体1(TNF-R1)水平升高介导的:健康老龄化纵向研究。
Environ Health Prev Med. 2025;30:49. doi: 10.1265/ehpm.25-00106.
2
Text mining of outpatient narrative notes to predict the risk of psychiatric hospitalization.挖掘门诊病历中的文本以预测精神科住院风险。
Transl Psychiatry. 2025 Feb 20;15(1):60. doi: 10.1038/s41398-025-03276-9.
3
Panic disorder and suicide.
惊恐障碍与自杀
Psychol Med. 2025 Feb 10;55:e38. doi: 10.1017/S0033291724003441.
4
Natural language processing to identify suicidal ideation and anhedonia in major depressive disorder.利用自然语言处理技术识别重度抑郁症中的自杀意念和快感缺乏。
BMC Med Inform Decis Mak. 2025 Jan 13;25(1):20. doi: 10.1186/s12911-025-02851-w.
5
Advancing psychosocial disability and psychosocial rehabilitation research through large language models and computational text mining.通过大语言模型和计算文本挖掘推进心理社会残疾与心理社会康复研究。
Glob Ment Health (Camb). 2024 Dec 13;11:e123. doi: 10.1017/gmh.2024.114. eCollection 2024.
6
Automatic screening for posttraumatic stress disorder in early adolescents following the Ya'an earthquake using text mining techniques.运用文本挖掘技术对雅安地震后青少年早期创伤后应激障碍进行自动筛查。
Front Psychiatry. 2024 Dec 11;15:1439720. doi: 10.3389/fpsyt.2024.1439720. eCollection 2024.
7
Incidence and Risk of Cardiovascular Outcomes in Patients With Anorexia Nervosa.神经性厌食症患者心血管结局的发生率及风险
JAMA Netw Open. 2024 Dec 2;7(12):e2451094. doi: 10.1001/jamanetworkopen.2024.51094.
8
Familial coaggregation and shared genetic influence between major depressive disorder and gynecological diseases.抑郁症与妇科疾病之间的家族聚集性和共同遗传影响。
Eur J Epidemiol. 2024 Oct;39(10):1161-1170. doi: 10.1007/s10654-024-01166-w. Epub 2024 Nov 4.
9
A study of a PERMA-based positive psychological intervention programme in subthreshold depressed patients with newly diagnosed breast cancer: formulation and application.一项针对新诊断乳腺癌的阈下抑郁患者的基于PERMA的积极心理干预项目的研究:制定与应用
Front Psychol. 2024 Oct 15;15:1437025. doi: 10.3389/fpsyg.2024.1437025. eCollection 2024.
10
Risk of suicide in patients with atrial fibrillation receiving different oral anticoagulants: a nationwide analysis using target trial emulation framework.不同口服抗凝药物治疗心房颤动患者的自杀风险:使用目标试验模拟框架的全国性分析。
BMC Med. 2024 Oct 11;22(1):451. doi: 10.1186/s12916-024-03645-z.