• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用上下文临床BERT嵌入理解患者投诉特征。

Understanding patient complaint characteristics using contextual clinical BERT embeddings.

作者信息

Saha Budhaditya, Lisboa Sanal, Ghosh Shameek

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:5847-5850. doi: 10.1109/EMBC44109.2020.9175577.

DOI:10.1109/EMBC44109.2020.9175577
PMID:33019303
Abstract

In clinical conversational applications, extracted entities tend to capture the main subject of a patient's complaint, namely symptoms or diseases. However, they mostly fail to recognize the characterizations of a complaint such as the time, the onset, and the severity. For example, if the input is "I have a headache and it is extreme", state-of-the-art models only recognize the main symptom entity - headache, but ignore the severity factor of extreme, that characterises headache. In this paper, we design a two-fold approach to detect the characterizations of entities like symptoms presented by general users in contexts where they would describe their symptoms to a clinician. We use Word2Vec and BERT models to encode clinical text given by the patients. We transform the output and re-frame the task as a multi-label classification problem. Finally, we combine the processed encodings with the Linear Discriminant Analysis (LDA) algorithm to classify the characterizations of the main entity. Experimental results demonstrate that our method achieves 40-50% improvement in the accuracy over the state-of-the-art models.

摘要

在临床对话应用中,提取的实体往往能够捕捉患者主诉的主要主题,即症状或疾病。然而,它们大多无法识别主诉的特征,如时间、发作情况和严重程度。例如,如果输入是“我头痛,而且非常严重”,最先进的模型只会识别主要症状实体——头痛,而忽略了表征头痛的严重程度因素“非常严重”。在本文中,我们设计了一种双重方法,用于在普通用户向临床医生描述症状的情境中,检测诸如症状等实体的特征。我们使用Word2Vec和BERT模型对患者给出的临床文本进行编码。我们对输出进行转换,并将任务重新构建为多标签分类问题。最后,我们将处理后的编码与线性判别分析(LDA)算法相结合,以对主要实体的特征进行分类。实验结果表明,我们的方法在准确率上比最先进的模型提高了40%-50%。

相似文献

1
Understanding patient complaint characteristics using contextual clinical BERT embeddings.使用上下文临床BERT嵌入理解患者投诉特征。
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:5847-5850. doi: 10.1109/EMBC44109.2020.9175577.
2
Generating contextual embeddings for emergency department chief complaints.为急诊科主要症状生成上下文嵌入。
JAMIA Open. 2020 Jul 15;3(2):160-166. doi: 10.1093/jamiaopen/ooaa022. eCollection 2020 Jul.
3
Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
4
BERT-based Ranking for Biomedical Entity Normalization.基于BERT的生物医学实体规范化排序
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:269-277. eCollection 2020.
5
Incremental Linear Discriminant Analysis: A Fast Algorithm and Comparisons.增量线性判别分析:一种快速算法及比较。
IEEE Trans Neural Netw Learn Syst. 2015 Nov;26(11):2716-35. doi: 10.1109/TNNLS.2015.2391201. Epub 2015 Jan 29.
6
Linking entities through an ontology using word embeddings and syntactic re-ranking.通过使用词向量和句法重新排序将实体链接到本体中。
BMC Bioinformatics. 2019 Mar 27;20(1):156. doi: 10.1186/s12859-019-2678-8.
7
Utility of General and Specific Word Embeddings for Classifying Translational Stages of Research.通用和特定词嵌入在研究转化阶段分类中的效用
AMIA Annu Symp Proc. 2018 Dec 5;2018:1405-1414. eCollection 2018.
8
Discriminative clustering via extreme learning machine.基于极端学习机的判别聚类。
Neural Netw. 2015 Oct;70:1-8. doi: 10.1016/j.neunet.2015.06.002. Epub 2015 Jun 19.
9
Local linear discriminant analysis framework using sample neighbors.使用样本邻域的局部线性判别分析框架。
IEEE Trans Neural Netw. 2011 Jul;22(7):1119-32. doi: 10.1109/TNN.2011.2152852. Epub 2011 Jun 20.
10
A general soft label based linear discriminant analysis for semi-supervised dimensionality reduction.基于通用软标签的半监督降维线性判别分析。
Neural Netw. 2014 Jul;55:83-97. doi: 10.1016/j.neunet.2014.03.005. Epub 2014 Apr 13.

引用本文的文献

1
BERT-based natural language processing analysis of French CT reports: Application to the measurement of the positivity rate for pulmonary embolism.基于BERT的法语CT报告自然语言处理分析:在肺栓塞阳性率测量中的应用
Res Diagn Interv Imaging. 2023 Mar 27;6:100027. doi: 10.1016/j.redii.2023.100027. eCollection 2023 Jun.
2
Extracting Multiple Worries From Breast Cancer Patient Blogs Using Multilabel Classification With the Natural Language Processing Model Bidirectional Encoder Representations From Transformers: Infodemiology Study of Blogs.使用基于Transformer的自然语言处理模型双向编码器表征的多标签分类从乳腺癌患者博客中提取多种担忧:博客的信息流行病学研究
JMIR Cancer. 2022 Jun 3;8(2):e37840. doi: 10.2196/37840.