• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

识别患者生成的关于新冠病毒的远程医疗查询的感知严重程度:开发和评估基于迁移学习的解决方案。

Identifying the Perceived Severity of Patient-Generated Telemedical Queries Regarding COVID: Developing and Evaluating a Transfer Learning-Based Solution.

作者信息

Gatto Joseph, Seegmiller Parker, Johnston Garrett, Preum Sarah Masud

机构信息

Department of Computer Science, Dartmouth College, Hanover, NH, United States.

出版信息

JMIR Med Inform. 2022 Sep 2;10(9):e37770. doi: 10.2196/37770.

DOI:10.2196/37770
PMID:35981230
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9446665/
Abstract

BACKGROUND

Triage of textual telemedical queries is a safety-critical task for medical service providers with limited remote health resources. The prioritization of patient queries containing medically severe text is necessary to optimize resource usage and provide care to those with time-sensitive needs.

OBJECTIVE

We aim to evaluate the effectiveness of transfer learning solutions on the task of telemedical triage and provide a thorough error analysis, identifying telemedical queries that challenge state-of-the-art natural language processing (NLP) systems. Additionally, we aim to provide a publicly available telemedical query data set with labels for severity classification for telemedical triage of respiratory issues.

METHODS

We annotated 573 medical queries from 3 online health platforms: HealthTap, HealthcareMagic, and iCliniq. We then evaluated 6 transfer learning solutions utilizing various text-embedding strategies. Specifically, we first established a baseline using a lexical classification model with term frequency-inverse document frequency (TF-IDF) features. Next, we investigated the effectiveness of global vectors for text representation (GloVe), a pretrained word-embedding method. We evaluated the performance of GloVe embeddings in the context of support vector machines (SVMs), bidirectional long short-term memory (bi-LSTM) networks, and hierarchical attention networks (HANs). Finally, we evaluated the performance of contextual text embeddings using transformer-based architectures. Specifically, we evaluated bidirectional encoder representation from transformers (BERT), Bio+Clinical-BERT, and Sentence-BERT (SBERT) on the telemedical triage task.

RESULTS

We found that a simple lexical model achieved a mean F1 score of 0.865 (SD 0.048) on the telemedical triage task. GloVe-based models using SVMs, HANs, and bi-LSTMs achieved a 0.8-, 1.5-, and 2.1-point increase in the F1 score, respectively. Transformer-based models, such as BERT, Bio+Clinical-BERT, and SBERT, achieved a mean F1 score of 0.914 (SD 0.034), 0.904 (SD 0.041), and 0.917 (SD 0.037), respectively. The highest-performing model, SBERT, provided a statistically significant improvement compared to all GloVe-based and lexical baselines. However, no statistical significance was found when comparing transformer-based models. Furthermore, our error analysis revealed highly challenging query types, including those with complex negations, temporal relationships, and patient intents.

CONCLUSIONS

We showed that state-of-the-art transfer learning techniques work well on the telemedical triage task, providing significant performance increase over lexical models. Additionally, we released a public telemedical triage data set using labeled questions from online medical question-and-answer (Q&A) platforms. Our analysis highlights various avenues for future works that explicitly model such query challenges.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d295/9446665/c40824ec8829/medinform_v10i9e37770_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d295/9446665/d19cc43461ef/medinform_v10i9e37770_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d295/9446665/c40824ec8829/medinform_v10i9e37770_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d295/9446665/d19cc43461ef/medinform_v10i9e37770_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d295/9446665/c40824ec8829/medinform_v10i9e37770_fig2.jpg
摘要

背景

对于远程医疗资源有限的医疗服务提供者而言,对文本形式的远程医疗咨询进行分诊是一项关乎安全的关键任务。对包含医学严重文本的患者咨询进行优先级排序,对于优化资源使用以及为有时间敏感性需求的患者提供护理至关重要。

目的

我们旨在评估迁移学习解决方案在远程医疗分诊任务中的有效性,并进行全面的错误分析,识别对当前最先进的自然语言处理(NLP)系统构成挑战的远程医疗咨询。此外,我们旨在提供一个公开可用的远程医疗咨询数据集,其中带有用于呼吸问题远程医疗分诊严重程度分类的标签。

方法

我们对来自3个在线健康平台(HealthTap、HealthcareMagic和iCliniq)的573条医疗咨询进行了标注。然后,我们评估了6种利用各种文本嵌入策略的迁移学习解决方案。具体而言,我们首先使用具有词频 - 逆文档频率(TF - IDF)特征的词汇分类模型建立基线。接下来,我们研究了用于文本表示的全局向量(GloVe)这种预训练词嵌入方法的有效性。我们在支持向量机(SVM)、双向长短期记忆(bi - LSTM)网络和层次注意力网络(HAN)的背景下评估了GloVe嵌入的性能。最后,我们使用基于Transformer的架构评估上下文文本嵌入的性能。具体来说,我们在远程医疗分诊任务中评估了来自Transformer的双向编码器表示(BERT)、Bio + Clinical - BERT和句子BERT(SBERT)。

结果

我们发现一个简单的词汇模型在远程医疗分诊任务上的平均F1分数为0.865(标准差0.048)。使用SVM、HAN和bi - LSTM的基于GloVe的模型在F1分数上分别提高了0.8、1.5和2.1分。基于Transformer的模型,如BERT、Bio + Clinical - BERT和SBERT,平均F1分数分别为0.914(标准差0.034)、0.904(标准差0.041)和0.917(标准差0.037)。性能最高的模型SBERT与所有基于GloVe的和词汇基线相比,提供了具有统计学意义的改进。然而,在比较基于Transformer的模型时未发现统计学意义。此外,我们的错误分析揭示了极具挑战性的查询类型,包括那些具有复杂否定、时间关系和患者意图的查询。

结论

我们表明,当前最先进的迁移学习技术在远程医疗分诊任务中表现良好,与词汇模型相比性能有显著提升。此外,我们使用来自在线医疗问答(Q&A)平台的带标签问题发布了一个公开的远程医疗分诊数据集。我们 的分析突出了未来工作的各种途径,这些工作将明确对这类查询挑战进行建模。

相似文献

1
Identifying the Perceived Severity of Patient-Generated Telemedical Queries Regarding COVID: Developing and Evaluating a Transfer Learning-Based Solution.识别患者生成的关于新冠病毒的远程医疗查询的感知严重程度:开发和评估基于迁移学习的解决方案。
JMIR Med Inform. 2022 Sep 2;10(9):e37770. doi: 10.2196/37770.
2
A comparative study on deep learning models for text classification of unstructured medical notes with various levels of class imbalance.深度学习模型在不同类别不平衡程度的非结构化医疗记录文本分类中的对比研究。
BMC Med Res Methodol. 2022 Jul 2;22(1):181. doi: 10.1186/s12874-022-01665-y.
3
Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。
Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.
4
Multi-Label Classification in Patient-Doctor Dialogues With the RoBERTa-WWM-ext + CNN (Robustly Optimized Bidirectional Encoder Representations From Transformers Pretraining Approach With Whole Word Masking Extended Combining a Convolutional Neural Network) Model: Named Entity Study.基于RoBERTa-WWM-ext + CNN(带有全词掩码扩展的基于变换器预训练方法的稳健优化双向编码器表示与卷积神经网络相结合)模型的医患对话多标签分类:命名实体研究
JMIR Med Inform. 2022 Apr 21;10(4):e35606. doi: 10.2196/35606.
5
Predicting Semantic Similarity Between Clinical Sentence Pairs Using Transformer Models: Evaluation and Representational Analysis.使用Transformer模型预测临床句子对之间的语义相似性:评估与表征分析
JMIR Med Inform. 2021 May 26;9(5):e23099. doi: 10.2196/23099.
6
Few-Shot Learning for Clinical Natural Language Processing Using Siamese Neural Networks: Algorithm Development and Validation Study.使用暹罗神经网络的临床自然语言处理少样本学习:算法开发与验证研究
JMIR AI. 2023 May 4;2:e44293. doi: 10.2196/44293.
7
When BERT meets Bilbo: a learning curve analysis of pretrained language model on disease classification.当 BERT 遇见比尔博:预训练语言模型在疾病分类上的学习曲线分析。
BMC Med Inform Decis Mak. 2022 Apr 5;21(Suppl 9):377. doi: 10.1186/s12911-022-01829-2.
8
Identification of Semantically Similar Sentences in Clinical Notes: Iterative Intermediate Training Using Multi-Task Learning.临床笔记中语义相似句子的识别:使用多任务学习的迭代中间训练
JMIR Med Inform. 2020 Nov 27;8(11):e22508. doi: 10.2196/22508.
9
Relation Classification for Bleeding Events From Electronic Health Records Using Deep Learning Systems: An Empirical Study.使用深度学习系统对电子健康记录中的出血事件进行关系分类:一项实证研究。
JMIR Med Inform. 2021 Jul 2;9(7):e27527. doi: 10.2196/27527.
10
Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT).使用基于转换器的双向编码器表示 (BERT) 和领域内预训练 (IDPT) 对耳鸣患者的可操作放射学报告进行自动文本分类。
BMC Med Inform Decis Mak. 2022 Jul 30;22(1):200. doi: 10.1186/s12911-022-01946-y.

引用本文的文献

1
Words to live by: Using medic impressions to identify the need for prehospital lifesaving interventions.生存准则:利用医疗印象识别院前救生干预的需求。
Acad Emerg Med. 2025 May;32(5):516-525. doi: 10.1111/acem.15067. Epub 2025 Jan 24.
2
Improving Triage Accuracy in Prehospital Emergency Telemedicine: Scoping Review of Machine Learning-Enhanced Approaches.提高院前急救远程医疗中的分诊准确性:机器学习增强方法的范围综述
Interact J Med Res. 2024 Sep 11;13:e56729. doi: 10.2196/56729.

本文引用的文献

1
Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.停止为高风险决策解释黑箱机器学习模型,转而使用可解释模型。
Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.
2
A Novel Deep Learning-Based System for Triage in the Emergency Department Using Electronic Medical Records: Retrospective Cohort Study.一种基于深度学习的利用电子病历进行急诊科分诊的新系统:回顾性队列研究。
J Med Internet Res. 2021 Dec 27;23(12):e27008. doi: 10.2196/27008.
3
Topics and Sentiments of Public Concerns Regarding COVID-19 Vaccines: Social Media Trend Analysis.
公众对新冠疫苗关注的话题和情绪:社交媒体趋势分析。
J Med Internet Res. 2021 Oct 21;23(10):e30765. doi: 10.2196/30765.
4
Distant Supervision for Mental Health Management in Social Media: Suicide Risk Classification System Development Study.社交媒体中精神健康管理的远程监督:自杀风险分类系统开发研究。
J Med Internet Res. 2021 Aug 26;23(8):e26119. doi: 10.2196/26119.
5
Demographics associated with US healthcare disparities are exacerbated by the telemedicine surge during the COVID-19 pandemic.与美国医疗保健差距相关的人口统计学问题因 COVID-19 大流行期间远程医疗的激增而加剧。
J Telemed Telecare. 2024 Jan;30(1):64-71. doi: 10.1177/1357633X211025939. Epub 2021 Jun 23.
6
A Review of Patient Satisfaction and Experience with Telemedicine: A Virtual Solution During and Beyond COVID-19 Pandemic.远程医疗在新冠疫情期间和之后的应用:患者满意度和体验的回顾。
Telemed J E Health. 2021 Dec;27(12):1325-1331. doi: 10.1089/tmj.2020.0570. Epub 2021 Mar 12.
7
On the Front (Phone) Lines: Results of a COVID-19 Hotline.在前线(电话)线上:新冠疫情热线的结果。
J Am Board Fam Med. 2021 Feb;34(Suppl):S95-S102. doi: 10.3122/jabfm.2021.S1.200237.
8
Toward Using Twitter for Tracking COVID-19: A Natural Language Processing Pipeline and Exploratory Data Set.用于追踪 COVID-19 的 Twitter:自然语言处理管道和探索性数据集。
J Med Internet Res. 2021 Jan 22;23(1):e25314. doi: 10.2196/25314.
9
Backslide or forward progress? Virtual care at U.S. healthcare systems beyond the COVID-19 pandemic.退步还是进步?新冠疫情之后美国医疗系统中的虚拟医疗
NPJ Digit Med. 2021 Jan 8;4(1):6. doi: 10.1038/s41746-020-00379-z.
10
"Down the Rabbit Hole" of Vaccine Misinformation on YouTube: Network Exposure Study.《YouTube 上疫苗错误信息的“兔子洞”:网络暴露研究》
J Med Internet Res. 2021 Jan 5;23(1):e23262. doi: 10.2196/23262.