• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用多任务学习从消费者健康论坛中提取句子、实体和关键短语。

Sentences, entities, and keyphrases extraction from consumer health forums using multi-task learning.

作者信息

Naufal Tsaqif, Mahendra Rahmad, Wicaksono Alfan Farizki

机构信息

Faculty of Computer Science, Universitas Indonesia, Kampus UI, 16424, Depok, West Java, Indonesia.

出版信息

J Biomed Semantics. 2025 May 6;16(1):8. doi: 10.1186/s13326-025-00329-2.

DOI:10.1186/s13326-025-00329-2
PMID:40329333
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12057135/
Abstract

PURPOSE

Online consumer health forums offer an alternative source of health-related information for internet users seeking specific details that may not be readily available through articles or other one-way communication channels. However, the effectiveness of these forums can be constrained by the limited number of healthcare professionals actively participating, which can impact response times to user inquiries. One potential solution to this issue is the integration of a semi-automatic system. A critical component of such a system is question processing, which often involves sentence recognition (SR), medical entity recognition (MER), and keyphrase extraction (KE) modules. We posit that the development of these three modules would enable the system to identify critical components of the question, thereby facilitating a deeper understanding of the question, and allowing for the re-formulation of more effective questions with extracted key information.

METHODS

This work contributes to two key aspects related to these three tasks. First, we expand and publicly release an Indonesian dataset for each task. Second, we establish a baseline for all three tasks within the Indonesian language domain by employing transformer-based models with nine distinct encoder variations. Our feature studies revealed an interdependence among these three tasks. Consequently, we propose several multi-task learning (MTL) models, both in pairwise and three-way configurations, incorporating parallel and hierarchical architectures.

RESULTS

Using F1-score at the chunk level, the inter-annotator agreements for SR, MER, and KE tasks were , and respectively. In single-task learning (STL) settings, the best performance for each task was achieved by different model, with obtained the highest average score. These results suggested that a larger model did not always perform better. We also found no indication of which ones between Indonesian and multilingual language models that generally performed better for our tasks. In pairwise MTL settings, we found that pairing tasks could outperform the STL baseline for all three tasks. Despite varying loss weights across our three-way MTL models, we did not identify a consistent pattern. While some configurations improved MER and KE performance, none surpassed the best pairwise MTL model for the SR task.

CONCLUSION

We extended an Indonesian dataset for SR, MER, and KE tasks, resulted in 1, 173 labeled data points which splitted into 773 training instances, 200 validation instances, and 200 testing instances. We then used transformer-based models to set a baseline for all three tasks. Our MTL experiments suggested that additional information regarding the other two tasks could help the learning process for MER and KE tasks, while had only a small effect for SR task.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f96/12057135/51a8fbc99e9d/13326_2025_329_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f96/12057135/c2fc48242684/13326_2025_329_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f96/12057135/4af0909a0f48/13326_2025_329_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f96/12057135/659dc8f58198/13326_2025_329_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f96/12057135/c626df0d5a1c/13326_2025_329_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f96/12057135/51a8fbc99e9d/13326_2025_329_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f96/12057135/c2fc48242684/13326_2025_329_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f96/12057135/4af0909a0f48/13326_2025_329_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f96/12057135/659dc8f58198/13326_2025_329_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f96/12057135/c626df0d5a1c/13326_2025_329_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f96/12057135/51a8fbc99e9d/13326_2025_329_Fig5_HTML.jpg
摘要

目的

在线消费者健康论坛为寻求特定细节的互联网用户提供了一个与健康相关信息的替代来源,这些细节可能无法通过文章或其他单向沟通渠道轻易获取。然而,这些论坛的有效性可能会受到积极参与的医疗保健专业人员数量有限的限制,这可能会影响对用户询问的响应时间。解决这个问题的一个潜在方案是集成一个半自动系统。这种系统的一个关键组件是问题处理,它通常涉及句子识别(SR)、医学实体识别(MER)和关键短语提取(KE)模块。我们认为,这三个模块的开发将使系统能够识别问题的关键组件,从而促进对问题的更深入理解,并允许用提取的关键信息重新表述更有效的问题。

方法

这项工作在与这三个任务相关的两个关键方面做出了贡献。首先,我们为每个任务扩展并公开发布了一个印尼语数据集。其次,我们通过采用具有九种不同编码器变体的基于变压器的模型,在印尼语领域内为所有三个任务建立了一个基线。我们的特征研究揭示了这三个任务之间的相互依存关系。因此,我们提出了几种多任务学习(MTL)模型,包括成对和三向配置,采用了并行和分层架构。

结果

在块级别使用F1分数,SR、MER和KE任务的标注者间一致性分别为 、 和 。在单任务学习(STL)设置中,每个任务的最佳性能由不同的模型实现, 获得了最高平均分。这些结果表明,更大的模型并不总是表现得更好。我们也没有发现印尼语和多语言模型中哪一个在我们的任务中总体表现更好的迹象。在成对MTL设置中,我们发现配对任务在所有三个任务上都能优于STL基线。尽管我们的三向MTL模型的损失权重各不相同,但我们没有发现一致的模式。虽然一些配置提高了MER和KE的性能,但没有一个超过SR任务的最佳成对MTL模型。

结论

我们扩展了用于SR、MER和KE任务的印尼语数据集,产生了1173个标注数据点,分为773个训练实例、200个验证实例和200个测试实例。然后我们使用基于变压器的模型为所有三个任务设置了一个基线。我们的MTL实验表明,关于其他两个任务的额外信息可以帮助MER和KE任务的学习过程,而对SR任务只有很小的影响。

相似文献

1
Sentences, entities, and keyphrases extraction from consumer health forums using multi-task learning.使用多任务学习从消费者健康论坛中提取句子、实体和关键短语。
J Biomed Semantics. 2025 May 6;16(1):8. doi: 10.1186/s13326-025-00329-2.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Short-Term Memory Impairment短期记忆障碍
4
Sexual Harassment and Prevention Training性骚扰与预防培训
5
Semantic classification of Indonesian consumer health questions.印度尼西亚消费者健康问题的语义分类。
J Biomed Semantics. 2025 Jul 28;16(1):13. doi: 10.1186/s13326-025-00334-5.
6
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.
7
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
8
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
9
Maternal and neonatal outcomes of elective induction of labor.择期引产的母婴结局
Evid Rep Technol Assess (Full Rep). 2009 Mar(176):1-257.
10
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

本文引用的文献

1
Improving large language models for clinical named entity recognition via prompt engineering.通过提示工程改进临床命名实体识别的大型语言模型。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1812-1820. doi: 10.1093/jamia/ocad259.
2
Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition.基于 CNN 和 LSTM 的组合特征嵌入的生物医学命名实体识别。
J Biomed Inform. 2020 Mar;103:103381. doi: 10.1016/j.jbi.2020.103381. Epub 2020 Jan 28.
3
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.
BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
4
Clinical Named Entity Recognition Using Deep Learning Models.使用深度学习模型的临床命名实体识别
AMIA Annu Symp Proc. 2018 Apr 16;2017:1812-1819. eCollection 2017.
5
A New Data Representation Based on Training Data Characteristics to Extract Drug Name Entity in Medical Text.一种基于训练数据特征的新数据表示方法,用于在医学文本中提取药物名称实体。
Comput Intell Neurosci. 2016;2016:3483528. doi: 10.1155/2016/3483528. Epub 2016 Oct 24.
6
Building gold standard corpora for medical natural language processing tasks.为医学自然语言处理任务构建金标准语料库。
AMIA Annu Symp Proc. 2012;2012:144-53. Epub 2012 Nov 3.
7
AskHERMES: An online question answering system for complex clinical questions.AskHERMES:一个用于复杂临床问题的在线问答系统。
J Biomed Inform. 2011 Apr;44(2):277-88. doi: 10.1016/j.jbi.2011.01.004. Epub 2011 Jan 21.
8
Automatically extracting information needs from complex clinical questions.从复杂的临床问题中自动提取信息需求。
J Biomed Inform. 2010 Dec;43(6):962-71. doi: 10.1016/j.jbi.2010.07.007. Epub 2010 Jul 27.
9
Understanding interobserver agreement: the kappa statistic.理解观察者间一致性:kappa统计量。
Fam Med. 2005 May;37(5):360-3.
10
Agreement, the f-measure, and reliability in information retrieval.信息检索中的一致性、F值与可靠性。
J Am Med Inform Assoc. 2005 May-Jun;12(3):296-8. doi: 10.1197/jamia.M1733. Epub 2005 Jan 31.