• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

迈向自动化消费者问答:在医疗保健领域自动区分消费者问题和专业问题。

Toward automated consumer question answering: automatically separating consumer questions from professional questions in the healthcare domain.

机构信息

Department of Health Sciences, University of Wisconsin-Milwaukee, Milwaukee, WI 53211, United States.

出版信息

J Biomed Inform. 2011 Dec;44(6):1032-8. doi: 10.1016/j.jbi.2011.08.008. Epub 2011 Aug 12.

DOI:10.1016/j.jbi.2011.08.008
PMID:21856442
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3226885/
Abstract

OBJECTIVE

Both healthcare professionals and healthcare consumers have information needs that can be met through the use of computers, specifically via medical question answering systems. However, the information needs of both groups are different in terms of literacy levels and technical expertise, and an effective question answering system must be able to account for these differences if it is to formulate the most relevant responses for users from each group. In this paper, we propose that a first step toward answering the queries of different users is automatically classifying questions according to whether they were asked by healthcare professionals or consumers.

DESIGN

We obtained two sets of consumer questions (~10,000 questions in total) from Yahoo answers. The professional questions consist of two question collections: 4654 point-of-care questions (denoted as PointCare) obtained from interviews of a group of family doctors following patient visits and 5378 questions from physician practices through professional online services (denoted as OnlinePractice). With more than 20,000 questions combined, we developed supervised machine-learning models for automatic classification between consumer questions and professional questions. To evaluate the robustness of our models, we tested the model that was trained on the Consumer-PointCare dataset on the Consumer-OnlinePractice dataset. We evaluated both linguistic features and statistical features and examined how the characteristics in two different types of professional questions (PointCare vs. OnlinePractice) may affect the classification performance. We explored information gain for feature reduction and the back-off linguistic category features.

RESULTS

The 10-fold cross-validation results showed the best F1-measure of 0.936 and 0.946 on Consumer-PointCare and Consumer-OnlinePractice respectively, and the best F1-measure of 0.891 when testing the Consumer-PointCare model on the Consumer-OnlinePractice dataset.

CONCLUSION

Healthcare consumer questions posted at Yahoo online communities can be reliably classified from professional questions posted by point-of-care clinicians and online physicians. The supervised machine-learning models are robust for this task. Our study will significantly benefit further development in automated consumer question answering.

摘要

目的

医疗保健专业人员和医疗保健消费者都有信息需求,可以通过使用计算机来满足,特别是通过医学问答系统。然而,这两个群体的信息需求在文化程度和技术专长方面存在差异,如果问答系统要为每个群体的用户制定最相关的回复,就必须能够考虑到这些差异。在本文中,我们提出,回答不同用户查询的第一步是根据问题是由医疗保健专业人员还是消费者提出,自动对问题进行分类。

设计

我们从雅虎问答中获得了两组消费者问题(总共约 10000 个问题)。专业问题包括两组问题集:从一组家庭医生在患者就诊后进行的访谈中获得的 4654 个即时护理问题(记为 PointCare),以及通过专业在线服务(记为 OnlinePractice)从医生实践中获得的 5378 个问题。结合超过 20000 个问题,我们开发了用于消费者问题和专业问题自动分类的监督机器学习模型。为了评估模型的稳健性,我们在 Consumer-OnlinePractice 数据集上测试了在 Consumer-PointCare 数据集上训练的模型。我们评估了语言特征和统计特征,并研究了两种不同类型的专业问题(PointCare 与 OnlinePractice)中的特征如何影响分类性能。我们探索了特征减少的信息增益和回退语言类别特征。

结果

10 折交叉验证结果分别在 Consumer-PointCare 和 Consumer-OnlinePractice 上达到了最佳的 F1 度量值 0.936 和 0.946,在 Consumer-PointCare 模型在 Consumer-OnlinePractice 数据集上测试时达到了最佳的 F1 度量值 0.891。

结论

在雅虎在线社区发布的医疗保健消费者问题可以与即时护理临床医生和在线医生发布的专业问题可靠地区分。监督机器学习模型在这项任务中具有稳健性。我们的研究将极大地促进自动化消费者问答的进一步发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f410/3226885/5407a7f8c830/nihms324105f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f410/3226885/dc376c16a399/nihms324105f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f410/3226885/5407a7f8c830/nihms324105f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f410/3226885/dc376c16a399/nihms324105f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f410/3226885/5407a7f8c830/nihms324105f2.jpg

相似文献

1
Toward automated consumer question answering: automatically separating consumer questions from professional questions in the healthcare domain.迈向自动化消费者问答:在医疗保健领域自动区分消费者问题和专业问题。
J Biomed Inform. 2011 Dec;44(6):1032-8. doi: 10.1016/j.jbi.2011.08.008. Epub 2011 Aug 12.
2
Classifying Chinese Questions Related to Health Care Posted by Consumers Via the Internet.对消费者通过互联网发布的与医疗保健相关的中文问题进行分类。
J Med Internet Res. 2017 Jun 20;19(6):e220. doi: 10.2196/jmir.7156.
3
Consumer health information and question answering: helping consumers find answers to their health-related information needs.消费者健康信息与问答:帮助消费者寻找与其健康相关的信息需求的答案。
J Am Med Inform Assoc. 2020 Feb 1;27(2):194-201. doi: 10.1093/jamia/ocz152.
4
Concept based auto-assignment of healthcare questions to domain experts in online Q&A communities.基于概念的在线问答社区中医疗问题自动分配给领域专家
Int J Med Inform. 2020 May;137:104108. doi: 10.1016/j.ijmedinf.2020.104108. Epub 2020 Mar 6.
5
Automatically extracting information needs from complex clinical questions.从复杂的临床问题中自动提取信息需求。
J Biomed Inform. 2010 Dec;43(6):962-71. doi: 10.1016/j.jbi.2010.07.007. Epub 2010 Jul 27.
6
Automatically extracting information needs from Ad Hoc clinical questions.从临时临床问题中自动提取信息需求。
AMIA Annu Symp Proc. 2008 Nov 6;2008:96-100.
7
Toward automated classification of consumers' cancer-related questions with a new taxonomy of expected answer types.利用新的预期答案类型分类法实现消费者癌症相关问题的自动分类。
Health Informatics J. 2016 Sep;22(3):523-35. doi: 10.1177/1460458215571643. Epub 2015 Mar 10.
8
A Machine Learning-based Method for Question Type Classification in Biomedical Question Answering.一种基于机器学习的生物医学问答中问题类型分类方法。
Methods Inf Med. 2017 May 18;56(3):209-216. doi: 10.3414/ME16-01-0116. Epub 2017 Mar 31.
9
A Semi-Supervised Learning Approach to Enhance Health Care Community-Based Question Answering: A Case Study in Alcoholism.一种基于半监督学习的方法,用于增强医疗保健社区问答:以酗酒为例的研究。
JMIR Med Inform. 2016 Aug 2;4(3):e24. doi: 10.2196/medinform.5490.
10
SimQ: real-time retrieval of similar consumer health questions.SimQ:相似消费者健康问题的实时检索
J Med Internet Res. 2015 Feb 17;17(2):e43. doi: 10.2196/jmir.3388.

引用本文的文献

1
Semantic classification of Indonesian consumer health questions.印度尼西亚消费者健康问题的语义分类。
J Biomed Semantics. 2025 Jul 28;16(1):13. doi: 10.1186/s13326-025-00334-5.
2
quEHRy: a question answering system to query electronic health records.QueHRy:一个问答系统,用于查询电子健康记录。
J Am Med Inform Assoc. 2023 May 19;30(6):1091-1102. doi: 10.1093/jamia/ocad050.
3
A conversational agent system for dietary supplements use.膳食补充剂使用的会话代理系统。

本文引用的文献

1
Towards spoken clinical-question answering: evaluating and adapting automatic speech-recognition systems for spoken clinical questions.迈向口语临床问答:评估和调整自动语音识别系统以应对口语临床问题。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):625-30. doi: 10.1136/amiajnl-2010-000071. Epub 2011 Jun 24.
2
Automatically classifying sentences in full-text biomedical articles into introduction, methods, results and discussion.将全文生物医学文章中的句子自动分类为引言、方法、结果和讨论部分。
Summit Transl Bioinform. 2009 Mar 1;2009:6-10.
3
Profiling characteristics of internet medical information users.
BMC Med Inform Decis Mak. 2022 Jul 7;22(Suppl 1):153. doi: 10.1186/s12911-022-01888-5.
4
Search like an expert: Reducing expertise disparity using a hybrid neural index for COVID-19 queries.专家级搜索:利用混合神经网络索引减少 COVID-19 查询中的专业知识差距。
J Biomed Inform. 2022 Mar;127:104005. doi: 10.1016/j.jbi.2022.104005. Epub 2022 Feb 8.
5
Consumer health information and question answering: helping consumers find answers to their health-related information needs.消费者健康信息与问答:帮助消费者寻找与其健康相关的信息需求的答案。
J Am Med Inform Assoc. 2020 Feb 1;27(2):194-201. doi: 10.1093/jamia/ocz152.
6
Qcorp: an annotated classification corpus of Chinese health questions.Qcorp:一个带注释的中文健康问题分类语料库。
BMC Med Inform Decis Mak. 2018 Mar 22;18(Suppl 1):16. doi: 10.1186/s12911-018-0593-y.
7
Semantic annotation of consumer health questions.消费者健康问题的语义标注。
BMC Bioinformatics. 2018 Feb 6;19(1):34. doi: 10.1186/s12859-018-2045-1.
8
Classifying Chinese Questions Related to Health Care Posted by Consumers Via the Internet.对消费者通过互联网发布的与医疗保健相关的中文问题进行分类。
J Med Internet Res. 2017 Jun 20;19(6):e220. doi: 10.2196/jmir.7156.
9
Annotating Logical Forms for EHR Questions.为电子健康记录问题标注逻辑形式
LREC Int Conf Lang Resour Eval. 2016 May;2016:3772-3778.
10
Resource Classification for Medical Questions.医学问题的资源分类
AMIA Annu Symp Proc. 2017 Feb 10;2016:1040-1049. eCollection 2016.
互联网医疗信息用户特征分析。
J Am Med Inform Assoc. 2009 Sep-Oct;16(5):714-22. doi: 10.1197/jamia.M3150. Epub 2009 Jun 30.
4
Automatically extracting information needs from Ad Hoc clinical questions.从临时临床问题中自动提取信息需求。
AMIA Annu Symp Proc. 2008 Nov 6;2008:96-100.
5
Striking jump in consumers seeking health care information.寻求医疗保健信息的消费者数量大幅跃升。
Track Rep. 2008 Aug(20):1-8.
6
Answering clinical questions in the ED.在急诊科回答临床问题。
Am J Emerg Med. 2008 Feb;26(2):144-7. doi: 10.1016/j.ajem.2007.03.031.
7
Patient literacy and question-asking behavior during the medical encounter: a mixed-methods analysis.医疗问诊过程中的患者素养与提问行为:一项混合方法分析
J Gen Intern Med. 2007 Jun;22(6):782-6. doi: 10.1007/s11606-007-0184-6. Epub 2007 Apr 12.
8
Information needs of nurse care managers.护理管理者的信息需求。
AMIA Annu Symp Proc. 2006;2006:913.
9
"Bag of words" is not enough for strength of evidence classification.“词袋法”对于证据强度分类是不够的。
AMIA Annu Symp Proc. 2005;2005:1031.
10
Exploring and developing consumer health vocabularies.探索和开发消费者健康词汇表。
J Am Med Inform Assoc. 2006 Jan-Feb;13(1):24-9. doi: 10.1197/jamia.M1761. Epub 2005 Oct 12.