• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将深度匹配网络应用于中文医学问答:一项研究与数据集。

Applying deep matching networks to Chinese medical question answering: a study and a dataset.

机构信息

Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing, 100190, China.

University of Chinese Academy of Sciences, Beijing, 100049, China.

出版信息

BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):52. doi: 10.1186/s12911-019-0761-8.

DOI:10.1186/s12911-019-0761-8
PMID:30961607
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6454599/
Abstract

BACKGROUND

Medical and clinical question answering (QA) is highly concerned by researchers recently. Though there are remarkable advances in this field, the development in Chinese medical domain is relatively backward. It can be attributed to the difficulty of Chinese text processing and the lack of large-scale datasets. To bridge the gap, this paper introduces a Chinese medical QA dataset and proposes effective methods for the task.

METHODS

We first construct a large scale Chinese medical QA dataset. Then we leverage deep matching neural networks to capture semantic interaction between words in questions and answers. Considering that Chinese Word Segmentation (CWS) tools may fail to identify clinical terms, we design a module to merge the word segments and produce a new representation. It learns the common compositions of words or segments by using convolutional kernels and selects the strongest signals by windowed pooling.

RESULTS

The best performer among popular CWS tools on our dataset is found. In our experiments, deep matching models substantially outperform existing methods. Results also show that our proposed semantic clustered representation module improves the performance of models by up to 5.5% Precision at 1 and 4.9% Mean Average Precision.

CONCLUSIONS

In this paper, we introduce a large scale Chinese medical QA dataset and cast the task into a semantic matching problem. We also compare different CWS tools and input units. Among the two state-of-the-art deep matching neural networks, MatchPyramid performs better. Results also show the effectiveness of the proposed semantic clustered representation module.

摘要

背景

医学和临床问答(QA)是研究人员最近高度关注的问题。尽管在这个领域取得了显著的进展,但中文医学领域的发展相对落后。这可以归因于中文文本处理的难度和缺乏大规模数据集。为了弥补这一差距,本文介绍了一个中文医学 QA 数据集,并提出了该任务的有效方法。

方法

我们首先构建了一个大规模的中文医学 QA 数据集。然后,我们利用深度匹配神经网络来捕捉问题和答案中单词之间的语义交互。考虑到中文分词(CWS)工具可能无法识别临床术语,我们设计了一个模块来合并词段并生成新的表示。它通过卷积核学习单词或词段的常见组合,并通过窗口池选择最强信号。

结果

在我们的数据集上,找到表现最好的流行 CWS 工具。在我们的实验中,深度匹配模型大大优于现有方法。结果还表明,我们提出的语义聚类表示模块通过在 1 个和 4.9%的平均精度上提高 5.5%的精度来提高模型的性能。

结论

在本文中,我们引入了一个大规模的中文医学 QA 数据集,并将任务转化为语义匹配问题。我们还比较了不同的 CWS 工具和输入单元。在两种最先进的深度匹配神经网络中,MatchPyramid 的表现更好。结果还表明了所提出的语义聚类表示模块的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7f7/6454599/8dbb8e480a6c/12911_2019_761_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7f7/6454599/d06ce31e240d/12911_2019_761_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7f7/6454599/c8d879bed7a4/12911_2019_761_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7f7/6454599/21e2bad29cf9/12911_2019_761_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7f7/6454599/39abfceb9873/12911_2019_761_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7f7/6454599/8dbb8e480a6c/12911_2019_761_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7f7/6454599/d06ce31e240d/12911_2019_761_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7f7/6454599/c8d879bed7a4/12911_2019_761_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7f7/6454599/21e2bad29cf9/12911_2019_761_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7f7/6454599/39abfceb9873/12911_2019_761_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7f7/6454599/8dbb8e480a6c/12911_2019_761_Fig5_HTML.jpg

相似文献

1
Applying deep matching networks to Chinese medical question answering: a study and a dataset.将深度匹配网络应用于中文医学问答:一项研究与数据集。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):52. doi: 10.1186/s12911-019-0761-8.
2
CapsTM: capsule network for Chinese medical text matching.CapsTM:用于中文医疗文本匹配的胶囊网络。
BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):94. doi: 10.1186/s12911-021-01442-9.
3
Answering medical questions in Chinese using automatically mined knowledge and deep neural networks: an end-to-end solution.利用自动挖掘的知识和深度神经网络用中文回答医学问题:一种端到端的解决方案。
BMC Bioinformatics. 2022 Apr 15;23(1):136. doi: 10.1186/s12859-022-04658-2.
4
A Stacked BiLSTM Neural Network Based on Coattention Mechanism for Question Answering.基于注意力机制的堆叠 BiLSTM 神经网络问答方法。
Comput Intell Neurosci. 2019 Aug 21;2019:9543490. doi: 10.1155/2019/9543490. eCollection 2019.
5
IARNN-Based Semantic-Containing Double-Level Embedding Bi-LSTM for Question-and-Answer Matching.基于 IARNN 的语义包含双层嵌入双向 LSTM 的问答匹配
Comput Intell Neurosci. 2019 Mar 3;2019:6074840. doi: 10.1155/2019/6074840. eCollection 2019.
6
Construction and Research on Chinese Semantic Mapping Based on Linguistic Features and Sparse Self-Learning Neural Networks.基于语言特征和稀疏自学习神经网络的中文语义映射构建与研究。
Comput Intell Neurosci. 2022 Jun 20;2022:2315802. doi: 10.1155/2022/2315802. eCollection 2022.
7
A benchmark dataset and case study for Chinese medical question intent classification.用于中文医学问题意图分类的基准数据集和案例研究。
BMC Med Inform Decis Mak. 2020 Jul 9;20(Suppl 3):125. doi: 10.1186/s12911-020-1122-3.
8
MAGE: Multi-scale Context-aware Interaction based on Multi-granularity Embedding for Chinese Medical Question Answer Matching.MAGE:基于多粒度嵌入的多尺度上下文感知交互,用于中医问答匹配
Comput Methods Programs Biomed. 2023 Jan;228:107249. doi: 10.1016/j.cmpb.2022.107249. Epub 2022 Nov 17.
9
Automatic SNOMED CT coding of Chinese clinical terms via attention-based semantic matching.通过基于注意力的语义匹配对中文临床术语进行自动SNOMED CT编码。
Int J Med Inform. 2022 Mar;159:104676. doi: 10.1016/j.ijmedinf.2021.104676. Epub 2021 Dec 28.
10
Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks.基于卷积神经网络的中文电子病历智能诊断。
BMC Bioinformatics. 2019 Feb 1;20(1):62. doi: 10.1186/s12859-019-2617-8.

引用本文的文献

1
Answering medical questions in Chinese using automatically mined knowledge and deep neural networks: an end-to-end solution.利用自动挖掘的知识和深度神经网络用中文回答医学问题:一种端到端的解决方案。
BMC Bioinformatics. 2022 Apr 15;23(1):136. doi: 10.1186/s12859-022-04658-2.

本文引用的文献

1
A Depth Evidence Score Fusion Algorithm for Chinese Medical Intelligence Question Answering System.一种用于中文医疗智能问答系统的深度证据融合算法。
J Healthc Eng. 2018 Jul 10;2018:1205354. doi: 10.1155/2018/1205354. eCollection 2018.
2
An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition.BIOASQ大规模生物医学语义索引与问答竞赛概述。
BMC Bioinformatics. 2015 Apr 30;16:138. doi: 10.1186/s12859-015-0564-6.
3
Beyond information retrieval--medical question answering.超越信息检索——医学问答
AMIA Annu Symp Proc. 2006;2006:469-73.
4
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.