• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CapsTM:用于中文医疗文本匹配的胶囊网络。

CapsTM: capsule network for Chinese medical text matching.

机构信息

School of Political Science and Public Management, WuHan University, Wuhan, China.

Department of Computer Science, Harbin Institute of Technology, Shenzhen, China.

出版信息

BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):94. doi: 10.1186/s12911-021-01442-9.

DOI:10.1186/s12911-021-01442-9
PMID:34330253
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8322831/
Abstract

BACKGROUND

Text Matching (TM) is a fundamental task of natural language processing widely used in many application systems such as information retrieval, automatic question answering, machine translation, dialogue system, reading comprehension, etc. In recent years, a large number of deep learning neural networks have been applied to TM, and have refreshed benchmarks of TM repeatedly. Among the deep learning neural networks, convolutional neural network (CNN) is one of the most popular networks, which suffers from difficulties in dealing with small samples and keeping relative structures of features. In this paper, we propose a novel deep learning architecture based on capsule network for TM, called CapsTM, where capsule network is a new type of neural network architecture proposed to address some of the short comings of CNN and shows great potential in many tasks.

METHODS

CapsTM is a five-layer neural network, including an input layer, a representation layer, an aggregation layer, a capsule layer and a prediction layer. In CapsTM, two pieces of text are first individually converted into sequences of embeddings and are further transformed by a highway network in the input layer. Then, Bidirectional Long Short-Term Memory (BiLSTM) is used to represent each piece of text and attention-based interaction matrix is used to represent interactive information of the two pieces of text in the representation layer. Subsequently, the two kinds of representations are fused together by BiLSTM in the aggregation layer, and are further represented with capsules (vectors) in the capsule layer. Finally, the prediction layer is a connected network used for classification. CapsTM is an extension of ESIM by adding a capsule layer before the prediction layer.

RESULTS

We construct a corpus of Chinese medical question matching, which contains 36,360 question pairs. This corpus is randomly split into three parts: a training set of 32,360 question pairs, a development set of 2000 question pairs and a test set of 2000 question pairs. On this corpus, we conduct a series of experiments to evaluate the proposed CapsTM and compare it with other state-of-the-art methods. CapsTM achieves the highest F-score of 0.8666.

CONCLUSION

The experimental results demonstrate that CapsTM is effective for Chinese medical question matching and outperforms other state-of-the-art methods for comparison.

摘要

背景

文本匹配(TM)是自然语言处理的一项基本任务,广泛应用于信息检索、自动问答、机器翻译、对话系统、阅读理解等许多应用系统中。近年来,大量的深度学习神经网络被应用于 TM,并多次刷新 TM 的基准。在这些深度学习神经网络中,卷积神经网络(CNN)是最受欢迎的网络之一,但在处理小样本和保持特征的相对结构方面存在困难。在本文中,我们提出了一种基于胶囊网络的 TM 的新型深度学习架构,称为 CapsTM,其中胶囊网络是一种新的神经网络架构,旨在解决 CNN 的一些缺点,并在许多任务中显示出巨大的潜力。

方法

CapsTM 是一个五层神经网络,包括输入层、表示层、聚合层、胶囊层和预测层。在 CapsTM 中,首先将两段文本分别转换为嵌入序列,并在输入层中通过高速公路网络进一步转换。然后,使用双向长短期记忆(BiLSTM)表示每段文本,并在表示层中使用基于注意力的交互矩阵表示两段文本的交互信息。随后,聚合层通过 BiLSTM 将两种表示融合在一起,并在胶囊层中进一步用胶囊(向量)表示。最后,预测层是一个用于分类的连接网络。CapsTM 是 ESIM 的扩展,在预测层之前添加了一个胶囊层。

结果

我们构建了一个包含 36360 对问题的中文医疗问题匹配语料库。该语料库随机分为三部分:训练集包含 32360 对问题,开发集包含 2000 对问题,测试集包含 2000 对问题。在这个语料库上,我们进行了一系列实验来评估所提出的 CapsTM,并与其他最先进的方法进行比较。CapsTM 达到了 0.8666 的最高 F 值。

结论

实验结果表明,CapsTM 对中文医疗问题匹配是有效的,并且优于其他最先进的方法进行比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c00/8323213/b05a7155c924/12911_2021_1442_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c00/8323213/08c507b90272/12911_2021_1442_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c00/8323213/df2c1ebedaaf/12911_2021_1442_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c00/8323213/b05a7155c924/12911_2021_1442_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c00/8323213/08c507b90272/12911_2021_1442_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c00/8323213/df2c1ebedaaf/12911_2021_1442_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c00/8323213/b05a7155c924/12911_2021_1442_Fig3_HTML.jpg

相似文献

1
CapsTM: capsule network for Chinese medical text matching.CapsTM:用于中文医疗文本匹配的胶囊网络。
BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):94. doi: 10.1186/s12911-021-01442-9.
2
Entity recognition in Chinese clinical text using attention-based CNN-LSTM-CRF.基于注意力机制的卷积神经网络-长短时记忆网络-条件随机场在中文临床文本中的实体识别。
BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):74. doi: 10.1186/s12911-019-0787-y.
3
A clinical text classification paradigm using weak supervision and deep representation.一种使用弱监督和深度表示的临床文本分类范式。
BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6.
4
Clinical text classification with rule-based features and knowledge-guided convolutional neural networks.基于规则特征和知识引导卷积神经网络的临床文本分类。
BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):71. doi: 10.1186/s12911-019-0781-4.
5
Temporal indexing of medical entity in Chinese clinical notes.中文临床记录中医疗实体的时间索引。
BMC Med Inform Decis Mak. 2019 Jan 31;19(Suppl 1):17. doi: 10.1186/s12911-019-0735-x.
6
Multi-Label Classification in Patient-Doctor Dialogues With the RoBERTa-WWM-ext + CNN (Robustly Optimized Bidirectional Encoder Representations From Transformers Pretraining Approach With Whole Word Masking Extended Combining a Convolutional Neural Network) Model: Named Entity Study.基于RoBERTa-WWM-ext + CNN(带有全词掩码扩展的基于变换器预训练方法的稳健优化双向编码器表示与卷积神经网络相结合)模型的医患对话多标签分类:命名实体研究
JMIR Med Inform. 2022 Apr 21;10(4):e35606. doi: 10.2196/35606.
7
Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks.使用多任务卷积神经网络从自由文本病理报告中自动提取癌症登记报告信息。
J Am Med Inform Assoc. 2020 Jan 1;27(1):89-98. doi: 10.1093/jamia/ocz153.
8
An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records.基于注意力的深度学习模型在中文电子病历临床命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):235. doi: 10.1186/s12911-019-0933-6.
9
Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks.基于卷积神经网络的中文电子病历智能诊断。
BMC Bioinformatics. 2019 Feb 1;20(1):62. doi: 10.1186/s12859-019-2617-8.
10
A Stacked BiLSTM Neural Network Based on Coattention Mechanism for Question Answering.基于注意力机制的堆叠 BiLSTM 神经网络问答方法。
Comput Intell Neurosci. 2019 Aug 21;2019:9543490. doi: 10.1155/2019/9543490. eCollection 2019.