The Yancheng School of Clinical Medicine of Nanjing Medical University, Jiangsu 224008, China.
Quality Management Division, Yancheng Third People's Hospital, Jiangsu 224008, China.
Math Biosci Eng. 2023 Jan;20(2):1981-1992. doi: 10.3934/mbe.2023091. Epub 2022 Nov 9.
Text classification is a fundamental task in natural language processing. Chinese text classification suffers from sparse text features, ambiguity in word segmentation, and poor performance of classification models. We propose a dual-channel text classification model (DCCL) that combines CNN and LSTM with a self-attention mechanism. The model feeds word vectors into a dual-channel neural network. In one channel, multiple CNNs extract N-gram information over different word windows, and their outputs are concatenated to enrich the local feature representation. In the other channel, a BiLSTM extracts the semantic associations of the context to obtain a high-level, sentence-level feature representation; its output is weighted by self-attention to reduce the influence of noisy features. The outputs of the two channels are concatenated and fed into a softmax layer for classification. In multiple comparison experiments, the DCCL model achieved F1-scores of 90.07% and 96.26% on the Sougou and THUNews datasets, respectively, improvements of 3.24% and 2.19% over the baseline model. The DCCL model alleviates the loss of word-order information in CNNs and the vanishing-gradient problem of BiLSTM when processing text sequences, effectively integrates local and global text features, and highlights key information. Its classification performance is strong, making it well suited to text classification tasks.
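The dual-channel structure described above can be sketched as follows. This is a minimal NumPy illustration, not the authors' implementation: the dimensions, the simple dot-product form of self-attention, and the random stand-in for BiLSTM hidden states are all assumptions made for clarity.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

rng = np.random.default_rng(0)
T, d = 6, 8                       # sequence length, embedding size (illustrative)
E = rng.standard_normal((T, d))   # word vectors fed to both channels

# Channel 1: CNNs over several word windows (N-gram sizes), each
# max-pooled over time, then concatenated into one local-feature vector.
def conv_maxpool(E, n):
    W = rng.standard_normal(n * d)                 # one filter of window size n
    outs = [W @ E[i:i + n].ravel() for i in range(T - n + 1)]
    return max(outs)                               # max over time

local = np.array([conv_maxpool(E, n) for n in (2, 3, 4)])

# Channel 2: stand-in for BiLSTM hidden states (random here), with
# dot-product self-attention weighting to down-weight noisy time steps.
H = rng.standard_normal((T, d))                    # placeholder for BiLSTM outputs
A = softmax(H @ H.T, axis=-1)                      # (T, T) attention weights
weighted = (A @ H).mean(axis=0)                    # attention-weighted sentence vector

# Concatenate both channels and classify with softmax.
features = np.concatenate([local, weighted])
W_out = rng.standard_normal((features.size, 3))    # 3 classes, illustrative
probs = softmax(features @ W_out)
print(probs.shape)
```

The design choice the abstract motivates is visible here: the CNN channel captures position-local N-gram patterns while the attention-weighted recurrent channel carries sentence-level context, and concatenation lets the softmax classifier draw on both.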