Tan Zhipeng, Chen Jing, Kang Qi, Zhou Mengchu, Abusorrah Abdullah, Sedraoui Khaled
IEEE Trans Neural Netw Learn Syst. 2022 Mar;33(3):973-982. doi: 10.1109/TNNLS.2020.3036192. Epub 2022 Feb 28.
Text classification is a fundamental and important area of natural language processing for assigning a text into at least one predefined tag or category according to its content. Most of the advanced systems are either too simple to get high accuracy or centered on using complex structures to capture the genuinely required category information, which requires long time to converge during their training stage. In order to address such challenging issues, we propose a dynamic embedding projection-gated convolutional neural network (DEP-CNN) for multi-class and multi-label text classification. Its dynamic embedding projection gate (DEPG) transforms and carries word information by using gating units and shortcut connections to control how much context information is incorporated into each specific position of a word-embedding matrix in a text. To our knowledge, we are the first to apply DEPG over a word-embedding matrix. The experimental results on four known benchmark datasets display that DEP-CNN outperforms its recent peers.
文本分类是自然语言处理中的一个基础且重要的领域,其目的是根据文本内容将其归入至少一个预定义的标签或类别。大多数先进的系统要么过于简单而无法获得高精度,要么侧重于使用复杂结构来捕捉真正所需的类别信息,这在其训练阶段需要很长时间才能收敛。为了解决这些具有挑战性的问题,我们提出了一种用于多类和多标签文本分类的动态嵌入投影门控卷积神经网络(DEP-CNN)。其动态嵌入投影门(DEPG)通过使用门控单元和捷径连接来变换和传递单词信息,以控制有多少上下文信息被纳入文本中单词嵌入矩阵的每个特定位置。据我们所知,我们是第一个在单词嵌入矩阵上应用DEPG的。在四个已知基准数据集上的实验结果表明,DEP-CNN优于其最近的同类模型。