• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于新冠疫情在线评论文本分类的联合坐标注意力机制与实例归一化

Joint coordinate attention mechanism and instance normalization for COVID online comments text classification.

作者信息

Zhu Rong, Gao Hua-Hui, Wang Yong

机构信息

School of Computer Science, Qufu Normal University, Rizhao, China.

Laboratory Experimental Teaching and Equipment Management Center, Qufu Normal University, Rizhao, China.

出版信息

PeerJ Comput Sci. 2024 Aug 19;10:e2240. doi: 10.7717/peerj-cs.2240. eCollection 2024.

DOI:10.7717/peerj-cs.2240
PMID:39314739
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11419621/
Abstract

BACKGROUND

The majority of extant methodologies for text classification prioritize the extraction of feature representations from texts with high degrees of distinction, a process that may result in computational inefficiencies. To address this limitation, the current study proposes a novel approach by directly leveraging label information to construct text representations. This integration aims to optimize the use of label data alongside textual content.

METHODS

The methodology initiated with separate pre-processing of texts and labels, followed by encoding through a projection layer. This research then utilized a conventional self-attention model enhanced by instance normalization (IN) and Gaussian Error Linear Unit (GELU) functions to assess emotional valences in review texts. An advanced self-attention mechanism was further developed to enable the efficient integration of text and label information. In the final stage, an adaptive label encoder was employed to extract relevant label information from the combined text-label data efficiently.

RESULTS

Empirical evaluations demonstrate that the proposed model achieves a significant improvement in classification performance, outperforming existing methodologies. This enhancement is quantitatively evidenced by its superior micro-F1 score, indicating the efficacy of integrating label information into text classification processes. This suggests that the model not only addresses computational inefficiencies but also enhances the accuracy of text classification.

摘要

背景

大多数现有的文本分类方法优先从具有高度区分度的文本中提取特征表示,这一过程可能导致计算效率低下。为解决这一局限性,本研究提出一种新颖的方法,即直接利用标签信息来构建文本表示。这种整合旨在优化标签数据与文本内容的使用。

方法

该方法首先对文本和标签进行单独预处理,然后通过投影层进行编码。本研究随后利用通过实例归一化(IN)和高斯误差线性单元(GELU)函数增强的传统自注意力模型来评估评论文本中的情感效价。进一步开发了一种先进的自注意力机制,以实现文本和标签信息的有效整合。在最后阶段,采用自适应标签编码器从组合的文本-标签数据中高效提取相关标签信息。

结果

实证评估表明,所提出的模型在分类性能上取得了显著提升,优于现有方法。这种提升在其卓越的微F1分数上得到了定量证明,表明将标签信息整合到文本分类过程中的有效性。这表明该模型不仅解决了计算效率低下的问题,还提高了文本分类的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8034/11419621/179c7361b677/peerj-cs-10-2240-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8034/11419621/cbddf3dd1f24/peerj-cs-10-2240-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8034/11419621/6952a4f33dc8/peerj-cs-10-2240-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8034/11419621/179c7361b677/peerj-cs-10-2240-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8034/11419621/cbddf3dd1f24/peerj-cs-10-2240-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8034/11419621/6952a4f33dc8/peerj-cs-10-2240-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8034/11419621/179c7361b677/peerj-cs-10-2240-g003.jpg

相似文献

1
Joint coordinate attention mechanism and instance normalization for COVID online comments text classification.用于新冠疫情在线评论文本分类的联合坐标注意力机制与实例归一化
PeerJ Comput Sci. 2024 Aug 19;10:e2240. doi: 10.7717/peerj-cs.2240. eCollection 2024.
2
Chinese text classification method based on sentence information enhancement and feature fusion.基于句子信息增强与特征融合的中文文本分类方法
Heliyon. 2024 Aug 24;10(17):e36861. doi: 10.1016/j.heliyon.2024.e36861. eCollection 2024 Sep 15.
3
ML-Net: multi-label classification of biomedical texts with deep neural networks.ML-Net:基于深度神经网络的生物医学文本多标签分类
J Am Med Inform Assoc. 2019 Nov 1;26(11):1279-1285. doi: 10.1093/jamia/ocz085.
4
DeBERTa-BiLSTM: A multi-label classification model of Arabic medical questions using pre-trained models and deep learning.基于预训练模型和深度学习的阿拉伯文医学问题多标签分类模型:DeBERTa-BiLSTM
Comput Biol Med. 2024 Mar;170:107921. doi: 10.1016/j.compbiomed.2024.107921. Epub 2024 Jan 4.
5
Enhanced industrial text classification hyper variational graph-guided global context integration.增强型工业文本分类:超变分图引导的全局上下文整合
PeerJ Comput Sci. 2024 Jan 5;10:e1788. doi: 10.7717/peerj-cs.1788. eCollection 2024.
6
Multi-Label Classification in Patient-Doctor Dialogues With the RoBERTa-WWM-ext + CNN (Robustly Optimized Bidirectional Encoder Representations From Transformers Pretraining Approach With Whole Word Masking Extended Combining a Convolutional Neural Network) Model: Named Entity Study.基于RoBERTa-WWM-ext + CNN(带有全词掩码扩展的基于变换器预训练方法的稳健优化双向编码器表示与卷积神经网络相结合)模型的医患对话多标签分类:命名实体研究
JMIR Med Inform. 2022 Apr 21;10(4):e35606. doi: 10.2196/35606.
7
Attention Mechanisms in Clinical Text Classification: A Comparative Evaluation.临床文本分类中的注意力机制:一项比较评估。
IEEE J Biomed Health Inform. 2024 Jan 19;PP. doi: 10.1109/JBHI.2024.3355951.
8
Large scale biomedical texts classification: a kNN and an ESA-based approaches.大规模生物医学文本分类:基于k近邻算法和基于词嵌入语义分析的方法。
J Biomed Semantics. 2016 Jun 16;7:40. doi: 10.1186/s13326-016-0073-1.
9
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
10
Automated classification of clinical trial eligibility criteria text based on ensemble learning and metric learning.基于集成学习和度量学习的临床试验资格标准文本的自动分类。
BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):129. doi: 10.1186/s12911-021-01492-z.

本文引用的文献

1
Single-Cell RNA Sequencing Technology Landscape in 2023.2023 年单细胞 RNA 测序技术全景图
Stem Cells. 2024 Jan 13;42(1):1-12. doi: 10.1093/stmcls/sxad077.
2
IChrom-Deep: An Attention-Based Deep Learning Model for Identifying Chromatin Interactions.IChrom-Deep:一种基于注意力的深度学习模型,用于识别染色质相互作用。
IEEE J Biomed Health Inform. 2023 Sep;27(9):4559-4568. doi: 10.1109/JBHI.2023.3292299. Epub 2023 Sep 6.
3
SCMcluster: a high-precision cell clustering algorithm integrating marker gene set with single-cell RNA sequencing data.
SCMcluster:一种高精度的细胞聚类算法,整合了标记基因集与单细胞 RNA 测序数据。
Brief Funct Genomics. 2023 Jul 17;22(4):329-340. doi: 10.1093/bfgp/elad004.
4
SOSPCNN: Structurally Optimized Stochastic Pooling Convolutional Neural Network for Tetralogy of Fallot recognition.SOSPCNN:用于法洛四联症识别的结构优化随机池化卷积神经网络
Wirel Commun Mob Comput. 2021 Jul 1;2021:1-17. doi: 10.1155/2021/5792975.
5
An automated method to enrich consumer health vocabularies using GloVe word embeddings and an auxiliary lexical resource.一种使用GloVe词嵌入和辅助词汇资源来丰富消费者健康词汇表的自动化方法。
PeerJ Comput Sci. 2021 Aug 9;7:e668. doi: 10.7717/peerj-cs.668. eCollection 2021.
6
Framewise phoneme classification with bidirectional LSTM and other neural network architectures.使用双向长短期记忆网络和其他神经网络架构进行逐帧音素分类。
Neural Netw. 2005 Jun-Jul;18(5-6):602-10. doi: 10.1016/j.neunet.2005.06.042.