基于智能模型的新闻文本数据情感分类

Sentiment Classification of News Text Data Using Intelligent Model.

作者信息

Zhang Shitao

机构信息

School of Network Communication, Zhejiang Yuexiu University, Shaoxing, China.

出版信息

Front Psychol. 2021 Sep 28;12:758967. doi: 10.3389/fpsyg.2021.758967. eCollection 2021.

DOI:10.3389/fpsyg.2021.758967

PMID:34650498

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8509032/

Abstract

Text sentiment classification is a fundamental sub-area in natural language processing. The sentiment classification algorithm is highly domain-dependent. For example, the phrase "traffic jam" expresses negative sentiment in the sentence "I was stuck in a traffic jam on the elevated for 2 h." But in the domain of transportation, the phrase "traffic jam" in the sentence "Bread and water are essential terms in traffic jams" is without any sentiment. The most common method is to use the domain-specific data samples to classify the text in this domain. However, text sentiment analysis based on machine learning relies on sufficient labeled training data. Aiming at the problem of sentiment classification of news text data with insufficient label news data and the domain adaptation of text sentiment classifiers, an intelligent model, i.e., transfer learning discriminative dictionary learning algorithm (TLDDL) is proposed for cross-domain text sentiment classification. Based on the framework of dictionary learning, the samples from the different domains are projected into a subspace, and a domain-invariant dictionary is built to connect two different domains. To improve the discriminative performance of the proposed algorithm, the discrimination information preserved term and principal component analysis (PCA) term are combined into the objective function. The experiments are performed on three public text datasets. The experimental results show that the proposed algorithm improves the sentiment classification performance of texts in the target domain.

摘要

文本情感分类是自然语言处理中的一个基本子领域。情感分类算法高度依赖于领域。例如，短语“交通堵塞”在句子“我在高架桥上堵了两个小时”中表达负面情绪。但在交通领域，句子“面包和水是交通堵塞中的必备物品”中的短语“交通堵塞”没有任何情感倾向。最常见的方法是使用特定领域的数据样本对该领域的文本进行分类。然而，基于机器学习的文本情感分析依赖于足够的标注训练数据。针对新闻文本数据标注不足以及文本情感分类器的领域适应性问题，提出了一种智能模型，即用于跨领域文本情感分类的迁移学习判别字典学习算法（TLDDL）。基于字典学习框架，将来自不同领域的样本投影到一个子空间中，并构建一个领域不变字典来连接两个不同领域。为了提高所提算法的判别性能，将判别信息保留项和主成分分析（PCA）项组合到目标函数中。在三个公开文本数据集上进行了实验。实验结果表明，所提算法提高了目标领域中文本的情感分类性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eedf/8509032/8fcf4ad8a6b3/fpsyg-12-758967-g001.jpg

相似文献

Sentiment Classification of News Text Data Using Intelligent Model.

Front Psychol. 2021 Sep 28;12:758967. doi: 10.3389/fpsyg.2021.758967. eCollection 2021.

Sentiment Classification for Financial Texts Based on Deep Learning.

Comput Intell Neurosci. 2021 Oct 11;2021:9524705. doi: 10.1155/2021/9524705. eCollection 2021.

A BERT-Based Aspect-Level Sentiment Analysis Algorithm for Cross-Domain Text.

Comput Intell Neurosci. 2022 Jun 27;2022:8726621. doi: 10.1155/2022/8726621. eCollection 2022.

News Text Mining-Based Business Sentiment Analysis and Its Significance in Economy.

Front Psychol. 2022 Jul 14;13:918447. doi: 10.3389/fpsyg.2022.918447. eCollection 2022.

Multi-level aspect based sentiment classification of Twitter data: using hybrid approach in deep learning.

PeerJ Comput Sci. 2021 Apr 13;7:e433. doi: 10.7717/peerj-cs.433. eCollection 2021.

Cross-Domain Sentiment Analysis Based on Feature Projection and Multi-Source Attention in IoT.

Sensors (Basel). 2023 Aug 20;23(16):7282. doi: 10.3390/s23167282.

Investigating the transferring capability of capsule networks for text classification.

Neural Netw. 2019 Oct;118:247-261. doi: 10.1016/j.neunet.2019.06.014. Epub 2019 Jul 8.

Connecting Text Classification with Image Classification: A New Preprocessing Method for Implicit Sentiment Text Classification.

Sensors (Basel). 2022 Feb 28;22(5):1899. doi: 10.3390/s22051899.

Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm.

PLoS One. 2018 Apr 23;13(4):e0194852. doi: 10.1371/journal.pone.0194852. eCollection 2018.

Domain adaptive learning for multi realm sentiment classification on big data.

PLoS One. 2024 Apr 1;19(4):e0297028. doi: 10.1371/journal.pone.0297028. eCollection 2024.

引用本文的文献

Does media sentiment affect stock prices? Evidence from China's STAR market.

Front Psychol. 2022 Dec 1;13:1040171. doi: 10.3389/fpsyg.2022.1040171. eCollection 2022.

Sentiment Thesaurus, Synset and Word2Vec Based Improvement in Bigram Model for Classifying Product Reviews.

SN Comput Sci. 2022;3(6):422. doi: 10.1007/s42979-022-01305-8. Epub 2022 Aug 6.

本文引用的文献

A Domain Adaptation Sparse Representation Classifier for Cross-Domain Electroencephalogram-Based Emotion Classification.

Front Psychol. 2021 Jul 29;12:721266. doi: 10.3389/fpsyg.2021.721266. eCollection 2021.

Optimized Projection and Fisher Discriminative Dictionary Learning for EEG Emotion Recognition.

Front Psychol. 2021 Jun 28;12:705528. doi: 10.3389/fpsyg.2021.705528. eCollection 2021.

A Novel Negative-Transfer-Resistant Fuzzy Clustering Model With a Shared Cross-Domain Transfer Latent Space and its Application to Brain CT Image Segmentation.

IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):40-52. doi: 10.1109/TCBB.2019.2963873. Epub 2021 Feb 3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于智能模型的新闻文本数据情感分类

Sentiment Classification of News Text Data Using Intelligent Model.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献