Suppr超能文献

基于图论的深度学习在情感分析中的应用研究。

An investigation into the deep learning approach in sentimental analysis using graph-based theories.

机构信息

School of Computing and Engineering, University of Huddersfield, Huddersfield, West- Yorkshire, United Kingdom.

出版信息

PLoS One. 2021 Dec 2;16(12):e0260761. doi: 10.1371/journal.pone.0260761. eCollection 2021.

Abstract

Sentiment analysis is a branch of natural language analytics that aims to correlate what is expressed which comes normally within unstructured format with what is believed and learnt. Several attempts have tried to address this gap (i.e., Naive Bayes, RNN, LSTM, word embedding, etc.), even though the deep learning models achieved high performance, their generative process remains a "black-box" and not fully disclosed due to the high dimensional feature and the non-deterministic weights assignment. Meanwhile, graphs are becoming more popular when modeling complex systems while being traceable and understood. Here, we reveal that a good trade-off transparency and efficiency could be achieved with a Deep Neural Network by exploring the Credit Assignment Paths theory. To this end, we propose a novel algorithm which alleviates the features' extraction mechanism and attributes an importance level of selected neurons by applying a deterministic edge/node embeddings with attention scores on the input unit and backward path respectively. We experiment on the Twitter Health News dataset were the model has been extended to approach different approximations (tweet/aspect and tweets' source levels, frequency, polarity/subjectivity), it was also transparent and traceable. Moreover, results of comparing with four recent models on same data corpus for tweets analysis showed a rapid convergence with an overall accuracy of ≈83% and 94% of correctly identified true positive sentiments. Therefore, weights can be ideally assigned to specific active features by following the proposed method. As opposite to other compared works, the inferred features are conditioned through the users' preferences (i.e., frequency degree) and via the activation's derivatives (i.e., reject feature if not scored). Future direction will address the inductive aspect of graph embeddings to include dynamic graph structures and expand the model resiliency by considering other datasets like SemEval task7, covid-19 tweets, etc.

摘要

情感分析是自然语言分析的一个分支,旨在将通常以非结构化格式表达的内容与所相信和学习的内容相关联。已经有几种尝试试图解决这一差距(例如朴素贝叶斯、RNN、LSTM、词嵌入等),尽管深度学习模型取得了很高的性能,但由于高维特征和非确定性权重分配,它们的生成过程仍然是一个“黑盒”,没有完全公开。同时,在对复杂系统进行建模时,图变得越来越流行,同时也具有可追溯性和可理解性。在这里,我们通过探索信用分配路径理论发现,通过深度神经网络可以在透明性和效率之间取得良好的平衡。为此,我们提出了一种新的算法,通过应用确定性边缘/节点嵌入,并分别在输入单元和反向路径上应用注意力分数,来缓解特征提取机制,并为选定神经元赋予重要性级别。我们在 Twitter Health News 数据集上进行了实验,该模型已经扩展到了不同的近似方法(推文/方面和推文的来源级别、频率、极性/主观性),同时也具有透明性和可追溯性。此外,在对相同数据语料库进行的关于推文分析的四个最新模型的比较结果表明,该模型具有快速收敛性,整体准确率约为 83%,正确识别的真实正性情感比例为 94%。因此,可以通过使用所提出的方法为特定的活跃特征分配理想的权重。与其他对比工作相反,通过用户偏好(即频率程度)和通过激活导数(即,如果未评分则拒绝特征)来对推断特征进行条件化。未来的方向将解决图嵌入的归纳方面,包括动态图结构,并通过考虑其他数据集(如 SemEval task7、covid-19 推文等)来扩展模型的弹性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/841d/8638889/e2e5d3407bba/pone.0260761.g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验