Combining Knowledge Graph and Word Embeddings for Spherical Topic Modeling.

Authors

Ennajari Hafsa, Bouguila Nizar, Bentahar Jamal

Publication information

IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3609-3623. doi: 10.1109/TNNLS.2021.3112045. Epub 2023 Jul 6.

DOI: 10.1109/TNNLS.2021.3112045
PMID: 34559665

Abstract

Probabilistic topic models are considered as an effective framework for text analysis that uncovers the main topics in an unlabeled set of documents. However, the inferred topics by traditional topic models are often unclear and not easy to interpret because they do not account for semantic structures in language. Recently, a number of topic modeling approaches tend to leverage domain knowledge to enhance the quality of the learned topics, but they still assume a multinomial or Gaussian document likelihood in the Euclidean space, which often results in information loss and poor performance. In this article, we propose a Bayesian embedded spherical topic model (ESTM) that combines both knowledge graph and word embeddings in a non-Euclidean curved space, the hypersphere, for better topic interpretability and discriminative text representations. Extensive experimental results show that our proposed model successfully uncovers interpretable topics and learns high-quality text representations useful for common natural language processing (NLP) tasks across multiple benchmark datasets.
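
The abstract does not give the model's equations, so the following is only a minimal sketch of the general idea behind working on a hypersphere, not ESTM itself. It assumes that text vectors are compared by direction rather than magnitude: each vector is L2-normalized onto the unit sphere, and a document is matched to the topic direction with the highest cosine similarity. The function names, topic labels, and 3-dimensional vectors are all hypothetical and chosen purely for illustration.

```python
import math


def normalize(v):
    """Project a vector onto the unit hypersphere (L2 norm of 1)."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize the zero vector")
    return [x / norm for x in v]


def cosine(u, v):
    """Cosine similarity: the dot product of the two normalized vectors."""
    return sum(a * b for a, b in zip(normalize(u), normalize(v)))


def nearest_topic(doc_vec, topic_dirs):
    """Return the name of the topic direction most similar to doc_vec."""
    return max(topic_dirs, key=lambda name: cosine(doc_vec, topic_dirs[name]))


# Hypothetical vectors, invented for illustration only (not from the paper).
topics = {"medicine": [1.0, 0.1, 0.0], "sports": [0.0, 0.2, 1.0]}
short_doc = [2.0, 0.3, 0.1]
long_doc = [20.0, 3.0, 1.0]  # same direction, ten times the magnitude

print(nearest_topic(short_doc, topics))
print(nearest_topic(long_doc, topics))
```

Because both documents point in the same direction, they receive the same topic even though their magnitudes differ by a factor of ten. A Euclidean distance to the topic vectors would treat them very differently, which is one intuition for preferring directional (spherical) representations of text.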

Similar articles

1. Combining Knowledge Graph and Word Embeddings for Spherical Topic Modeling.
   IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3609-3623. doi: 10.1109/TNNLS.2021.3112045. Epub 2023 Jul 6.
2. Investigating the Efficient Use of Word Embedding with Neural-Topic Models for Interpretable Topics from Short Texts.
   Sensors (Basel). 2022 Jan 23;22(3):852. doi: 10.3390/s22030852.
3. Multi-granularity heterogeneous graph attention networks for extractive document summarization.
   Neural Netw. 2022 Nov;155:340-347. doi: 10.1016/j.neunet.2022.08.021. Epub 2022 Sep 5.
4. A comparison of word embeddings for the biomedical natural language processing.
   J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
5. Utility of General and Specific Word Embeddings for Classifying Translational Stages of Research.
   AMIA Annu Symp Proc. 2018 Dec 5;2018:1405-1414. eCollection 2018.
6. A Topic Recognition Method of News Text Based on Word Embedding Enhancement.
   Comput Intell Neurosci. 2022 Feb 16;2022:4582480. doi: 10.1155/2022/4582480. eCollection 2022.
7. A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text.
   AMIA Annu Symp Proc. 2015 Nov 5;2015:1326-33. eCollection 2015.
8. Combining background knowledge and learned topics.
   Top Cogn Sci. 2011 Jan;3(1):18-47. doi: 10.1111/j.1756-8765.2010.01097.x. Epub 2010 May 27.
9. Use of word and graph embedding to measure semantic relatedness between Unified Medical Language System concepts.
   J Am Med Inform Assoc. 2020 Oct 1;27(10):1538-1546. doi: 10.1093/jamia/ocaa136.
10. Nonparametric Spherical Topic Modeling with Word Embeddings.
    Proc Conf Assoc Comput Linguist Meet. 2016 Aug;2016:537-542. doi: 10.18653/v1/P16-2087.