• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关键词评估中直接匹配的失败之外:基于图的解决方案概述。

Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution.

作者信息

Kölbl Max, Kyogoku Yuki, Philipp J Nathanael, Richter Michael, Rietdorf Clements, Yousef Tariq

机构信息

Institute of Computer Science, NLP Group, Universität Leipzig, Leipzig, Germany.

出版信息

Front Artif Intell. 2022 Mar 24;5:801564. doi: 10.3389/frai.2022.801564. eCollection 2022.

DOI:10.3389/frai.2022.801564
PMID:35402902
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8988042/
Abstract

The starting point of this paper is the observation that methods based on the direct match of keywords are inadequate because they do not consider the cognitive ability of concept formation and abstraction. We argue that keyword evaluation needs to be based on a semantic model of language capturing the semantic relatedness of words to satisfy the claim of the human-like ability of concept formation and abstraction and achieve better evaluation results. Evaluation of keywords is difficult since semantic informedness is required for this purpose. This model must be capable of identifying semantic relationships such as synonymy, hypernymy, hyponymy, and location-based abstraction. For example, when gathering texts from online sources, one usually finds a few keywords with each text. Still, these keyword sets are neither complete for the text nor are they in themselves closed, i.e., in most cases, the keywords are a random subset of all possible keywords and not that informative w.r.t. the complete keyword set. Therefore all algorithms based on this cannot achieve good evaluation results and provide good/better keywords or even a complete keyword set for a text. As a solution, we propose a word graph that captures all these semantic relationships for a given language. The problem with the hyponym/hyperonym relationship is that, unlike synonyms, it is not bidirectional. Thus the space of keyword sets requires a metric that is non-symmetric, in other words, a . We sketch such a metric that works on our graph. Since it is nearly impossible to obtain such a complete word graph for a language, we propose for the keyword task a simpler graph based on the base text upon which the keyword sets should be evaluated. This reduction is usually sufficient for evaluating keyword sets.

摘要

本文的出发点是观察到基于关键词直接匹配的方法并不充分,因为它们没有考虑概念形成和抽象的认知能力。我们认为,关键词评估需要基于一种语言语义模型,该模型能够捕捉词的语义相关性,以满足类似人类概念形成和抽象能力的要求,并取得更好的评估结果。由于关键词评估需要语义信息,因此这一过程颇具难度。该模型必须能够识别诸如同义、上位、下位以及基于位置的抽象等语义关系。例如,从在线来源收集文本时,通常会为每篇文本找到一些关键词。然而,这些关键词集对于文本而言既不完整,其本身也不是封闭的,即,在大多数情况下,关键词只是所有可能关键词的随机子集,相对于完整的关键词集而言信息量不足。因此,基于此的所有算法都无法取得良好的评估结果,也无法为文本提供优质/更好的关键词,甚至完整的关键词集。作为一种解决方案,我们提出一种词图,它能够捕捉给定语言中的所有这些语义关系。下位词/上位词关系的问题在于,与同义词不同,它不是双向的。因此,关键词集的空间需要一种非对称的度量,换句话说,是一种……我们勾勒出一种适用于我们的图的度量。由于几乎不可能为一种语言获得如此完整的词图,我们针对关键词任务提出一种基于基础文本的更简单的图,应根据该基础文本评估关键词集。这种简化通常足以评估关键词集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f71c/8988042/51563dd839e2/frai-05-801564-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f71c/8988042/be8d53250ad9/frai-05-801564-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f71c/8988042/4d4f88d030aa/frai-05-801564-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f71c/8988042/51563dd839e2/frai-05-801564-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f71c/8988042/be8d53250ad9/frai-05-801564-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f71c/8988042/4d4f88d030aa/frai-05-801564-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f71c/8988042/51563dd839e2/frai-05-801564-g0003.jpg

相似文献

1
Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution.关键词评估中直接匹配的失败之外:基于图的解决方案概述。
Front Artif Intell. 2022 Mar 24;5:801564. doi: 10.3389/frai.2022.801564. eCollection 2022.
2
Word synonym relationships for text analysis: A graph-based approach.基于图的文本分析词同义词关系方法。
PLoS One. 2021 Jul 27;16(7):e0255127. doi: 10.1371/journal.pone.0255127. eCollection 2021.
3
Patent Keyword Extraction Algorithm Based on Distributed Representation for Patent Classification.基于分布式表示的专利分类专利关键词提取算法
Entropy (Basel). 2018 Feb 2;20(2):104. doi: 10.3390/e20020104.
4
Use of word and graph embedding to measure semantic relatedness between Unified Medical Language System concepts.使用词和图嵌入来衡量统一医学语言系统概念之间的语义相关性。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1538-1546. doi: 10.1093/jamia/ocaa136.
5
Sketch Matching on Topology Product Graph.拓扑积图上的草图匹配。
IEEE Trans Pattern Anal Mach Intell. 2015 Aug;37(8):1723-9. doi: 10.1109/TPAMI.2014.2369031.
6
FNG-IE: an improved graph-based method for keyword extraction from scholarly big-data.FNG-IE:一种用于从学术大数据中提取关键词的改进的基于图的方法。
PeerJ Comput Sci. 2021 Mar 11;7:e389. doi: 10.7717/peerj-cs.389. eCollection 2021.
7
Automatic medical image annotation and keyword-based image retrieval using relevance feedback.基于相关性反馈的自动医学图像标注和基于关键词的图像检索。
J Digit Imaging. 2012 Aug;25(4):454-65. doi: 10.1007/s10278-011-9443-5.
8
Text-Graph Enhanced Knowledge Graph Representation Learning.文本-图增强的知识图谱表示学习
Front Artif Intell. 2021 Aug 17;4:697856. doi: 10.3389/frai.2021.697856. eCollection 2021.
9
Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion.基于路径的知识推理与文本语义信息融合的医疗知识图谱补全方法
BMC Med Inform Decis Mak. 2021 Nov 29;21(Suppl 9):335. doi: 10.1186/s12911-021-01622-7.
10
Using a high-dimensional graph of semantic space to model relationships among words.利用语义空间的高维图来建模词与词之间的关系。
Front Psychol. 2014 May 12;5:385. doi: 10.3389/fpsyg.2014.00385. eCollection 2014.

本文引用的文献

1
How children explore the phonological network in child-directed speech: A survival analysis of children's first word productions.儿童如何在儿童导向言语中探索语音网络:对儿童首次单词产出的生存分析
J Mem Lang. 2014 Aug;75:159-180. doi: 10.1016/j.jml.2014.05.005.
2
Networks in cognitive science.认知科学中的网络。
Trends Cogn Sci. 2013 Jul;17(7):348-60. doi: 10.1016/j.tics.2013.04.010. Epub 2013 May 30.
3
What can graph theory tell us about word learning and lexical retrieval?图论能告诉我们关于词汇学习和词汇检索的哪些信息?
J Speech Lang Hear Res. 2008 Apr;51(2):408-22. doi: 10.1044/1092-4388(2008/030).
4
Restructuring of similarity neighbourhoods in the developing mental lexicon.发展中的心理词汇库中相似性邻域的重组。
J Child Lang. 2002 May;29(2):251-74. doi: 10.1017/s0305000902005032.
5
Global organization of the Wordnet lexicon.Wordnet词典的全球组织架构。
Proc Natl Acad Sci U S A. 2002 Feb 5;99(3):1742-7. doi: 10.1073/pnas.022341799.
6
Language as an evolving word web.语言是一个不断演变的词汇网络。
Proc Biol Sci. 2001 Dec 22;268(1485):2603-6. doi: 10.1098/rspb.2001.1824.