Structural semantic interconnections: a knowledge-based approach to word sense disambiguation.

Authors

Navigli Roberto, Velardi Paola

Affiliation

Dipartimento di Informatica, Università di Roma La Sapienza, via Salaria 113, 00198 Roma, Italy.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1075-86. doi: 10.1109/TPAMI.2005.149.

DOI:10.1109/TPAMI.2005.149
PMID:16013755
Abstract

Word Sense Disambiguation (WSD) is traditionally considered an AI-hard problem. A breakthrough in this field would have a significant impact on many relevant Web-based applications, such as Web information retrieval, improved access to Web services, information extraction, etc. Early approaches to WSD, based on knowledge representation techniques, have been replaced in the past few years by more robust machine learning and statistical techniques. The results of recent comparative evaluations of WSD systems, however, show that these methods have inherent limitations. On the other hand, the increasing availability of large-scale, rich lexical knowledge resources seems to provide new challenges to knowledge-based approaches. In this paper, we present a method, called structural semantic interconnections (SSI), which creates structural specifications of the possible senses for each word in a context and selects the best hypothesis according to a grammar G, describing relations between sense specifications. Sense specifications are created from several available lexical resources that we integrated in part manually, in part with the help of automatic procedures. The SSI algorithm has been applied to different semantic disambiguation problems, like automatic ontology population, disambiguation of sentences in generic texts, and disambiguation of words in glossary definitions. Evaluation experiments have been performed on specific knowledge domains (e.g., tourism, computer networks, enterprise interoperability), as well as on standard disambiguation test sets.
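
To make the idea of selecting senses by their interconnections more concrete, the following is a minimal Python sketch. It is not the authors' SSI implementation. The abstract does not detail the grammar G, so the sketch substitutes a much simpler rule, counting direct relations between a candidate sense and the senses already chosen. The knowledge base `RELATIONS`, the sense inventory `SENSES`, and all sense identifiers are hypothetical toy data invented for illustration.

```python
from typing import Dict, List, Set

# Hypothetical toy knowledge base: sense id -> directly related sense ids.
RELATIONS: Dict[str, Set[str]] = {
    "bank#finance": {"money#1", "loan#1"},
    "bank#river": {"water#1", "shore#1"},
    "money#1": {"bank#finance", "loan#1"},
    "interest#finance": {"loan#1", "money#1"},
    "interest#curiosity": {"attention#1"},
}

# Hypothetical sense inventory: word -> candidate senses.
SENSES: Dict[str, List[str]] = {
    "bank": ["bank#finance", "bank#river"],
    "money": ["money#1"],
    "interest": ["interest#finance", "interest#curiosity"],
}


def connected(a: str, b: str) -> bool:
    """True if the toy knowledge base links the two senses in either direction."""
    return b in RELATIONS.get(a, set()) or a in RELATIONS.get(b, set())


def score(sense: str, context: Set[str]) -> int:
    """Number of already-chosen senses that the candidate is linked to."""
    return sum(1 for c in context if connected(sense, c))


def disambiguate(words: List[str]) -> Dict[str, str]:
    """Greedy sketch: seed with unambiguous words, then repeatedly fix the
    (word, sense) pair that is best connected to the senses chosen so far."""
    chosen = {w: SENSES[w][0] for w in words if len(SENSES[w]) == 1}
    pending = [w for w in words if w not in chosen]
    while pending:
        context = set(chosen.values())
        candidates = [(score(s, context), w, s) for w in pending for s in SENSES[w]]
        best_score, word, sense = max(candidates, key=lambda t: t[0])
        if best_score == 0:
            break  # no evidence left; remaining words stay undecided
        chosen[word] = sense
        pending.remove(word)
    return chosen


result = disambiguate(["bank", "money", "interest"])
print(result)
```

In this toy run the unambiguous word "money" seeds the context, which then favors the financial senses of "bank" and "interest". A word with no connection to the context is left undecided rather than guessed.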
