Identifying synonymy between relational phrases using word embeddings.

Author information

Nguyen Nhung T H, Miwa Makoto, Tsuruoka Yoshimasa, Tojo Satoshi

Affiliations

University of Science, Vietnam National University, Ho Chi Minh City, 227 Nguyen Van Cu St., Ward 4, Dist. 5, Ho Chi Minh City, Viet Nam; Japan Advanced Institute of Science and Technology, 1-8 Asahidai, Nomi-shi, Ishikawa 923-1292, Japan.

Toyota Technological Institute, 2-12-1 Hisakata, Tempaku-ku, Nagoya 468-8511, Japan.

Publication information

J Biomed Inform. 2015 Aug;56:94-102. doi: 10.1016/j.jbi.2015.05.010. Epub 2015 May 22.

DOI:10.1016/j.jbi.2015.05.010
PMID:26004792
Abstract

Many text mining applications in the biomedical domain benefit from automatic clustering of relational phrases into synonymous groups, since it alleviates the problem of spurious mismatches caused by the diversity of natural language expressions. Most of the previous work that has addressed this task of synonymy resolution uses similarity metrics between relational phrases based on textual strings or dependency paths, which, for the most part, ignore the context around the relations. To overcome this shortcoming, we employ a word embedding technique to encode relational phrases. We then apply the k-means algorithm on top of the distributional representations to cluster the phrases. Our experimental results show that this approach outperforms state-of-the-art statistical models including latent Dirichlet allocation and Markov logic networks.
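The pipeline described in the abstract (encode each relational phrase as a vector, then cluster the vectors with k-means) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy 3-dimensional vectors in `TOY_EMBEDDINGS`, the choice to encode a phrase as the average of its word vectors, the deterministic farthest-point initialization, and all function names. It is not the authors' implementation.

```python
# Toy word vectors, invented for illustration only. A real system would
# load embeddings trained on a large biomedical corpus.
TOY_EMBEDDINGS = {
    "inhibits": [0.90, 0.10, 0.00],
    "suppresses": [0.85, 0.15, 0.05],
    "blocks": [0.80, 0.20, 0.00],
    "activates": [0.10, 0.90, 0.00],
    "induces": [0.15, 0.85, 0.05],
    "stimulates": [0.10, 0.80, 0.10],
    "the": [0.30, 0.30, 0.30],
    "of": [0.30, 0.30, 0.30],
    "expression": [0.40, 0.40, 0.20],
}

PHRASES = [
    "inhibits the expression of",
    "suppresses expression of",
    "blocks the expression of",
    "activates the expression of",
    "induces expression of",
    "stimulates the expression of",
]


def embed_phrase(phrase, embeddings):
    """Encode a phrase as the mean of its known word vectors (an assumption)."""
    vectors = [embeddings[w] for w in phrase.lower().split() if w in embeddings]
    if not vectors:
        raise ValueError(f"no known words in phrase: {phrase!r}")
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def farthest_point_init(points, k):
    """Deterministic seeding: start at the first point, then repeatedly add
    the point farthest from all centroids chosen so far."""
    centroids = [list(points[0])]
    while len(centroids) < k:
        farthest = max(
            points,
            key=lambda p: min(squared_distance(p, c) for c in centroids),
        )
        centroids.append(list(farthest))
    return centroids


def kmeans(points, k, iterations=20):
    """Plain k-means: alternate assignment and centroid update steps."""
    centroids = farthest_point_init(points, k)
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def cluster_phrases(phrases, k, embeddings=TOY_EMBEDDINGS):
    """Embed every phrase, then cluster the phrase vectors."""
    points = [embed_phrase(p, embeddings) for p in phrases]
    return kmeans(points, k)


labels = cluster_phrases(PHRASES, k=2)
for phrase, label in zip(PHRASES, labels):
    print(label, phrase)
```

With these toy vectors, the inhibition-like phrases and the activation-like phrases should fall into separate clusters. On real data the number of clusters `k` and the embedding model would both need to be chosen and validated empirically.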

Similar articles

1. Identifying synonymy between relational phrases using word embeddings.
   J Biomed Inform. 2015 Aug;56:94-102. doi: 10.1016/j.jbi.2015.05.010. Epub 2015 May 22.
2. A comparison of word embeddings for the biomedical natural language processing.
   J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
3. Knowledge based word-concept model estimation and refinement for biomedical text mining.
   J Biomed Inform. 2015 Feb;53:300-7. doi: 10.1016/j.jbi.2014.11.015. Epub 2014 Dec 12.
4. deepBioWSD: effective deep neural word sense disambiguation of biomedical text data.
   J Am Med Inform Assoc. 2019 May 1;26(5):438-446. doi: 10.1093/jamia/ocy189.
5. Expanding a radiology lexicon using contextual patterns in radiology reports.
   J Am Med Inform Assoc. 2018 Jun 1;25(6):679-685. doi: 10.1093/jamia/ocx152.
6. Constructing a Graph Database for Semantic Literature-Based Discovery.
   Stud Health Technol Inform. 2015;216:1094.
7. Deep contextualized embeddings for quantifying the informative content in biomedical text summarization.
   Comput Methods Programs Biomed. 2020 Feb;184:105117. doi: 10.1016/j.cmpb.2019.105117. Epub 2019 Oct 4.
8. Filtering large-scale event collections using a combination of supervised and unsupervised learning for event trigger classification.
   J Biomed Semantics. 2016 May 11;7:27. doi: 10.1186/s13326-016-0070-4. eCollection 2016.
9. Speculation detection for Chinese clinical notes: Impacts of word segmentation and embedding models.
   J Biomed Inform. 2016 Apr;60:334-41. doi: 10.1016/j.jbi.2016.02.011. Epub 2016 Feb 26.
10. A novel framework for biomedical entity sense induction.
    J Biomed Inform. 2018 Aug;84:31-41. doi: 10.1016/j.jbi.2018.06.007. Epub 2018 Jun 20.