• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于依存树模式从医学文献中提取信息性因果关系

Informative Causality Extraction from Medical Literature via Dependency-Tree-Based Patterns.

作者信息

Kabir M Ahsanul, Almulhim AlJohara, Luo Xiao, Al Hasan Mohammad

机构信息

Department of Computer Science, Indiana University Purdue University Indianapolis, Indianapolis, IN USA.

Department of Computer Information and Graphics Technology, Indiana University Purdue University, Indianapolis, IN USA.

出版信息

J Healthc Inform Res. 2022 May 25;6(3):295-316. doi: 10.1007/s41666-022-00116-z. eCollection 2022 Sep.

DOI:10.1007/s41666-022-00116-z
PMID:35637864
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9131716/
Abstract

Extracting cause-effect entities from medical literature is an important task in medical information retrieval. A solution for solving this task can be used for compilation of various causality relations, such as causality between disease and symptoms, between medications and side effects, and between genes and diseases. Existing solutions for extracting cause-effect entities work well for sentences where the cause and the effect phrases are name entities, single-word nouns, or noun phrases consisting of two to three words. Unfortunately, in medical literature, cause and effect phrases in a sentence are not simply nouns or noun phrases, rather they are complex phrases consisting of several words, and existing methods fail to correctly extract the cause and effect entities in such sentences. Partial extraction of cause and effect entities conveys poor quality, non-informative, and often, contradictory facts, comparing to the one intended in the given sentence. In this work, we solve this problem by designing an unsupervised method for cause and effect phrase extraction, PatternCausality, which is specifically suitable for the medical literature. Our proposed approach first uses a collection of cause-effect dependency patterns as template to extract head words of cause and effect phrases and then it uses a novel phrase extraction method to obtain complete and meaningful cause and effect phrases from a sentence. Experiments on a cause-effect dataset built from sentences from PubMed articles show that for extracting cause and effect entities, PatternCausality is substantially better than the existing methods-with an order of magnitude improvement in the -score metric over the best of the existing methods. We also build different variants of PatternCausality, which use different phrase extraction methods; all variants are better than the existing methods. PatternCausality and its variants also show modest performance improvement over the existing methods for extracting cause and effect entities in a domain-neutral benchmark dataset, in which cause and effect entities are nouns or noun phrases consisting of one to two words.

摘要

从医学文献中提取因果实体是医学信息检索中的一项重要任务。解决此任务的一种方法可用于编译各种因果关系,例如疾病与症状之间、药物与副作用之间以及基因与疾病之间的因果关系。现有的提取因果实体的方法对于因果短语为命名实体、单字名词或由两到三个单词组成的名词短语的句子效果良好。不幸的是,在医学文献中,句子中的因果短语并非简单的名词或名词短语,而是由几个单词组成的复杂短语,现有方法无法正确提取此类句子中的因果实体。与给定句子中预期的完整提取相比,因果实体的部分提取传达的质量较差、信息不足且往往相互矛盾。在这项工作中,我们通过设计一种无监督的因果短语提取方法PatternCausality来解决此问题,该方法特别适用于医学文献。我们提出的方法首先使用一组因果依赖模式作为模板来提取因果短语的中心词,然后使用一种新颖的短语提取方法从句子中获取完整且有意义的因果短语。对从PubMed文章句子构建的因果数据集进行的实验表明,对于提取因果实体,PatternCausality比现有方法要好得多——在F1分数指标上比现有最佳方法有一个数量级的提升。我们还构建了PatternCausality的不同变体,它们使用不同的短语提取方法;所有变体都比现有方法更好。在一个领域中立的基准数据集中,因果实体是由一到两个单词组成的名词或名词短语,PatternCausality及其变体在提取因果实体方面也比现有方法有适度的性能提升。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62e7/9309116/cf423cb18a4f/41666_2022_116_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62e7/9309116/12aa38eab4bd/41666_2022_116_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62e7/9309116/ba6aad201e21/41666_2022_116_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62e7/9309116/8e344455483d/41666_2022_116_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62e7/9309116/cf423cb18a4f/41666_2022_116_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62e7/9309116/12aa38eab4bd/41666_2022_116_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62e7/9309116/ba6aad201e21/41666_2022_116_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62e7/9309116/8e344455483d/41666_2022_116_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62e7/9309116/cf423cb18a4f/41666_2022_116_Fig4_HTML.jpg

相似文献

1
Informative Causality Extraction from Medical Literature via Dependency-Tree-Based Patterns.基于依存树模式从医学文献中提取信息性因果关系
J Healthc Inform Res. 2022 May 25;6(3):295-316. doi: 10.1007/s41666-022-00116-z. eCollection 2022 Sep.
2
Extracting noun phrases for all of MEDLINE.提取整个医学文献数据库(MEDLINE)中的名词短语。
Proc AMIA Symp. 1999:671-5.
3
Parallelism Between Sentence Structure and Nominal Phrases in Japanese: Evidence from Scrambled Instrumental and Locative Adverbial Phrases.日语句子结构与名词短语的平行关系:来自乱序工具格和处所状语短语的证据。
J Psycholinguist Res. 2022 Jun;51(3):501-519. doi: 10.1007/s10936-022-09843-1. Epub 2022 Apr 6.
4
Concept abstractness and the representation of noun-noun combinations.概念抽象性与名词-名词组合的表征
J Psycholinguist Res. 2013 Oct;42(5):413-31. doi: 10.1007/s10936-012-9226-2.
5
Effects of phrase and word frequencies in noun phrase production.名词短语生成中短语和单词频率的影响。
J Exp Psychol Learn Mem Cogn. 2019 Jan;45(1):147-165. doi: 10.1037/xlm0000570. Epub 2018 Apr 26.
6
High gamma response tracks different syntactic structures in homophonous phrases.高伽马响应可追踪同音异义词短语中的不同句法结构。
Sci Rep. 2020 May 5;10(1):7537. doi: 10.1038/s41598-020-64375-9.
7
PubMed Phrases, an open set of coherent phrases for searching biomedical literature.PubMed 词组,一组用于搜索生物医学文献的开放式连贯词组。
Sci Data. 2018 Jun 12;5:180104. doi: 10.1038/sdata.2018.104.
8
Automatic extraction of semantic relations between medical entities: a rule based approach.医学实体之间语义关系的自动提取:一种基于规则的方法。
J Biomed Semantics. 2011 Oct 6;2 Suppl 5(Suppl 5):S4. doi: 10.1186/2041-1480-2-S5-S4.
9
Exploiting graph kernels for high performance biomedical relation extraction.利用图核进行高性能生物医学关系提取。
J Biomed Semantics. 2018 Jan 30;9(1):7. doi: 10.1186/s13326-017-0168-3.
10
The correspondence between sentence production and corpus frequencies in modifier attachment.修饰语附着中句子生成与语料库频率之间的对应关系。
Q J Exp Psychol A. 2002 Jul;55(3):879-96. doi: 10.1080/02724980143000604.

引用本文的文献

1
Causal relationships between diseases mined from the literature improve the use of polygenic risk scores.从文献中挖掘出的疾病因果关系可提高多基因风险评分的使用。
Bioinformatics. 2024 Nov 1;40(11). doi: 10.1093/bioinformatics/btae639.