文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

BIOSMILE: a semantic role labeling system for biomedical verbs using a maximum-entropy model with automatically generated template features.

作者信息

Tsai Richard Tzong-Han, Chou Wen-Chi, Su Ying-Shan, Lin Yu-Chun, Sung Cheng-Lung, Dai Hong-Jie, Yeh Irene Tzu-Hsuan, Ku Wei, Sung Ting-Yi, Hsu Wen-Lian

机构信息

Institute of Information Science, Academia Sinica, Nankang, Taipei 115, Taiwan, PRoC.

出版信息

BMC Bioinformatics. 2007 Sep 1;8:325. doi: 10.1186/1471-2105-8-325.


DOI:10.1186/1471-2105-8-325
PMID:17764570
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2072962/
Abstract

BACKGROUND: Bioinformatics tools for automatic processing of biomedical literature are invaluable for both the design and interpretation of large-scale experiments. Many information extraction (IE) systems that incorporate natural language processing (NLP) techniques have thus been developed for use in the biomedical field. A key IE task in this field is the extraction of biomedical relations, such as protein-protein and gene-disease interactions. However, most biomedical relation extraction systems usually ignore adverbial and prepositional phrases and words identifying location, manner, timing, and condition, which are essential for describing biomedical relations. Semantic role labeling (SRL) is a natural language processing technique that identifies the semantic roles of these words or phrases in sentences and expresses them as predicate-argument structures. We construct a biomedical SRL system called BIOSMILE that uses a maximum entropy (ME) machine-learning model to extract biomedical relations. BIOSMILE is trained on BioProp, our semi-automatic, annotated biomedical proposition bank. Currently, we are focusing on 30 biomedical verbs that are frequently used or considered important for describing molecular events. RESULTS: To evaluate the performance of BIOSMILE, we conducted two experiments to (1) compare the performance of SRL systems trained on newswire and biomedical corpora; and (2) examine the effects of using biomedical-specific features. The experimental results show that using BioProp improves the F-score of the SRL system by 21.45% over an SRL system that uses a newswire corpus. It is noteworthy that adding automatically generated template features improves the overall F-score by a further 0.52%. Specifically, ArgM-LOC, ArgM-MNR, and Arg2 achieve statistically significant performance improvements of 3.33%, 2.27%, and 1.44%, respectively. CONCLUSION: We demonstrate the necessity of using a biomedical proposition bank for training SRL systems in the biomedical domain. Besides the different characteristics of biomedical and newswire sentences, factors such as cross-domain framesets and verb usage variations also influence the performance of SRL systems. For argument classification, we find that NE (named entity) features indicating if the target node matches with NEs are not effective, since NEs may match with a node of the parsing tree that does not have semantic role labels in the training set. We therefore incorporate templates composed of specific words, NE types, and POS tags into the SRL system. As a result, the classification accuracy for adjunct arguments, which is especially important for biomedical SRL, is improved significantly.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/f9ba65e0d372/1471-2105-8-325-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/ae693e6728f8/1471-2105-8-325-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/2f835cf7b0a6/1471-2105-8-325-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/9e14c3d6d2d2/1471-2105-8-325-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/a4b3a714b374/1471-2105-8-325-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/b84d31e7c5ba/1471-2105-8-325-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/f9ba65e0d372/1471-2105-8-325-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/ae693e6728f8/1471-2105-8-325-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/2f835cf7b0a6/1471-2105-8-325-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/9e14c3d6d2d2/1471-2105-8-325-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/a4b3a714b374/1471-2105-8-325-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/b84d31e7c5ba/1471-2105-8-325-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f9f/2072962/f9ba65e0d372/1471-2105-8-325-2.jpg

相似文献

[1]
BIOSMILE: a semantic role labeling system for biomedical verbs using a maximum-entropy model with automatically generated template features.

BMC Bioinformatics. 2007-9-1

[2]
Semi-automatic conversion of BioProp semantic annotation to PASBio annotation.

BMC Bioinformatics. 2008-12-12

[3]
Semantic role labeling for protein transport predicates.

BMC Bioinformatics. 2008-6-11

[4]
A resource-saving collective approach to biomedical semantic role labeling.

BMC Bioinformatics. 2014-5-27

[5]
Domain adaptation for semantic role labeling in the biomedical domain.

Bioinformatics. 2010-2-23

[6]
Domain adaptation for semantic role labeling of clinical text.

J Am Med Inform Assoc. 2015-9

[7]
A critical review of PASBio's argument structures for biomedical verbs.

BMC Bioinformatics. 2006-11-24

[8]
Automatic identification and classification of noun argument structures in biomedical literature.

IEEE/ACM Trans Comput Biol Bioinform. 2012

[9]
A hybrid method for relation extraction from biomedical literature.

Int J Med Inform. 2006-6

[10]
Construction of an annotated corpus to support biomedical information extraction.

BMC Bioinformatics. 2009-10-23

引用本文的文献

[1]
A context-based ABC model for literature-based discovery.

PLoS One. 2019-4-24

[2]
Evaluating Casama: Contextualized semantic maps for summarization of lung cancer studies.

Comput Biol Med. 2017-11-3

[3]
Toward patient-tailored summarization of lung cancer literature.

IEEE EMBS Int Conf Biomed Health Inform. 2016-2

[4]
Semantic Role Labeling of Clinical Text: Comparing Syntactic Parsers and Features.

AMIA Annu Symp Proc. 2017-2-10

[5]
BelSmile: a biomedical semantic role labeling approach for extracting biological expression language from text.

Database (Oxford). 2016-5-12

[6]
Domain adaptation for semantic role labeling of clinical text.

J Am Med Inform Assoc. 2015-9

[7]
BioC interoperability track overview.

Database (Oxford). 2014-6-30

[8]
A resource-saving collective approach to biomedical semantic role labeling.

BMC Bioinformatics. 2014-5-27

[9]
The BioLexicon: a large-scale terminological resource for biomedical text mining.

BMC Bioinformatics. 2011-10-12

[10]
eFIP: a tool for mining functional impact of phosphorylation from literature.

Methods Mol Biol. 2011

本文引用的文献

[1]
Towards semantic role labeling & IE in the medical literature.

AMIA Annu Symp Proc. 2005

[2]
An online literature mining tool for protein phosphorylation.

Bioinformatics. 2006-7-1

[3]
LSAT: learning about alternative transcripts in MEDLINE.

Bioinformatics. 2006-4-1

[4]
Consolidating the set of known human protein-protein interactions in preparation for large-scale mapping of the human interactome.

Genome Biol. 2005

[5]
A probabilistic functional network of yeast genes.

Science. 2004-11-26

[6]
PASBio: predicate-argument structures for event extraction in molecular biology.

BMC Bioinformatics. 2004-10-19

[7]
Content-rich biological network constructed by mining PubMed abstracts.

BMC Bioinformatics. 2004-10-8

[8]
Extending the mutual information measure to rank inferred literature relationships.

BMC Bioinformatics. 2004-10-7

[9]
GENIA corpus--semantically annotated corpus for bio-textmining.

Bioinformatics. 2003

[10]
PreBIND and Textomy--mining the biomedical literature for protein-protein interactions using a support vector machine.

BMC Bioinformatics. 2003-3-27

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索