• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于联合评分机制的生物医学事件抽取不平衡数据的新型样本选择策略。

A Novel Sample Selection Strategy for Imbalanced Data of Biomedical Event Extraction with Joint Scoring Mechanism.

作者信息

Lu Yang, Ma Xiaolei, Lu Yinan, Zhou Yuxin, Pei Zhili

机构信息

College of Computer Science and Technology, Jilin University, Changchun, Jilin 130000, China; Library, Inner Mongolia University for Nationalities, Tongliao, Inner Mongolia 028000, China.

College of Computer Science and Technology, Jilin University, Changchun, Jilin 130000, China.

出版信息

Comput Math Methods Med. 2016;2016:7536494. doi: 10.1155/2016/7536494. Epub 2016 Dec 14.

DOI:10.1155/2016/7536494
PMID:28096894
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5206857/
Abstract

Biomedical event extraction is an important and difficult task in bioinformatics. With the rapid growth of biomedical literature, the extraction of complex events from unstructured text has attracted more attention. However, the annotated biomedical corpus is highly imbalanced, which affects the performance of the classification algorithms. In this study, a sample selection algorithm based on sequential pattern is proposed to filter negative samples in the training phase. Considering the joint information between the trigger and argument of multiargument events, we extract triplets of multiargument events directly using a support vector machine classifier. A joint scoring mechanism, which is based on sentence similarity and importance of trigger in the training data, is used to correct the predicted results. Experimental results indicate that the proposed method can extract events efficiently.

摘要

生物医学事件提取是生物信息学中一项重要且困难的任务。随着生物医学文献的快速增长,从非结构化文本中提取复杂事件受到了更多关注。然而,带注释的生物医学语料库高度不均衡,这影响了分类算法的性能。在本研究中,提出了一种基于序列模式的样本选择算法,用于在训练阶段过滤负样本。考虑到多论点事件的触发词和论据之间的联合信息,我们直接使用支持向量机分类器提取多论点事件的三元组。一种基于句子相似度和训练数据中触发词重要性的联合评分机制用于校正预测结果。实验结果表明,所提出的方法能够高效地提取事件。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8b3/5206857/c34fbc0f4c55/CMMM2016-7536494.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8b3/5206857/1bdff6cb9fda/CMMM2016-7536494.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8b3/5206857/718cdac52c2e/CMMM2016-7536494.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8b3/5206857/3de5923efc4a/CMMM2016-7536494.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8b3/5206857/9ed07ab1d0ff/CMMM2016-7536494.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8b3/5206857/c34fbc0f4c55/CMMM2016-7536494.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8b3/5206857/1bdff6cb9fda/CMMM2016-7536494.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8b3/5206857/718cdac52c2e/CMMM2016-7536494.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8b3/5206857/3de5923efc4a/CMMM2016-7536494.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8b3/5206857/9ed07ab1d0ff/CMMM2016-7536494.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8b3/5206857/c34fbc0f4c55/CMMM2016-7536494.alg.001.jpg

相似文献

1
A Novel Sample Selection Strategy for Imbalanced Data of Biomedical Event Extraction with Joint Scoring Mechanism.一种基于联合评分机制的生物医学事件抽取不平衡数据的新型样本选择策略。
Comput Math Methods Med. 2016;2016:7536494. doi: 10.1155/2016/7536494. Epub 2016 Dec 14.
2
Extracting biomedical events from pairs of text entities.从文本实体对中提取生物医学事件。
BMC Bioinformatics. 2015;16 Suppl 10(Suppl 10):S8. doi: 10.1186/1471-2105-16-S10-S8. Epub 2015 Jul 13.
3
Automatic identification and classification of noun argument structures in biomedical literature.生物医学文献中名词论元结构的自动识别与分类。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Nov-Dec;9(6):1639-48. doi: 10.1109/TCBB.2012.111.
4
A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set.一种使用混合特征集的生物医学文本关系提取框架。
Comput Math Methods Med. 2015;2015:910423. doi: 10.1155/2015/910423. Epub 2015 Aug 10.
5
Classification of imbalanced bioinformatics data by using boundary movement-based ELM.基于边界移动的极限学习机对不平衡生物信息学数据的分类
Biomed Mater Eng. 2015;26 Suppl 1:S1855-62. doi: 10.3233/BME-151488.
6
Named entity recognition and classification in biomedical text using classifier ensemble.使用分类器集成进行生物医学文本中的命名实体识别与分类
Int J Data Min Bioinform. 2015;11(4):365-91. doi: 10.1504/ijdmb.2015.067954.
7
Self-training in significance space of support vectors for imbalanced biomedical event data.针对不平衡生物医学事件数据在支持向量的显著性空间中进行自训练。
BMC Bioinformatics. 2015;16 Suppl 7(Suppl 7):S6. doi: 10.1186/1471-2105-16-S7-S6. Epub 2015 Apr 23.
8
Biomedical events extraction using the hidden vector state model.基于隐向量状态模型的生物医学事件抽取。
Artif Intell Med. 2011 Nov;53(3):205-13. doi: 10.1016/j.artmed.2011.08.002. Epub 2011 Sep 25.
9
Drug-Drug Interaction Extraction via Convolutional Neural Networks.通过卷积神经网络进行药物-药物相互作用提取
Comput Math Methods Med. 2016;2016:6918381. doi: 10.1155/2016/6918381. Epub 2016 Jan 31.
10
Active learning for ontological event extraction incorporating named entity recognition and unknown word handling.结合命名实体识别和未知词处理的本体事件抽取的主动学习
J Biomed Semantics. 2016 Apr 27;7:22. doi: 10.1186/s13326-016-0059-z. eCollection 2016.

引用本文的文献

1
Involvement of Machine Learning Tools in Healthcare Decision Making.机器学习工具在医疗保健决策中的应用。
J Healthc Eng. 2021 Jan 27;2021:6679512. doi: 10.1155/2021/6679512. eCollection 2021.

本文引用的文献

1
Extracting Biomedical Event with Dual Decomposition Integrating Word Embeddings.结合词嵌入通过对偶分解提取生物医学事件
IEEE/ACM Trans Comput Biol Bioinform. 2016 Jul-Aug;13(4):669-77. doi: 10.1109/TCBB.2015.2476876. Epub 2015 Sep 7.
2
Self-training in significance space of support vectors for imbalanced biomedical event data.针对不平衡生物医学事件数据在支持向量的显著性空间中进行自训练。
BMC Bioinformatics. 2015;16 Suppl 7(Suppl 7):S6. doi: 10.1186/1471-2105-16-S7-S6. Epub 2015 Apr 23.
3
Event trigger identification for biomedical events extraction using domain knowledge.
利用领域知识进行生物医学事件抽取的事件触发词识别。
Bioinformatics. 2014 Jun 1;30(11):1587-94. doi: 10.1093/bioinformatics/btu061. Epub 2014 Jan 30.
4
TrigNER: automatically optimized biomedical event trigger recognition on scientific documents.TrigNER:科学文档上自动优化的生物医学事件触发识别
Source Code Biol Med. 2014 Jan 8;9(1):1. doi: 10.1186/1751-0473-9-1.
5
An overview of ABC and SLC drug transporter gene regulation.ABC 和 SLC 药物转运体基因调控概述。
Curr Drug Metab. 2013 Feb;14(2):253-64.
6
Event extraction across multiple levels of biological organization.跨多个生物学组织层次的事件提取。
Bioinformatics. 2012 Sep 15;28(18):i575-i581. doi: 10.1093/bioinformatics/bts407.
7
University of Turku in the BioNLP'11 Shared Task.图尔库大学在 BioNLP'11 共享任务中的贡献。
BMC Bioinformatics. 2012 Jun 26;13 Suppl 11(Suppl 11):S4. doi: 10.1186/1471-2105-13-S11-S4.
8
Disease gene identification strategies for exome sequencing.外显子组测序的疾病基因鉴定策略。
Eur J Hum Genet. 2012 May;20(5):490-7. doi: 10.1038/ejhg.2011.258. Epub 2012 Jan 18.
9
Towards the prediction of protein interaction partners using physical docking.利用物理对接预测蛋白质相互作用伙伴。
Mol Syst Biol. 2011 Feb 15;7:469. doi: 10.1038/msb.2011.3.
10
Event extraction with complex event classification using rich features.利用丰富特征进行复杂事件分类的事件抽取。
J Bioinform Comput Biol. 2010 Feb;8(1):131-46. doi: 10.1142/s0219720010004586.