Suppr超能文献

用于生物关系提取的多通道卷积神经网络。

Multichannel Convolutional Neural Network for Biological Relation Extraction.

作者信息

Quan Chanqin, Hua Lei, Sun Xiao, Bai Wenjun

机构信息

Graduate School of System Informatics, Kobe University, Kobe, Japan.

Department of Computer and Information Science, Hefei University of Technology, Hefei, China.

出版信息

Biomed Res Int. 2016;2016:1850404. doi: 10.1155/2016/1850404. Epub 2016 Dec 7.

Abstract

The plethora of biomedical relations which are embedded in medical logs (records) demands researchers' attention. Previous theoretical and practical focuses were restricted on traditional machine learning techniques. However, these methods are susceptible to the issues of and data sparseness and the unattainable automation process in feature extraction. To address aforementioned issues, in this work, we propose a multichannel convolutional neural network (MCCNN) for automated biomedical relation extraction. The proposed model has the following two contributions: (1) it enables the fusion of multiple (e.g., five) versions in word embeddings; (2) the need for manual feature engineering can be obviated by automated feature learning with convolutional neural network (CNN). We evaluated our model on two biomedical relation extraction tasks: drug-drug interaction (DDI) extraction and protein-protein interaction (PPI) extraction. For DDI task, our system achieved an overall -score of 70.2% compared to the standard linear SVM based system (e.g., 67.0%) on DDIExtraction 2013 challenge dataset. And for PPI task, we evaluated our system on Aimed and BioInfer PPI corpus; our system exceeded the state-of-art ensemble SVM system by 2.7% and 5.6% on -scores.

摘要

医学日志(记录)中大量的生物医学关系需要研究人员关注。以往的理论和实践重点局限于传统机器学习技术。然而,这些方法容易受到数据稀疏性以及特征提取中难以实现自动化过程等问题的影响。为了解决上述问题,在这项工作中,我们提出了一种用于自动生物医学关系提取的多通道卷积神经网络(MCCNN)。所提出的模型有以下两个贡献:(1)它能够融合词嵌入中的多个(例如五个)版本;(2)通过卷积神经网络(CNN)进行自动特征学习可以避免手动特征工程的需求。我们在两个生物医学关系提取任务上评估了我们的模型:药物 - 药物相互作用(DDI)提取和蛋白质 - 蛋白质相互作用(PPI)提取。对于DDI任务,在2013年DDIExtraction挑战数据集上,与基于标准线性支持向量机的系统(例如67.0%)相比,我们的系统总体F1分数达到了70.2%。对于PPI任务,我们在Aimed和BioInfer PPI语料库上评估了我们的系统;我们的系统在F1分数上比当前最先进的集成支持向量机系统分别高出2.7%和5.6%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/207e/5174749/6d62eb8b97ee/BMRI2016-1850404.001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验