Suppr超能文献

MfeCNN:用于数据映射的混合特征嵌入卷积神经网络。

MfeCNN: Mixture Feature Embedding Convolutional Neural Network for Data Mapping.

出版信息

IEEE Trans Nanobioscience. 2018 Jul;17(3):165-171. doi: 10.1109/TNB.2018.2841053. Epub 2018 May 28.

Abstract

Data mapping plays an important role in data integration and exchanges among institutions and organizations with different data standards. However, traditional rule-based approaches and machine learning methods fail to achieve satisfactory results for the data mapping problem. In this paper, we propose a novel and sophisticated deep learning framework for data mapping called mixture feature embedding convolutional neural network (MfeCNN). The MfeCNN model converts the data mapping task to a multiple classification problem. In the model, we incorporated multimodal learning and multiview embedding into a CNN for mixture feature tensor generation and classification prediction. Multimodal features were extracted from various linguistic spaces with a medical natural language processing package. Then, powerful feature embeddings were learned by using the CNN. As many as 10 classes could be simultaneously classified by a softmax prediction layer based on multiview embedding. MfeCNN achieved the best results on unbalanced data (average F1 score, 82.4%) among the traditional state-of-the-art machine learning models and CNN without mixture feature embedding. Our model also outperformed a very deep CNN with 29 layers, which took free texts as inputs. The combination of mixture feature embedding and a deep neural network can achieve high accuracy for data mapping and multiple classification.

摘要

数据映射在具有不同数据标准的机构和组织之间的数据集成和交换中起着重要作用。然而,传统的基于规则的方法和机器学习方法无法为数据映射问题提供令人满意的结果。在本文中,我们提出了一种新颖而复杂的深度学习框架,称为混合特征嵌入卷积神经网络(MfeCNN),用于数据映射。MfeCNN 模型将数据映射任务转换为多分类问题。在该模型中,我们将多模态学习和多视图嵌入到 CNN 中,用于混合特征张量生成和分类预测。多模态特征是使用医学自然语言处理包从各种语言空间中提取出来的。然后,使用 CNN 学习强大的特征嵌入。基于多视图嵌入的 softmax 预测层可以同时对多达 10 个类进行分类。在传统的最先进的机器学习模型和没有混合特征嵌入的 CNN 中,MfeCNN 在不平衡数据(平均 F1 得分 82.4%)上取得了最佳结果。我们的模型也优于一个具有 29 层的非常深的 CNN,该模型以自由文本作为输入。混合特征嵌入和深度神经网络的结合可以实现数据映射和多分类的高精度。

相似文献

1
MfeCNN: Mixture Feature Embedding Convolutional Neural Network for Data Mapping.
IEEE Trans Nanobioscience. 2018 Jul;17(3):165-171. doi: 10.1109/TNB.2018.2841053. Epub 2018 May 28.
2
Intelligent diagnosis with Chinese electronic medical records based on convolutional neural networks.
BMC Bioinformatics. 2019 Feb 1;20(1):62. doi: 10.1186/s12859-019-2617-8.
3
Clinical text classification with rule-based features and knowledge-guided convolutional neural networks.
BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):71. doi: 10.1186/s12911-019-0781-4.
6
CNN-Siam: multimodal siamese CNN-based deep learning approach for drug‒drug interaction prediction.
BMC Bioinformatics. 2023 Mar 23;24(1):110. doi: 10.1186/s12859-023-05242-y.
10
Amino acid encoding for deep learning applications.
BMC Bioinformatics. 2020 Jun 9;21(1):235. doi: 10.1186/s12859-020-03546-x.

引用本文的文献

1
Evaluating global and local sequence alignment methods for comparing patient medical records.
BMC Med Inform Decis Mak. 2019 Dec 19;19(Suppl 6):263. doi: 10.1186/s12911-019-0965-y.
2
Deep learning in clinical natural language processing: a methodical review.
J Am Med Inform Assoc. 2020 Mar 1;27(3):457-470. doi: 10.1093/jamia/ocz200.
3
Serendipity-A Machine-Learning Application for Mining Serendipitous Drug Usage From Social Media.
IEEE Trans Nanobioscience. 2019 Jul;18(3):324-334. doi: 10.1109/TNB.2019.2909094. Epub 2019 Apr 4.

本文引用的文献

1
A comparison of word embeddings for the biomedical natural language processing.
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
2
Clinical information extraction applications: A literature review.
J Biomed Inform. 2018 Jan;77:34-49. doi: 10.1016/j.jbi.2017.11.011. Epub 2017 Nov 21.
4
Quality Assurance of UMLS Semantic Type Assignments Using SNOMED CT Hierarchies.
Methods Inf Med. 2016;55(2):158-65. doi: 10.3414/ME14-01-0104. Epub 2015 Apr 30.
5
Rule-based support system for multiple UMLS semantic type assignments.
J Biomed Inform. 2013 Feb;46(1):97-110. doi: 10.1016/j.jbi.2012.09.007. Epub 2012 Oct 3.
6
7
Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2).
J Am Med Inform Assoc. 2010 Mar-Apr;17(2):124-30. doi: 10.1136/jamia.2009.000893.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验