Suppr超能文献

一种基于CNN-XGBoost模型的阿尔茨海默病新型蛋白质亚细胞定位方法。

A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer's Disease.

作者信息

Pang Long, Wang Junjie, Zhao Lingling, Wang Chunyu, Zhan Hui

机构信息

Harbin Nebula Bioinformatics Technology Development Co., Ltd., Harbin, China.

School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.

出版信息

Front Genet. 2019 Jan 18;9:751. doi: 10.3389/fgene.2018.00751. eCollection 2018.

Abstract

The disorder distribution of protein in the compartment or organelle leads to many human diseases, including neurodegenerative diseases such as Alzheimer's disease. The prediction of protein subcellular localization play important roles in the understanding of the mechanism of protein function, pathogenes and disease therapy. This paper proposes a novel subcellular localization method by integrating the Convolutional Neural Network (CNN) and eXtreme Gradient Boosting (XGBoost), where CNN acts as a feature extractor to automatically obtain features from the original sequence information and a XGBoost classifier as a recognizer to identify the protein subcellular localization based on the output of the CNN. Experiments are implemented on three protein datasets. The results prove that the CNN-XGBoost method performs better than the general protein subcellular localization methods.

摘要

蛋白质在区室或细胞器中的无序分布会导致许多人类疾病,包括阿尔茨海默病等神经退行性疾病。蛋白质亚细胞定位的预测在理解蛋白质功能机制、发病机制和疾病治疗方面发挥着重要作用。本文提出了一种将卷积神经网络(CNN)和极端梯度提升(XGBoost)相结合的新型亚细胞定位方法,其中CNN作为特征提取器,从原始序列信息中自动获取特征,而XGBoost分类器作为识别器,根据CNN的输出识别蛋白质亚细胞定位。在三个蛋白质数据集上进行了实验。结果证明,CNN-XGBoost方法比一般的蛋白质亚细胞定位方法表现更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cef/6345701/48db8ef76e12/fgene-09-00751-g0001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验