Suppr
超能文献

DeepPred-SubMito：一种基于多通道卷积神经网络和数据集平衡处理的新型亚线粒体定位预测器。

DeepPred-SubMito: A Novel Submitochondrial Localization Predictor Based on Multi-Channel Convolutional Neural Network and Dataset Balancing Treatment.

机构信息

School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China.

出版信息

Int J Mol Sci. 2020 Aug 9;21(16):5710. doi: 10.3390/ijms21165710.

DOI:10.3390/ijms21165710

PMID:32784927

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7460811/

Abstract

Mitochondrial proteins are physiologically active in different compartments, and their abnormal location will trigger the pathogenesis of human mitochondrial pathologies. Correctly identifying submitochondrial locations can provide information for disease pathogenesis and drug design. A mitochondrion has four submitochondrial compartments, the matrix, the outer membrane, the inner membrane, and the intermembrane space, but various existing studies ignored the intermembrane space. The majority of researchers used traditional machine learning methods for predicting mitochondrial protein localization. Those predictors required expert-level knowledge of biology to be encoded as features rather than allowing the underlying predictor to extract features through a data-driven procedure. Besides, few researchers have considered the imbalance in datasets. In this paper, we propose a novel end-to-end predictor employing deep neural networks, DeepPred-SubMito, for protein submitochondrial location prediction. First, we utilize random over-sampling to decrease the influence caused by unbalanced datasets. Next, we train a multi-channel bilayer convolutional neural network for multiple subsequences to learn high-level features. Third, the prediction result is outputted through the fully connected layer. The performance of the predictor is measured by 10-fold cross-validation and 5-fold cross-validation on the SM424-18 dataset and the SubMitoPred dataset, respectively. Experimental results show that the predictor outperforms state-of-the-art predictors. In addition, the prediction of results in the M983 dataset also confirmed its effectiveness in predicting submitochondrial locations.

摘要

线粒体蛋白在不同的隔室中具有生理活性，其异常定位会引发人类线粒体疾病的发病机制。正确识别亚线粒体定位可为疾病发病机制和药物设计提供信息。线粒体有四个亚线粒体隔室，即基质、外膜、内膜和膜间空间，但现有的各种研究都忽略了膜间空间。大多数研究人员使用传统的机器学习方法来预测线粒体蛋白的定位。这些预测器需要生物学方面的专家级知识才能被编码为特征，而不是通过数据驱动的过程让底层预测器提取特征。此外，很少有研究人员考虑到数据集的不平衡问题。在本文中，我们提出了一种新颖的端到端预测器 DeepPred-SubMito，它采用深度神经网络来进行蛋白质亚线粒体定位预测。首先，我们利用随机过采样来减少不平衡数据集带来的影响。接下来，我们训练一个多通道双层卷积神经网络来对多个子序列进行学习，以提取高级特征。最后，通过全连接层输出预测结果。我们在 SM424-18 数据集和 SubMitoPred 数据集上分别进行了 10 折交叉验证和 5 折交叉验证，以衡量预测器的性能。实验结果表明，该预测器优于现有的最先进的预测器。此外，在 M983 数据集上的预测结果也证实了其在预测亚线粒体定位方面的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5bb/7460811/d998ea3be59e/ijms-21-05710-g001.jpg

相似文献

DeepPred-SubMito: A Novel Submitochondrial Localization Predictor Based on Multi-Channel Convolutional Neural Network and Dataset Balancing Treatment.

Int J Mol Sci. 2020 Aug 9;21(16):5710. doi: 10.3390/ijms21165710.

iDeepSubMito: identification of protein submitochondrial localization with deep learning.

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab288.

Predicting protein submitochondrial locations by incorporating the positional-specific physicochemical properties into Chou's general pseudo-amino acid compositions.

J Theor Biol. 2017 Mar 7;416:81-87. doi: 10.1016/j.jtbi.2016.12.026. Epub 2017 Jan 8.

DeepMito: accurate prediction of protein sub-mitochondrial localization using convolutional neural networks.

Bioinformatics. 2020 Jan 1;36(1):56-64. doi: 10.1093/bioinformatics/btz512.

SubMito-XGBoost: predicting protein submitochondrial localization by fusing multiple feature information and eXtreme gradient boosting.

Bioinformatics. 2020 Feb 15;36(4):1074-1081. doi: 10.1093/bioinformatics/btz734.

Computer-Aided Prediction of Protein Mitochondrial Localization.

Methods Mol Biol. 2021;2275:433-452. doi: 10.1007/978-1-0716-1262-0_28.

SubMito-PSPCP: predicting protein submitochondrial locations by hybridizing positional specific physicochemical properties with pseudoamino acid compositions.

Biomed Res Int. 2013;2013:263829. doi: 10.1155/2013/263829. Epub 2013 Aug 21.

Mammalian Oxa1 protein is useful for assessment of submitochondrial protein localization and mitochondrial membrane integrity.

Anal Biochem. 2010 Feb 15;397(2):250-2. doi: 10.1016/j.ab.2009.10.035. Epub 2009 Oct 23.

Protein submitochondrial localization from integrated sequence representation and SVM-based backward feature extraction.

Mol Biosyst. 2015 Jan;11(1):170-7. doi: 10.1039/c4mb00340c. Epub 2014 Oct 21.

New insights into the mechanism of precursor protein insertion into the mitochondrial membranes.

Int Rev Cell Mol Biol. 2008;268:147-90. doi: 10.1016/S1937-6448(08)00805-8.

引用本文的文献

Identification of plant vacuole proteins by using graph neural network and contact maps.

BMC Bioinformatics. 2023 Sep 22;24(1):357. doi: 10.1186/s12859-023-05475-x.

Method for Classifying Schizophrenia Patients Based on Machine Learning.

J Clin Med. 2023 Jun 29;12(13):4375. doi: 10.3390/jcm12134375.

OrganelX web server for sub-peroxisomal and sub-mitochondrial protein localization and peroxisomal target signal detection.

Comput Struct Biotechnol J. 2022 Dec 5;21:128-133. doi: 10.1016/j.csbj.2022.11.058. eCollection 2023.

Recent Advances in the Prediction of Subcellular Localization of Proteins and Related Topics.

Front Bioinform. 2022 May 19;2:910531. doi: 10.3389/fbinf.2022.910531. eCollection 2022.

Computational methods for protein localization prediction.

Comput Struct Biotechnol J. 2021 Oct 19;19:5834-5844. doi: 10.1016/j.csbj.2021.10.023. eCollection 2021.

In-Pero: Exploiting Deep Learning Embeddings of Protein Sequences to Predict the Localisation of Peroxisomal Proteins.

Int J Mol Sci. 2021 Jun 15;22(12):6409. doi: 10.3390/ijms22126409.

Predicting the 14-Day Hospital Readmission of Patients with Pneumonia Using Artificial Neural Networks (ANN).

Int J Environ Res Public Health. 2021 May 12;18(10):5110. doi: 10.3390/ijerph18105110.

Predicting Active NBA Players Most Likely to Be Inducted into the Basketball Hall of Famers Using Artificial Neural Networks in Microsoft Excel: Development and Usability Study.

Int J Environ Res Public Health. 2021 Apr 16;18(8):4256. doi: 10.3390/ijerph18084256.

本文引用的文献

SCLpred-EMS: subcellular localization prediction of endomembrane system and secretory pathway proteins by Deep N-to-1 Convolutional Neural Networks.

Bioinformatics. 2020 Jun 1;36(11):3343-3349. doi: 10.1093/bioinformatics/btaa156.

SubMito-XGBoost: predicting protein submitochondrial localization by fusing multiple feature information and eXtreme gradient boosting.

Bioinformatics. 2020 Feb 15;36(4):1074-1081. doi: 10.1093/bioinformatics/btz734.

DeepMito: accurate prediction of protein sub-mitochondrial localization using convolutional neural networks.

Bioinformatics. 2020 Jan 1;36(1):56-64. doi: 10.1093/bioinformatics/btz512.

A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer's Disease.

Front Genet. 2019 Jan 18;9:751. doi: 10.3389/fgene.2018.00751. eCollection 2018.

A systematic study of the class imbalance problem in convolutional neural networks.

Neural Netw. 2018 Oct;106:249-259. doi: 10.1016/j.neunet.2018.07.011. Epub 2018 Jul 29.

Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks.

BMC Genomics. 2018 Jul 3;19(1):511. doi: 10.1186/s12864-018-4889-1.

Predicting RNA-protein binding sites and motifs through combining local and global deep convolutional neural networks.

Bioinformatics. 2018 Oct 15;34(20):3427-3436. doi: 10.1093/bioinformatics/bty364.

Predicting protein submitochondrial locations by incorporating the pseudo-position specific scoring matrix into the general Chou's pseudo-amino acid composition.

J Theor Biol. 2018 Aug 7;450:86-103. doi: 10.1016/j.jtbi.2018.04.026. Epub 2018 Apr 18.

The lncLocator: a subcellular localization predictor for long non-coding RNAs based on a stacked ensemble classifier.

Bioinformatics. 2018 Jul 1;34(13):2185-2194. doi: 10.1093/bioinformatics/bty085.

Prediction of protein subcellular localization with oversampling approach and Chou's general PseAAC.

J Theor Biol. 2018 Jan 21;437:239-250. doi: 10.1016/j.jtbi.2017.10.030. Epub 2017 Oct 31.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

DeepPred-SubMito：一种基于多通道卷积神经网络和数据集平衡处理的新型亚线粒体定位预测器。

DeepPred-SubMito: A Novel Submitochondrial Localization Predictor Based on Multi-Channel Convolutional Neural Network and Dataset Balancing Treatment.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译