PredAmyl-MLP：使用多层感知机预测淀粉样蛋白

PredAmyl-MLP: Prediction of Amyloid Proteins Using Multilayer Perceptron.

机构信息

College of Information and Computer Engineering, Northeast Forestry University, Harbin 150040, China.

College of Computer Science and Technology, Harbin Institute of Technology, Harbin 150040, China.

出版信息

Comput Math Methods Med. 2020 Nov 20;2020:8845133. doi: 10.1155/2020/8845133. eCollection 2020.

DOI:10.1155/2020/8845133

PMID:33294004

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7700051/

Abstract

Amyloid is generally an aggregate of insoluble fibrin; its abnormal deposition is the pathogenic mechanism of various diseases, such as Alzheimer's disease and type II diabetes. Therefore, accurately identifying amyloid is necessary to understand its role in pathology. We proposed a machine learning-based prediction model called PredAmyl-MLP, which consists of the following three steps: feature extraction, feature selection, and classification. In the step of feature extraction, seven feature extraction algorithms and different combinations of them are investigated, and the combination of SVMProt-188D and tripeptide composition (TPC) is selected according to the experimental results. In the step of feature selection, maximum relevant maximum distance (MRMD) and binomial distribution (BD) are, respectively, used to remove the redundant or noise features, and the appropriate features are selected according to the experimental results. In the step of classification, we employed multilayer perceptron (MLP) to train the prediction model. The 10-fold cross-validation results show that the overall accuracy of PredAmyl-MLP reached 91.59%, and the performance was better than the existing methods.

摘要

淀粉样蛋白通常是不溶性纤维蛋白的聚集物；其异常沉积是各种疾病（如阿尔茨海默病和 2 型糖尿病）的发病机制。因此，准确识别淀粉样蛋白对于了解其在病理学中的作用是必要的。我们提出了一种基于机器学习的预测模型，称为 PredAmyl-MLP，它由以下三个步骤组成：特征提取、特征选择和分类。在特征提取步骤中，研究了七种特征提取算法及其不同组合，并根据实验结果选择了 SVMProt-188D 和三肽组成（TPC）的组合。在特征选择步骤中，分别使用最大相关最大距离（MRMD）和二项式分布（BD）来去除冗余或噪声特征，并根据实验结果选择适当的特征。在分类步骤中，我们采用多层感知器（MLP）来训练预测模型。10 折交叉验证结果表明，PredAmyl-MLP 的总体准确率达到 91.59%，性能优于现有方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b259/7700051/366f6a64524b/CMMM2020-8845133.001.jpg

相似文献

PredAmyl-MLP: Prediction of Amyloid Proteins Using Multilayer Perceptron.PredAmyl-MLP：使用多层感知机预测淀粉样蛋白

Comput Math Methods Med. 2020 Nov 20;2020:8845133. doi: 10.1155/2020/8845133. eCollection 2020.

RFAmyloid: A Web Server for Predicting Amyloid Proteins.RFAmyloid：用于预测淀粉样蛋白的网络服务器。

Int J Mol Sci. 2018 Jul 16;19(7):2071. doi: 10.3390/ijms19072071.

Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.将机器学习中的手工特征与潜在变量相结合，以预测放射性肺损伤。

Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.

ReRF-Pred: predicting amyloidogenic regions of proteins based on their pseudo amino acid composition and tripeptide composition.ReRF-Pred：基于蛋白质的伪氨基酸组成和三肽组成预测蛋白质的淀粉样纤维形成区域。

BMC Bioinformatics. 2021 Nov 9;22(1):545. doi: 10.1186/s12859-021-04446-4.

AMYPred-FRL is a novel approach for accurate prediction of amyloid proteins by using feature representation learning.AMYPred-FRL 是一种通过使用特征表示学习来准确预测淀粉样蛋白的新方法。

Sci Rep. 2022 May 11;12(1):7697. doi: 10.1038/s41598-022-11897-z.

ECAmyloid: An amyloid predictor based on ensemble learning and comprehensive sequence-derived features.ECAmyloid：一种基于集成学习和综合序列衍生特征的淀粉样蛋白预测器。

Comput Biol Chem. 2023 Jun;104:107853. doi: 10.1016/j.compbiolchem.2023.107853. Epub 2023 Mar 23.

EAGA-MLP-An Enhanced and Adaptive Hybrid Classification Model for Diabetes Diagnosis.EAGA-MLP：一种用于糖尿病诊断的增强型自适应混合分类模型。

Sensors (Basel). 2020 Jul 20;20(14):4036. doi: 10.3390/s20144036.

A PCA aided cross-covariance scheme for discriminative feature extraction from EEG signals.基于主成分分析的脑电信号判别特征提取的互协方差方法。

Comput Methods Programs Biomed. 2017 Jul;146:47-57. doi: 10.1016/j.cmpb.2017.05.009. Epub 2017 May 24.

Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods.基于转录组谱特征选择和机器学习方法的乳腺癌预测。

BMC Bioinformatics. 2022 Oct 1;23(1):410. doi: 10.1186/s12859-022-04965-8.

Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用：以新生儿呼吸暂停预测为例的研究

Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.

引用本文的文献

Machine learning-based multiparametric MRI radiomics nomogram for predicting WHO/ISUP nuclear grading of clear cell renal cell carcinoma.基于机器学习的多参数MRI影像组学列线图预测透明细胞肾细胞癌的WHO/ISUP核分级

Front Oncol. 2024 Nov 7;14:1467775. doi: 10.3389/fonc.2024.1467775. eCollection 2024.

Deep learning in structural bioinformatics: current applications and future perspectives.结构生物信息学中的深度学习：当前应用与未来展望。

Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae042.

Advanced computational approaches to understand protein aggregation.用于理解蛋白质聚集的先进计算方法。

Biophys Rev (Melville). 2024 Apr 24;5(2):021302. doi: 10.1063/5.0180691. eCollection 2024 Jun.

NRPreTo: A Machine Learning-Based Nuclear Receptor and Subfamily Prediction Tool.NRPreTo：一种基于机器学习的核受体和亚家族预测工具。

ACS Omega. 2023 May 30;8(23):20379-20388. doi: 10.1021/acsomega.3c00286. eCollection 2023 Jun 13.

Machine Learning Approaches in Diagnosis, Prognosis and Treatment Selection of Cardiac Amyloidosis.机器学习在心脏淀粉样变的诊断、预后和治疗选择中的应用。

Int J Mol Sci. 2023 Mar 16;24(6):5680. doi: 10.3390/ijms24065680.

ENTAIL: yEt aNoTher amyloid fIbrils cLassifier.又一个淀粉样纤维分类器。

BMC Bioinformatics. 2022 Dec 1;23(1):517. doi: 10.1186/s12859-022-05070-6.

Sci Rep. 2022 May 11;12(1):7697. doi: 10.1038/s41598-022-11897-z.

Pseudo-188D: Phage Protein Prediction Based on a Model of Pseudo-188D.伪188D：基于伪188D模型的噬菌体蛋白质预测

Front Genet. 2021 Dec 1;12:796327. doi: 10.3389/fgene.2021.796327. eCollection 2021.

BMC Bioinformatics. 2021 Nov 9;22(1):545. doi: 10.1186/s12859-021-04446-4.

本文引用的文献

Memristive Circuit Implementation of Biological Nonassociative Learning Mechanism and Its Applications.忆阻电路实现生物非联想学习机制及其应用。

IEEE Trans Biomed Circuits Syst. 2020 Oct;14(5):1036-1050. doi: 10.1109/TBCAS.2020.3018777. Epub 2020 Aug 24.

DeepAVP: A Dual-Channel Deep Neural Network for Identifying Variable-Length Antiviral Peptides.深 AV 肽：一种用于识别可变长度抗病毒肽的双通道深度神经网络。

IEEE J Biomed Health Inform. 2020 Oct;24(10):3012-3019. doi: 10.1109/JBHI.2020.2977091. Epub 2020 Feb 28.

Design powerful predictor for mRNA subcellular location prediction in Homo sapiens.设计用于预测人类 mRNA 亚细胞定位的强大预测器。

Brief Bioinform. 2021 Jan 18;22(1):526-535. doi: 10.1093/bib/bbz177.

Diffuse Hepatosplenic 99mTc-Pyrophosphate Activity Caused by Amyloidosis.弥漫性肝脾 99mTc-焦磷酸盐活性所致淀粉样变性。

Clin Nucl Med. 2020 Mar;45(3):246-247. doi: 10.1097/RLU.0000000000002877.

Network-based prediction of drug-target interactions using an arbitrary-order proximity embedded deep forest.基于任意阶近邻嵌入深度森林的药物-靶标相互作用的网络预测。

Bioinformatics. 2020 May 1;36(9):2805-2812. doi: 10.1093/bioinformatics/btaa010.

An Overview on Predicting Protein Subchloroplast Localization by using Machine Learning Methods.基于机器学习方法预测蛋白亚叶绿体定位的研究综述。

Curr Protein Pept Sci. 2020;21(12):1229-1241. doi: 10.2174/1389203721666200117153412.

Identification of Highest-Affinity Binding Sites of Yeast Transcription Factor Families.鉴定酵母转录因子家族的高亲和力结合位点。

J Chem Inf Model. 2020 Mar 23;60(3):1876-1883. doi: 10.1021/acs.jcim.9b01012. Epub 2020 Jan 28.

DTI-CDF: a cascade deep forest model towards the prediction of drug-target interactions based on hybrid features.DTI-CDF：一种基于混合特征的药物-靶标相互作用预测的级联深度森林模型。

Brief Bioinform. 2021 Jan 18;22(1):451-462. doi: 10.1093/bib/bbz152.

The application of machine learning to disease diagnosis and treatment.机器学习在疾病诊断与治疗中的应用。

Math Biosci. 2020 Feb;320:108305. doi: 10.1016/j.mbs.2019.108305. Epub 2019 Dec 16.

SGL-SVM: A novel method for tumor classification via support vector machine with sparse group Lasso.SGL-SVM：一种通过带稀疏组套索的支持向量机进行肿瘤分类的新方法。

J Theor Biol. 2020 Feb 7;486:110098. doi: 10.1016/j.jtbi.2019.110098. Epub 2019 Nov 28.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

PredAmyl-MLP：使用多层感知机预测淀粉样蛋白

PredAmyl-MLP: Prediction of Amyloid Proteins Using Multilayer Perceptron.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献