iAMY-RECMFF：利用残基对能量含量矩阵和特征融合算法识别淀粉样肽。

iAMY-RECMFF: Identifying amyloidgenic peptides by using residue pairwise energy content matrix and features fusion algorithm.

机构信息

School of Communications and Electronics Jiangxi, Science and Technology Normal University, Nanchang 330013, P. R. China.

Jiangxi Engineering Research Center of Unattended Perception System and Artificial Intelligence Technology Jiangxi Science and Technology Normal University, Jiangxi 330088, P. R. China.

出版信息

J Bioinform Comput Biol. 2023 Oct;21(5):2350023. doi: 10.1142/S0219720023500233. Epub 2023 Oct 27.

DOI:10.1142/S0219720023500233

PMID:37899353

Abstract

Various diseases, including Huntington's disease, Alzheimer's disease, and Parkinson's disease, have been reported to be linked to amyloid. Therefore, it is crucial to distinguish amyloid from non-amyloid proteins or peptides. While experimental approaches are typically preferred, they are costly and time-consuming. In this study, we have developed a machine learning framework called iAMY-RECMFF to discriminate amyloidgenic from non-amyloidgenic peptides. In our model, we first encoded the peptide sequences using the residue pairwise energy content matrix. We then utilized Pearson's correlation coefficient and distance correlation to extract useful information from this matrix. Additionally, we employed an improved similarity network fusion algorithm to integrate features from different perspectives. The Fisher approach was adopted to select the optimal feature subset. Finally, the selected features were inputted into a support vector machine for identifying amyloidgenic peptides. Experimental results demonstrate that our proposed method significantly improves the identification of amyloidgenic peptides compared to existing predictors. This suggests that our method may serve as a powerful tool in identifying amyloidgenic peptides. To facilitate academic use, the dataset and codes used in the current study are accessible at https://figshare.com/articles/online_resource/iAMY-RECMFF/22816916.

摘要

多种疾病，包括亨廷顿氏病、阿尔茨海默病和帕金森病，都被报道与淀粉样蛋白有关。因此，区分淀粉样蛋白和非淀粉样蛋白或肽至关重要。虽然实验方法通常是首选，但它们既昂贵又耗时。在这项研究中，我们开发了一种称为 iAMY-RECMFF 的机器学习框架，用于区分致淀粉样的和非致淀粉样的肽。在我们的模型中，我们首先使用残基对能量含量矩阵对肽序列进行编码。然后，我们利用皮尔逊相关系数和距离相关从该矩阵中提取有用信息。此外，我们采用改进的相似网络融合算法从不同角度整合特征。采用 Fisher 方法选择最优特征子集。最后，将选择的特征输入支持向量机以识别致淀粉样的肽。实验结果表明，与现有预测器相比，我们提出的方法显著提高了致淀粉样肽的识别能力。这表明我们的方法可能成为识别致淀粉样肽的有力工具。为了便于学术使用，本研究中使用的数据集和代码可在 https://figshare.com/articles/online_resource/iAMY-RECMFF/22816916 上获取。

相似文献

iAMY-RECMFF: Identifying amyloidgenic peptides by using residue pairwise energy content matrix and features fusion algorithm.iAMY-RECMFF：利用残基对能量含量矩阵和特征融合算法识别淀粉样肽。

J Bioinform Comput Biol. 2023 Oct;21(5):2350023. doi: 10.1142/S0219720023500233. Epub 2023 Oct 27.

Integrating multiple sequence features for identifying anticancer peptides.整合多种序列特征以识别抗癌肽。

Comput Biol Chem. 2022 Aug;99:107711. doi: 10.1016/j.compbiolchem.2022.107711. Epub 2022 Jun 1.

Identification of tumor homing peptides by utilizing hybrid feature representation.利用混合特征表示法鉴定肿瘤归巢肽。

J Biomol Struct Dyn. 2023 May;41(8):3405-3412. doi: 10.1080/07391102.2022.2049368. Epub 2022 Mar 9.

iTTCA-MFF: identifying tumor T cell antigens based on multiple feature fusion.iTTCA-MFF：基于多特征融合的肿瘤 T 细胞抗原识别。

Immunogenetics. 2022 Oct;74(5):447-454. doi: 10.1007/s00251-022-01258-5. Epub 2022 Mar 5.

Integrating temporal and spatial variabilities for identifying ion binding proteins in phage.整合时空变异性以鉴定噬菌体中的离子结合蛋白。

J Bioinform Comput Biol. 2023 Jun;21(3):2350010. doi: 10.1142/S0219720023500105. Epub 2023 Jun 15.

m7G-DPP: Identifying N7-methylguanosine sites based on dinucleotide physicochemical properties of RNA.m7G-DPP：基于RNA二核苷酸理化性质识别N7-甲基鸟苷位点。

Biophys Chem. 2021 Dec;279:106697. doi: 10.1016/j.bpc.2021.106697. Epub 2021 Oct 5.

iAMY-SCM: Improved prediction and analysis of amyloid proteins using a scoring card method with propensity scores of dipeptides.iAMY-SCM：使用具有二肽倾向得分的评分卡方法改进淀粉样蛋白的预测与分析

Genomics. 2021 Jan;113(1 Pt 2):689-698. doi: 10.1016/j.ygeno.2020.09.065. Epub 2020 Oct 2.

UMPred-FRL: A New Approach for Accurate Prediction of Umami Peptides Using Feature Representation Learning.UMPred-FRL：一种使用特征表示学习准确预测鲜味肽的新方法。

Int J Mol Sci. 2021 Dec 4;22(23):13124. doi: 10.3390/ijms222313124.

Integrating Low-Order and High-Order Correlation Information for Identifying Phage Virion Proteins.整合低阶和高阶相关信息以鉴定噬菌体病毒粒子蛋白

J Comput Biol. 2023 Oct;30(10):1131-1143. doi: 10.1089/cmb.2022.0237. Epub 2023 Sep 20.

Prediction of anti-inflammatory proteins/peptides: an insilico approach.抗炎蛋白/肽的预测：一种计算机模拟方法。

J Transl Med. 2017 Jan 6;15(1):7. doi: 10.1186/s12967-016-1103-6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验