Suppr超能文献

多分支卷积神经网络:使用多分支卷积神经网络对离子通道相互作用肽进行分类。

Multi-Branch-CNN: Classification of ion channel interacting peptides using multi-branch convolutional neural network.

机构信息

PAMI Research Group, Department of Computer and Information Science, University of Macau, Taipa, Macao Special Administrative Region of China.

PAMI Research Group, Department of Computer and Information Science, University of Macau, Taipa, Macao Special Administrative Region of China.

出版信息

Comput Biol Med. 2022 Aug;147:105717. doi: 10.1016/j.compbiomed.2022.105717. Epub 2022 Jun 8.

Abstract

Ligand peptides that have high affinity for ion channels are critical for regulating ion flux across the plasma membrane. These peptides are now being considered as potential drug candidates for many diseases, such as cardiovascular disease and cancers. In this work, we developed Multi-Branch-CNN, a CNN method with multiple input branches for identifying three types of ion channel peptide binders (sodium, potassium, and calcium) from intra- and inter-feature types. As for its real-world applications, prediction models that are able to recognize novel sequences having high or low similarities to training sequences are required. To this end, we tested our models on two test sets: a general test set including sequences spanning different similarity levels to those of the training set, and a novel-test set consisting of only sequences that bear little resemblance to sequences from the training set. Our experiments showed that the Multi-Branch-CNN method performs better than thirteen traditional ML algorithms (TML13), yielding an improvement in accuracy of 3.2%, 1.2%, and 2.3% on the test sets as well as 8.8%, 14.3%, and 14.6% on the novel-test sets for sodium, potassium, and calcium ion channels, respectively. We confirmed the effectiveness of Multi-Branch-CNN by comparing it to the standard CNN method with one input branch (Single-Branch-CNN) and an ensemble method (TML13-Stack). The data sets, script files to reproduce the experiments, and the final predictive models are freely available at https://github.com/jieluyan/Multi-Branch-CNN.

摘要

配体肽与离子通道具有高亲和力,对于调节质膜两侧的离子通量至关重要。这些肽现在被认为是许多疾病(如心血管疾病和癌症)的潜在药物候选物。在这项工作中,我们开发了 Multi-Branch-CNN,这是一种具有多个输入分支的 CNN 方法,用于从内部分支和特征类型之间识别三种类型的离子通道肽结合物(钠、钾和钙)。就其实际应用而言,需要能够识别与训练序列具有高或低相似性的新序列的预测模型。为此,我们在两个测试集上测试了我们的模型:一个包含与训练集序列跨度不同相似度水平的序列的通用测试集,以及一个仅由与训练集序列相似性较小的序列组成的新型测试集。我们的实验表明,Multi-Branch-CNN 方法比十三种传统机器学习算法(TML13)表现更好,在钠、钾和钙离子通道的测试集上分别提高了 3.2%、1.2%和 2.3%的准确率,在新型测试集上提高了 8.8%、14.3%和 14.6%的准确率。我们通过将其与具有一个输入分支的标准 CNN 方法(Single-Branch-CNN)和集成方法(TML13-Stack)进行比较,证实了 Multi-Branch-CNN 的有效性。数据集、可重现实验的脚本文件和最终的预测模型可在 https://github.com/jieluyan/Multi-Branch-CNN 上免费获得。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验