PredinID: Predicting Pathogenic Inframe Indels in Human Through Graph Convolution Neural Network With Graph Sampling Technique.

Authors

Yue Zhenyu, Xiang Ying, Chen Guojun, Wang Xiaosong, Li Ke, Zhang Youhua

Publication information

IEEE/ACM Trans Comput Biol Bioinform. 2023 Sep-Oct;20(5):3226-3233. doi: 10.1109/TCBB.2023.3266232. Epub 2023 Oct 9.

DOI:10.1109/TCBB.2023.3266232

Abstract

Inframe insertion/deletion (indel) variants can alter protein sequence and function and are associated with a wide variety of diseases. Although recent research has examined the associations between inframe indels and disease, modeling indels in silico and interpreting their pathogenicity remain challenging, mainly because experimental data and computational methods are lacking. In this article, we propose PredinID (Predictor for inframe InDels), a computational method based on a graph convolutional network (GCN). PredinID treats pathogenic inframe indel prediction as a node classification task and uses the k-nearest neighbor algorithm to construct a feature graph over which more informative representations are aggregated. An edge-based sampling strategy extracts information from both the potential connections in feature space and the topological structure of subgraphs. In 5-fold cross-validation, PredinID outperforms four classic machine learning algorithms and two GCN methods. On an independent test set, it also outperforms state-of-the-art methods. We provide a web server at http://predinid.bio.aielab.cc/ to facilitate use of the model.
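The pipeline described in the abstract (a k-nearest-neighbor feature graph, edge-based subgraph sampling, and graph convolution over nodes) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the distance metric, the value of k, the sampling scheme, or the network architecture, so the Euclidean metric, uniform edge sampling, single ReLU layer, toy features, untrained weights, and all function names below are assumptions chosen purely for illustration.

```python
import math
import random


def knn_edges(features, k):
    """Undirected edges linking each node to its k nearest neighbours
    (Euclidean distance is an assumption, not taken from the paper)."""
    edges = set()
    for i, x in enumerate(features):
        dists = sorted(
            (math.dist(x, y), j) for j, y in enumerate(features) if j != i
        )
        for _, j in dists[:k]:
            edges.add((min(i, j), max(i, j)))
    return edges


def sample_edge_subgraph(edges, n_edges, rng):
    """Edge-based sampling (assumed uniform): draw edges, keep the nodes they touch."""
    picked = rng.sample(sorted(edges), min(n_edges, len(edges)))
    nodes = sorted({v for e in picked for v in e})
    return nodes, picked


def normalized_adjacency(n, edges):
    """D^{-1/2} (A + I) D^{-1/2}, the propagation matrix of a standard GCN layer."""
    a = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for i, j in edges:
        a[i][j] = a[j][i] = 1.0
    deg = [sum(row) for row in a]
    return [[a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]


def matmul(p, q):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*q)]
            for row in p]


def gcn_layer(a_hat, h, w):
    """One graph-convolution layer: ReLU(A_hat @ H @ W)."""
    return [[max(0.0, v) for v in row] for row in matmul(matmul(a_hat, h), w)]


# Toy data: six variants with two made-up features each (two tight clusters).
features = [[0.0, 0.1], [0.1, 0.0], [0.2, 0.1],
            [5.0, 5.1], [5.1, 5.0], [5.2, 5.1]]
edges = knn_edges(features, k=2)

# Sample a subgraph by edges and relabel its nodes to 0..m-1.
rng = random.Random(0)
nodes, sub_edges = sample_edge_subgraph(edges, n_edges=3, rng=rng)
index = {v: i for i, v in enumerate(nodes)}
relabeled = {(index[u], index[v]) for u, v in sub_edges}

# Propagate node features once over the sampled subgraph.
a_hat = normalized_adjacency(len(nodes), relabeled)
h = [features[v] for v in nodes]
w = [[0.5, -0.5], [0.25, 0.75]]  # arbitrary, untrained weights
out = gcn_layer(a_hat, h, w)
```

In a trained model, the per-node outputs would feed a classifier that labels each indel as pathogenic or benign; here `out` only shows how neighbor information is mixed into each node's representation.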

Similar articles

GRA-GCN: Dense Granule Protein Prediction in Apicomplexa Protozoa Through Graph Convolutional Network.

IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):1963-1970. doi: 10.1109/TCBB.2022.3224836. Epub 2023 Jun 5.

SHINE: protein language model-based pathogenicity prediction for short inframe insertion and deletion variants.

Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac584.

Identifying the Impact of Inframe Insertions and Deletions on Protein Function in Cancer.

J Comput Biol. 2020 May;27(5):786-795. doi: 10.1089/cmb.2018.0192. Epub 2019 Aug 28.

Exploring the role of edge distribution in graph convolutional networks.

Neural Netw. 2023 Nov;168:459-470. doi: 10.1016/j.neunet.2023.09.048. Epub 2023 Oct 4.

A Convolutional Neural Network and Graph Convolutional Network Based Framework for Classification of Breast Histopathological Images.

IEEE J Biomed Health Inform. 2022 Jul;26(7):3163-3173. doi: 10.1109/JBHI.2022.3153671. Epub 2022 Jul 1.

MAMF-GCN: Multi-scale adaptive multi-channel fusion deep graph convolutional network for predicting mental disorder.

Comput Biol Med. 2022 Sep;148:105823. doi: 10.1016/j.compbiomed.2022.105823. Epub 2022 Jul 6.

MDA-GCNFTG: identifying miRNA-disease associations based on graph convolutional networks via graph sampling through the feature and topology graph.

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab165.

Feature-Attention Graph Convolutional Networks for Noise Resilient Learning.

IEEE Trans Cybern. 2022 Aug;52(8):7719-7731. doi: 10.1109/TCYB.2022.3143798. Epub 2022 Jul 19.

MAGCN: A Multiple Attention Graph Convolution Networks for Predicting Synthetic Lethality.

IEEE/ACM Trans Comput Biol Bioinform. 2023 Sep-Oct;20(5):2681-2689. doi: 10.1109/TCBB.2022.3221736. Epub 2023 Oct 9.

Cited by

Prediction of human pathogenic start loss variants based on self-supervised contrastive learning.

BMC Biol. 2025 Aug 8;23(1):250. doi: 10.1186/s12915-025-02348-y.
