• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于网络嵌入和单类分类的功能基因和疾病基因预测。

Function-Genes and Disease-Genes Prediction Based on Network Embedding and One-Class Classification.

机构信息

College of Maritime Economics and Management, Dalian Maritime University, Dalian, 116026, China.

Institute of Environmental Systems Biology, College of Environmental Science and Engineering, Dalian Maritime University, Dalian, 116026, China.

出版信息

Interdiscip Sci. 2024 Dec;16(4):781-801. doi: 10.1007/s12539-024-00638-7. Epub 2024 Sep 4.

DOI:10.1007/s12539-024-00638-7
PMID:39230798
Abstract

Using genes which have been experimentally-validated for diseases (functions) can develop machine learning methods to predict new disease/function-genes. However, the prediction of both function-genes and disease-genes faces the same problem: there are only certain positive examples, but no negative examples. To solve this problem, we proposed a function/disease-genes prediction algorithm based on network embedding (Variational Graph Auto-Encoders, VGAE) and one-class classification (Fast Minimum Covariance Determinant, Fast-MCD): VGAEMCD. Firstly, we constructed a protein-protein interaction (PPI) network centered on experimentally-validated genes; then VGAE was used to get the embeddings of nodes (genes) in the network; finally, the embeddings were input into the improved deep learning one-class classifier based on Fast-MCD to predict function/disease-genes. VGAEMCD can predict function-gene and disease-gene in a unified way, and only the experimentally-verified genes are needed to provide (no need for expression profile). VGAEMCD outperforms classical one-class classification algorithms in Recall, Precision, F-measure, Specificity, and Accuracy. Further experiments show that seven metrics of VGAEMCD are higher than those of state-of-art function/disease-genes prediction algorithms. The above results indicate that VGAEMCD can well learn the distribution characteristics of positive examples and accurately identify function/disease-genes.

摘要

利用经过实验验证的与疾病(功能)相关的基因,可以开发机器学习方法来预测新的疾病/功能基因。然而,功能基因和疾病基因的预测都面临着一个相同的问题:只有某些阳性例子,但没有阴性例子。为了解决这个问题,我们提出了一种基于网络嵌入(变分图自动编码器,VGAE)和单类分类(快速最小协方差确定,Fast-MCD)的功能/疾病基因预测算法:VGAEMCD。首先,我们构建了一个以实验验证基因为中心的蛋白质-蛋白质相互作用(PPI)网络;然后使用 VGAE 获得网络中节点(基因)的嵌入;最后,将嵌入输入到基于 Fast-MCD 的改进深度学习单类分类器中,以预测功能/疾病基因。VGAEMCD 可以统一地预测功能基因和疾病基因,只需要提供经过实验验证的基因(不需要表达谱)。VGAEMCD 在召回率、精度、F 值、特异性和准确性方面均优于经典的单类分类算法。进一步的实验表明,VGAEMCD 的七个指标均高于最先进的功能/疾病基因预测算法。上述结果表明,VGAEMCD 可以很好地学习阳性例子的分布特征,并准确地识别功能/疾病基因。

相似文献

1
Function-Genes and Disease-Genes Prediction Based on Network Embedding and One-Class Classification.基于网络嵌入和单类分类的功能基因和疾病基因预测。
Interdiscip Sci. 2024 Dec;16(4):781-801. doi: 10.1007/s12539-024-00638-7. Epub 2024 Sep 4.
2
DeepHE: Accurately predicting human essential genes based on deep learning.DeepHE:基于深度学习的人类必需基因精准预测。
PLoS Comput Biol. 2020 Sep 16;16(9):e1008229. doi: 10.1371/journal.pcbi.1008229. eCollection 2020 Sep.
3
Variational graph auto-encoders for miRNA-disease association prediction.基于变分图自编码器的 miRNA-疾病关联预测。
Methods. 2021 Aug;192:25-34. doi: 10.1016/j.ymeth.2020.08.004. Epub 2020 Aug 13.
4
Predicting drug-drug interactions using multi-modal deep auto-encoders based network embedding and positive-unlabeled learning.基于多模态深度自动编码器的网络嵌入和正无标签学习预测药物-药物相互作用。
Methods. 2020 Jul 1;179:37-46. doi: 10.1016/j.ymeth.2020.05.007. Epub 2020 Jun 1.
5
Graph-based prediction of Protein-protein interactions with attributed signed graph embedding.基于属性有向图嵌入的蛋白质-蛋白质相互作用的图预测。
BMC Bioinformatics. 2020 Jul 21;21(1):323. doi: 10.1186/s12859-020-03646-8.
6
Using PPI network autocorrelation in hierarchical multi-label classification trees for gene function prediction.利用 PPI 网络自相关性在层次多标签分类树中进行基因功能预测。
BMC Bioinformatics. 2013 Sep 26;14:285. doi: 10.1186/1471-2105-14-285.
7
Integrating node embeddings and biological annotations for genes to predict disease-gene associations.整合基因的节点嵌入和生物学注释以预测疾病-基因关联。
BMC Syst Biol. 2018 Dec 31;12(Suppl 9):138. doi: 10.1186/s12918-018-0662-y.
8
Graph-DTI: A New Model for Drug-target Interaction Prediction Based on Heterogenous Network Graph Embedding.图-DTI:一种基于异质网络图嵌入的新药靶相互作用预测新模型。
Curr Comput Aided Drug Des. 2024;20(6):1013-1024. doi: 10.2174/1573409919666230713142255.
9
PmDNE: Prediction of miRNA-Disease Association Based on Network Embedding and Network Similarity Analysis.基于网络嵌入和网络相似性分析的 miRNA-疾病关联预测(PmDNE)
Biomed Res Int. 2020 Dec 7;2020:6248686. doi: 10.1155/2020/6248686. eCollection 2020.
10
A deep learning framework for predicting disease-gene associations with functional modules and graph augmentation.一种基于功能模块和图增强的深度学习框架,用于预测疾病-基因关联。
BMC Bioinformatics. 2024 Jun 14;25(1):214. doi: 10.1186/s12859-024-05841-3.

引用本文的文献

1
Enhancing Molecular Network-Based Cancer Driver Gene Prediction Using Machine Learning Approaches: Current Challenges and Opportunities.使用机器学习方法增强基于分子网络的癌症驱动基因预测:当前挑战与机遇
J Cell Mol Med. 2025 Jan;29(1):e70351. doi: 10.1111/jcmm.70351.

本文引用的文献

1
Radiation-induced Cell Death and Its Mechanisms.辐射诱导的细胞死亡及其机制。
Health Phys. 2022 Nov 1;123(5):376-386. doi: 10.1097/HP.0000000000001601. Epub 2022 Sep 6.
2
DeepGOZero: improving protein function prediction from sequence and zero-shot learning based on ontology axioms.DeepGOZero:基于本体论公理的序列和零样本学习改进蛋白质功能预测。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i238-i245. doi: 10.1093/bioinformatics/btac256.
3
Integrated weighted gene coexpression network analysis identifies Frizzled 2 (FZD2) as a key gene in invasive malignant pleomorphic adenoma.
整合加权基因共表达网络分析鉴定卷曲蛋白 2(FZD2)为侵袭性恶性多形性腺瘤的关键基因。
J Transl Med. 2022 Jan 5;20(1):15. doi: 10.1186/s12967-021-03204-7.
4
The Potential of Physical Exercise to Mitigate Radiation Damage-A Systematic Review.体育锻炼减轻辐射损伤的潜力——一项系统综述
Front Med (Lausanne). 2021 Apr 29;8:585483. doi: 10.3389/fmed.2021.585483. eCollection 2021.
5
Ionizing radiation and toll like receptors: A systematic review article.电离辐射与 Toll 样受体:系统评价文章。
Hum Immunol. 2021 Jun;82(6):446-454. doi: 10.1016/j.humimm.2021.03.008. Epub 2021 Mar 31.
6
Identification of hub genes and biological pathways in hepatocellular carcinoma by integrated bioinformatics analysis.通过综合生物信息学分析鉴定肝细胞癌中的枢纽基因和生物学通路
PeerJ. 2021 Jan 19;9:e10594. doi: 10.7717/peerj.10594. eCollection 2021.
7
Pathogenic gene prediction based on network embedding.基于网络嵌入的致病基因预测。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa353.
8
The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets.2021 年的 STRING 数据库:可定制的蛋白质-蛋白质网络,以及用户上传的基因/测量集的功能特征分析。
Nucleic Acids Res. 2021 Jan 8;49(D1):D605-D612. doi: 10.1093/nar/gkaa1074.
9
Graph Convolutional Autoencoder and Generative Adversarial Network-Based Method for Predicting Drug-Target Interactions.基于图卷积自动编码器和生成对抗网络的药物-靶标相互作用预测方法。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):455-464. doi: 10.1109/TCBB.2020.2999084. Epub 2022 Feb 3.
10
RadAtlas 1.0: a knowledgebase focusing on radiation-associated genes.RadAtlas 1.0:一个专注于辐射相关基因的知识库。
Int J Radiat Biol. 2020 Aug;96(8):980-987. doi: 10.1080/09553002.2020.1761567. Epub 2020 May 12.