• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于节点核的疾病基因优先级推断的异质网络整合。

Heterogeneous networks integration for disease-gene prioritization with node kernels.

机构信息

Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg im Breisgau, Germany.

Department of Mathematics, University of Padova, Padua, Italy.

出版信息

Bioinformatics. 2020 May 1;36(9):2649-2656. doi: 10.1093/bioinformatics/btaa008.

DOI:10.1093/bioinformatics/btaa008
PMID:31990289
Abstract

MOTIVATION

The identification of disease-gene associations is a task of fundamental importance in human health research. A typical approach consists in first encoding large gene/protein relational datasets as networks due to the natural and intuitive property of graphs for representing objects' relationships and then utilizing graph-based techniques to prioritize genes for successive low-throughput validation assays. Since different types of interactions between genes yield distinct gene networks, there is the need to integrate different heterogeneous sources to improve the reliability of prioritization systems.

RESULTS

We propose an approach based on three phases: first, we merge all sources in a single network, then we partition the integrated network according to edge density introducing a notion of edge type to distinguish the parts and finally, we employ a novel node kernel suitable for graphs with typed edges. We show how the node kernel can generate a large number of discriminative features that can be efficiently processed by linear regularized machine learning classifiers. We report state-of-the-art results on 12 disease-gene associations and on a time-stamped benchmark containing 42 newly discovered associations.

AVAILABILITY AND IMPLEMENTATION

Source code: https://github.com/dinhinfotech/DiGI.git.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

鉴定疾病-基因关联是人类健康研究中一项至关重要的任务。一种典型的方法是首先将大型基因/蛋白质关系数据集编码为网络,因为图具有表示对象关系的自然和直观性质,然后利用基于图的技术对基因进行优先级排序,以便进行后续的低通量验证实验。由于基因之间的不同类型的相互作用产生不同的基因网络,因此需要整合不同的异构源以提高优先级排序系统的可靠性。

结果

我们提出了一种基于三个阶段的方法:首先,我们将所有来源合并到一个单一的网络中,然后根据边缘密度将集成网络划分为不同的部分,引入边缘类型的概念来区分这些部分,最后,我们采用一种新的节点核函数,适用于具有类型化边缘的图。我们展示了节点核函数如何生成大量可通过线性正则化机器学习分类器有效处理的有区别的特征。我们在 12 个疾病-基因关联和一个包含 42 个新发现关联的时间戳基准上报告了最先进的结果。

可用性和实现

源代码:https://github.com/dinhinfotech/DiGI.git。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
Heterogeneous networks integration for disease-gene prioritization with node kernels.基于节点核的疾病基因优先级推断的异质网络整合。
Bioinformatics. 2020 May 1;36(9):2649-2656. doi: 10.1093/bioinformatics/btaa008.
2
Classification in biological networks with hypergraphlet kernels.基于超图节点核的生物网络分类。
Bioinformatics. 2021 May 17;37(7):1000-1007. doi: 10.1093/bioinformatics/btaa768.
3
Ensembling graph attention networks for human microbe-drug association prediction.基于图注意力网络的微生物-药物关联预测方法
Bioinformatics. 2020 Dec 30;36(Suppl_2):i779-i786. doi: 10.1093/bioinformatics/btaa891.
4
scSGL: kernelized signed graph learning for single-cell gene regulatory network inference.scSGL:用于单细胞基因调控网络推断的核化有符号图学习。
Bioinformatics. 2022 May 26;38(11):3011-3019. doi: 10.1093/bioinformatics/btac288.
5
Dual-dropout graph convolutional network for predicting synthetic lethality in human cancers.双失活图卷积网络预测人类癌症中的合成致死性。
Bioinformatics. 2020 Aug 15;36(16):4458-4465. doi: 10.1093/bioinformatics/btaa211.
6
NIAPU: network-informed adaptive positive-unlabeled learning for disease gene identification.NIAPU:用于疾病基因识别的基于网络信息的自适应阳性无标签学习。
Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btac848.
7
Random walk with restart on multiplex and heterogeneous biological networks.重连随机游走在多重和异质生物网络中的应用。
Bioinformatics. 2019 Feb 1;35(3):497-505. doi: 10.1093/bioinformatics/bty637.
8
Semi-supervised network inference using simulated gene expression dynamics.基于模拟基因表达动力学的半监督网络推断。
Bioinformatics. 2018 Apr 1;34(7):1148-1156. doi: 10.1093/bioinformatics/btx748.
9
Interactogeneous: disease gene prioritization using heterogeneous networks and full topology scores.互适性:使用异质网络和全拓扑评分进行疾病基因优先级排序。
PLoS One. 2012;7(11):e49634. doi: 10.1371/journal.pone.0049634. Epub 2012 Nov 19.
10
MVGCN: data integration through multi-view graph convolutional network for predicting links in biomedical bipartite networks.MVGCN:通过多视图图卷积网络进行数据集成以预测生物医学二分网络中的链接
Bioinformatics. 2022 Jan 3;38(2):426-434. doi: 10.1093/bioinformatics/btab651.

引用本文的文献

1
Leveraging Social Determinants of Health in Alzheimer's Research Using LLM-Augmented Literature Mining and Knowledge Graphs.利用基于大语言模型增强的文献挖掘和知识图谱,在阿尔茨海默病研究中利用健康的社会决定因素
AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:491-500. eCollection 2025.
2
Multiomics with Evolutionary Computation to Identify Molecular and Module Biomarkers for Early Diagnosis and Treatment of Complex Disease.结合多组学与进化计算以识别复杂疾病早期诊断和治疗的分子及模块生物标志物。
Genes (Basel). 2025 Feb 20;16(3):244. doi: 10.3390/genes16030244.
3
A knowledge graph approach to predict and interpret disease-causing gene interactions.
一种基于知识图谱的疾病相关基因互作预测与解释方法。
BMC Bioinformatics. 2023 Aug 29;24(1):324. doi: 10.1186/s12859-023-05451-5.
4
Predicting disease genes based on multi-head attention fusion.基于多头注意力融合的疾病基因预测。
BMC Bioinformatics. 2023 Apr 21;24(1):162. doi: 10.1186/s12859-023-05285-1.
5
HetIG-PreDiG: A Heterogeneous Integrated Graph Model for Predicting Human Disease Genes based on gene expression.HetIG-PreDiG:一种基于基因表达的用于预测人类疾病基因的异构集成图模型。
PLoS One. 2023 Feb 15;18(2):e0280839. doi: 10.1371/journal.pone.0280839. eCollection 2023.
6
Graph Embedding Based Novel Gene Discovery Associated With Diabetes Mellitus.基于图嵌入的与糖尿病相关的新型基因发现
Front Genet. 2021 Nov 25;12:779186. doi: 10.3389/fgene.2021.779186. eCollection 2021.
7
Hypergraph models of biological networks to identify genes critical to pathogenic viral response.生物网络的超图模型,以鉴定对致病病毒反应至关重要的基因。
BMC Bioinformatics. 2021 May 29;22(1):287. doi: 10.1186/s12859-021-04197-2.
8
RNMFMDA: A Microbe-Disease Association Identification Method Based on Reliable Negative Sample Selection and Logistic Matrix Factorization With Neighborhood Regularization.RNMFMDA:一种基于可靠负样本选择和带邻域正则化的逻辑矩阵分解的微生物-疾病关联识别方法。
Front Microbiol. 2020 Oct 27;11:592430. doi: 10.3389/fmicb.2020.592430. eCollection 2020.