• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用多个基因网络通过网络表示学习对 GWAS 候选基因进行优先级排序。

Leveraging multiple gene networks to prioritize GWAS candidate genes via network representation learning.

机构信息

Department of Computer Science, Tsinghua University, Beijing, China; MOE Key Laboratory of Bioinformatics, Bioinformatics Division and Center for Synthetic and Systems Biology, TNLIST, China.

Department of Automation, Tsinghua University, Beijing, China; MOE Key Laboratory of Bioinformatics, Bioinformatics Division and Center for Synthetic and Systems Biology, TNLIST, China.

出版信息

Methods. 2018 Aug 1;145:41-50. doi: 10.1016/j.ymeth.2018.06.002. Epub 2018 Jun 3.

DOI:10.1016/j.ymeth.2018.06.002
PMID:29874547
Abstract

Genome-wide association studies (GWAS) have successfully discovered a number of disease-associated genetic variants in the past decade, providing an unprecedented opportunity for deciphering genetic basis of human inherited diseases. However, it is still a challenging task to extract biological knowledge from the GWAS data, due to such issues as missing heritability and weak interpretability. Indeed, the fact that the majority of discovered loci fall into noncoding regions without clear links to genes has been preventing the characterization of their functions and appealing for a sophisticated approach to bridge genetic and genomic studies. Towards this problem, network-based prioritization of candidate genes, which performs integrated analysis of gene networks with GWAS data, has emerged as a promising direction and attracted much attention. However, most existing methods overlook the sparse and noisy properties of gene networks and thus may lead to suboptimal performance. Motivated by this understanding, we proposed a novel method called REGENT for integrating multiple gene networks with GWAS data to prioritize candidate genes for complex diseases. We leveraged a technique called the network representation learning to embed a gene network into a compact and robust feature space, and then designed a hierarchical statistical model to integrate features of multiple gene networks with GWAS data for the effective inference of genes associated with a disease of interest. We applied our method to six complex diseases and demonstrated the superior performance of REGENT over existing approaches in recovering known disease-associated genes. We further conducted a pathway analysis and showed that the ability of REGENT to discover disease-associated pathways. We expect to see applications of our method to a broad spectrum of diseases for post-GWAS analysis. REGENT is freely available at https://github.com/wmmthu/REGENT.

摘要

全基因组关联研究(GWAS)在过去十年中成功发现了许多与疾病相关的遗传变异,为解析人类遗传性疾病的遗传基础提供了前所未有的机会。然而,由于遗传率缺失和可解释性弱等问题,从 GWAS 数据中提取生物学知识仍然是一项具有挑战性的任务。事实上,大多数已发现的基因座都位于没有明确与基因相关联的非编码区域,这阻碍了对其功能的描述,并需要一种复杂的方法来连接遗传和基因组研究。针对这个问题,基于网络的候选基因优先级排序方法,通过对基因网络和 GWAS 数据进行综合分析,已经成为一个很有前途的方向,并引起了广泛关注。然而,大多数现有的方法都忽略了基因网络的稀疏和嘈杂特性,因此可能导致次优的性能。受此启发,我们提出了一种新的方法,称为 REGENT,用于将多个基因网络与 GWAS 数据集成,以优先考虑复杂疾病的候选基因。我们利用一种称为网络表示学习的技术,将基因网络嵌入到一个紧凑而稳健的特征空间中,然后设计了一个层次统计模型,将多个基因网络的特征与 GWAS 数据集成,以便有效地推断与感兴趣疾病相关的基因。我们将我们的方法应用于六种复杂疾病,并证明了 REGENT 在恢复已知疾病相关基因方面优于现有方法的性能。我们进一步进行了途径分析,并显示了 REGENT 发现疾病相关途径的能力。我们希望看到我们的方法在广泛的疾病后 GWAS 分析中得到应用。REGENT 可在 https://github.com/wmmthu/REGENT 上免费获取。

相似文献

1
Leveraging multiple gene networks to prioritize GWAS candidate genes via network representation learning.利用多个基因网络通过网络表示学习对 GWAS 候选基因进行优先级排序。
Methods. 2018 Aug 1;145:41-50. doi: 10.1016/j.ymeth.2018.06.002. Epub 2018 Jun 3.
2
Simultaneous inference of phenotype-associated genes and relevant tissues from GWAS data via Bayesian integration of multiple tissue-specific gene networks.通过整合多个组织特异性基因网络,对 GWAS 数据进行贝叶斯分析,同时推断表型相关基因和相关组织。
J Mol Cell Biol. 2017 Dec 1;9(6):436-452. doi: 10.1093/jmcb/mjx059.
3
SigMod: an exact and efficient method to identify a strongly interconnected disease-associated module in a gene network.SigMod:一种在基因网络中识别强相互连接的疾病相关模块的精确且高效的方法。
Bioinformatics. 2017 May 15;33(10):1536-1544. doi: 10.1093/bioinformatics/btx004.
4
ancGWAS: a post genome-wide association study method for interaction, pathway and ancestry analysis in homogeneous and admixed populations.ancGWAS:一种用于同质和混合人群中相互作用、通路及祖先分析的全基因组关联研究后分析方法。
Bioinformatics. 2016 Feb 15;32(4):549-56. doi: 10.1093/bioinformatics/btv619. Epub 2015 Oct 27.
5
Epistasis network centrality analysis yields pathway replication across two GWAS cohorts for bipolar disorder.连锁不平衡网络中心性分析在两项双相情感障碍 GWAS 队列中产生了通路复制。
Transl Psychiatry. 2012 Aug 14;2(8):e154. doi: 10.1038/tp.2012.80.
6
A network approach to prioritizing susceptibility genes for genome-wide association studies.一种用于全基因组关联研究中优先考虑易感性基因的网络方法。
Genet Epidemiol. 2019 Jul;43(5):477-491. doi: 10.1002/gepi.22198. Epub 2019 Mar 11.
7
GenToS: Use of Orthologous Gene Information to Prioritize Signals from Human GWAS.GenToS:利用直系同源基因信息对人类全基因组关联研究中的信号进行优先级排序。
PLoS One. 2016 Sep 9;11(9):e0162466. doi: 10.1371/journal.pone.0162466. eCollection 2016.
8
Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.使用基于质量的两阶段随机森林进行全基因组关联数据分类和单核苷酸多态性选择。
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2164-16-S2-S5. Epub 2015 Jan 21.
9
Integrating regulatory features data for prediction of functional disease-associated SNPs.整合调控特征数据以预测功能疾病相关的 SNPs。
Brief Bioinform. 2019 Jan 18;20(1):26-32. doi: 10.1093/bib/bbx094.
10
Integration of Enhancer-Promoter Interactions with GWAS Summary Results Identifies Novel Schizophrenia-Associated Genes and Pathways.增强子-启动子相互作用与 GWAS 汇总结果的整合确定了新的与精神分裂症相关的基因和途径。
Genetics. 2018 Jul;209(3):699-709. doi: 10.1534/genetics.118.300805. Epub 2018 May 4.

引用本文的文献

1
Applications for Deep Learning in Epilepsy Genetic Research.深度学习在癫痫遗传学研究中的应用。
Int J Mol Sci. 2023 Sep 27;24(19):14645. doi: 10.3390/ijms241914645.
2
HiChIPdb: a comprehensive database of HiChIP regulatory interactions.HiChIPdb:一个综合的 HiChIP 调控相互作用数据库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D159-D166. doi: 10.1093/nar/gkac859.
3
IID 2021: towards context-specific protein interaction analyses by increased coverage, enhanced annotation and enrichment analysis.IID 2021:通过增加覆盖度、增强注释和富集分析实现针对具体上下文的蛋白质相互作用分析。
Nucleic Acids Res. 2022 Jan 7;50(D1):D640-D647. doi: 10.1093/nar/gkab1034.
4
The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets.2021 年的 STRING 数据库:可定制的蛋白质-蛋白质网络,以及用户上传的基因/测量集的功能特征分析。
Nucleic Acids Res. 2021 Jan 8;49(D1):D605-D612. doi: 10.1093/nar/gkaa1074.
5
SilencerDB: a comprehensive database of silencers.SilencerDB:一个全面的沉默子数据库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D221-D228. doi: 10.1093/nar/gkaa839.
6
Reaching the End-Game for GWAS: Machine Learning Approaches for the Prioritization of Complex Disease Loci.全基因组关联研究进入终局:用于复杂疾病基因座优先级排序的机器学习方法
Front Genet. 2020 Apr 15;11:350. doi: 10.3389/fgene.2020.00350. eCollection 2020.
7
Constructing tissue-specific transcriptional regulatory networks via a Markov random field.通过马尔可夫随机场构建组织特异性转录调控网络。
BMC Genomics. 2018 Dec 31;19(Suppl 10):884. doi: 10.1186/s12864-018-5277-6.