• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过马尔可夫随机场模型将生物途径纳入全基因组关联研究中。

Incorporating biological pathways via a Markov random field model in genome-wide association studies.

机构信息

Division of Biostatistics, Department of Clinical Sciences, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America.

出版信息

PLoS Genet. 2011 Apr;7(4):e1001353. doi: 10.1371/journal.pgen.1001353. Epub 2011 Apr 7.

DOI:10.1371/journal.pgen.1001353
PMID:21490723
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3072362/
Abstract

Genome-wide association studies (GWAS) examine a large number of markers across the genome to identify associations between genetic variants and disease. Most published studies examine only single markers, which may be less informative than considering multiple markers and multiple genes jointly because genes may interact with each other to affect disease risk. Much knowledge has been accumulated in the literature on biological pathways and interactions. It is conceivable that appropriate incorporation of such prior knowledge may improve the likelihood of making genuine discoveries. Although a number of methods have been developed recently to prioritize genes using prior biological knowledge, such as pathways, most methods treat genes in a specific pathway as an exchangeable set without considering the topological structure of a pathway. However, how genes are related with each other in a pathway may be very informative to identify association signals. To make use of the connectivity information among genes in a pathway in GWAS analysis, we propose a Markov Random Field (MRF) model to incorporate pathway topology for association analysis. We show that the conditional distribution of our MRF model takes on a simple logistic regression form, and we propose an iterated conditional modes algorithm as well as a decision theoretic approach for statistical inference of each gene's association with disease. Simulation studies show that our proposed framework is more effective to identify genes associated with disease than a single gene-based method. We also illustrate the usefulness of our approach through its applications to a real data example.

摘要

全基因组关联研究 (GWAS) 检查基因组中的大量标记,以确定遗传变异与疾病之间的关联。大多数已发表的研究仅检查单一标记,这可能不如同时考虑多个标记和多个基因更具信息量,因为基因可能相互作用以影响疾病风险。文献中积累了大量关于生物途径和相互作用的知识。可以想象,适当纳入这些先验知识可能会提高发现真实结果的可能性。尽管最近已经开发了许多使用先前的生物学知识(如途径)来优先考虑基因的方法,但大多数方法将特定途径中的基因视为可交换的集合,而不考虑途径的拓扑结构。然而,基因之间在途径中的相互关系可能对于识别关联信号非常有帮助。为了在 GWAS 分析中利用途径中基因之间的连接信息,我们提出了一种马尔可夫随机场 (MRF) 模型,以纳入途径拓扑结构进行关联分析。我们表明,我们的 MRF 模型的条件分布采用简单的逻辑回归形式,并且我们提出了一种迭代条件模式算法以及一种决策理论方法,用于统计推断每个基因与疾病的关联。模拟研究表明,与基于单个基因的方法相比,我们提出的框架更有效地识别与疾病相关的基因。我们还通过将其应用于真实数据示例来说明我们方法的有用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/079be3815c5b/pgen.1001353.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/b58867eeccfc/pgen.1001353.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/07b9f8a9482a/pgen.1001353.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/55a26f9a0e05/pgen.1001353.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/9e66c3b366e8/pgen.1001353.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/59ade846127e/pgen.1001353.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/074c61445a1b/pgen.1001353.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/079be3815c5b/pgen.1001353.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/b58867eeccfc/pgen.1001353.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/07b9f8a9482a/pgen.1001353.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/55a26f9a0e05/pgen.1001353.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/9e66c3b366e8/pgen.1001353.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/59ade846127e/pgen.1001353.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/074c61445a1b/pgen.1001353.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d4e/3072362/079be3815c5b/pgen.1001353.g007.jpg

相似文献

1
Incorporating biological pathways via a Markov random field model in genome-wide association studies.通过马尔可夫随机场模型将生物途径纳入全基因组关联研究中。
PLoS Genet. 2011 Apr;7(4):e1001353. doi: 10.1371/journal.pgen.1001353. Epub 2011 Apr 7.
2
A network-based kernel machine test for the identification of risk pathways in genome-wide association studies.一种基于网络的核机器测试,用于在全基因组关联研究中识别风险通路。
Hum Hered. 2013;76(2):64-75. doi: 10.1159/000357567. Epub 2014 Jan 14.
3
Guilt by rewiring: gene prioritization through network rewiring in genome wide association studies.通过重连进行归责:在全基因组关联研究中通过网络重连进行基因优先级排序。
Hum Mol Genet. 2014 May 15;23(10):2780-90. doi: 10.1093/hmg/ddt668. Epub 2013 Dec 30.
4
Pathway Analysis Incorporating Protein-Protein Interaction Networks Identified Candidate Pathways for the Seven Common Diseases.整合蛋白质-蛋白质相互作用网络的通路分析确定了七种常见疾病的候选通路。
PLoS One. 2016 Sep 13;11(9):e0162910. doi: 10.1371/journal.pone.0162910. eCollection 2016.
5
Pathway-based analysis for genome-wide association studies using supervised principal components.基于有监督主成分的全基因组关联研究的通路分析。
Genet Epidemiol. 2010 Nov;34(7):716-24. doi: 10.1002/gepi.20532.
6
Integrating genetic and gene expression evidence into genome-wide association analysis of gene sets.将遗传和基因表达证据整合到基因集的全基因组关联分析中。
Genome Res. 2012 Feb;22(2):386-97. doi: 10.1101/gr.124370.111. Epub 2011 Sep 22.
7
Integration of biological networks and pathways with genetic association studies.生物网络和通路的整合与遗传关联研究。
Hum Genet. 2012 Oct;131(10):1677-86. doi: 10.1007/s00439-012-1198-7. Epub 2012 Jul 10.
8
Integrating Biological Knowledge Into Case-Control Analysis Through Iterated Conditional Modes/Medians Algorithm.通过迭代条件模式/中位数算法将生物学知识整合到病例对照分析中。
J Comput Biol. 2020 Jul;27(7):1171-1179. doi: 10.1089/cmb.2019.0319. Epub 2019 Nov 7.
9
A Markov random field model for network-based analysis of genomic data.一种用于基于网络的基因组数据分析的马尔可夫随机场模型。
Bioinformatics. 2007 Jun 15;23(12):1537-44. doi: 10.1093/bioinformatics/btm129. Epub 2007 May 5.
10
Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.使用基于质量的两阶段随机森林进行全基因组关联数据分类和单核苷酸多态性选择。
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2164-16-S2-S5. Epub 2015 Jan 21.

引用本文的文献

1
Weighted overlapping group lasso for integrating prior network knowledge into gene set analysis.用于将先验网络知识整合到基因集分析中的加权重叠组套索法。
BMC Bioinformatics. 2025 Sep 1;26(1):226. doi: 10.1186/s12859-025-06170-9.
2
Exploring drought-responsive crucial genes in .探索……中干旱响应关键基因
iScience. 2022 Oct 14;25(11):105347. doi: 10.1016/j.isci.2022.105347. eCollection 2022 Nov 18.
3
Network assisted analysis of de novo variants using protein-protein interaction information identified 46 candidate genes for congenital heart disease.

本文引用的文献

1
Identification of association between disease and multiple markers via sparse partial least-squares regression.通过稀疏偏最小二乘回归识别疾病与多个标志物之间的关联。
Genet Epidemiol. 2011 Sep;35(6):479-86. doi: 10.1002/gepi.20596. Epub 2011 Jun 15.
2
A hidden Markov random field model for genome-wide association studies.基于隐马尔可夫随机场模型的全基因组关联研究。
Biostatistics. 2010 Jan;11(1):139-50. doi: 10.1093/biostatistics/kxp043. Epub 2009 Oct 12.
3
Comparisons of multi-marker association methods to detect association between a candidate region and disease.
利用蛋白质-蛋白质相互作用信息进行从头变异的网络辅助分析,确定了 46 个先天性心脏病候选基因。
PLoS Genet. 2022 Jun 7;18(6):e1010252. doi: 10.1371/journal.pgen.1010252. eCollection 2022 Jun.
4
A Markov random field model for network-based differential expression analysis of single-cell RNA-seq data.基于马尔可夫随机场模型的单细胞 RNA-seq 数据的网络差异表达分析。
BMC Bioinformatics. 2021 Oct 26;22(1):524. doi: 10.1186/s12859-021-04412-0.
5
Statistical Identification of Important Nodes in Biological Systems.生物系统中重要节点的统计识别
J Syst Sci Complex. 2021;34(4):1454-1470. doi: 10.1007/s11424-020-0013-0. Epub 2021 Aug 10.
6
Cell Heterogeneity Analysis in Single-Cell RNA-seq Data Using Mixture Exponential Graph and Markov Random Field Model.基于混合指数图和马尔可夫随机场模型的单细胞 RNA-seq 数据中的细胞异质性分析。
Biomed Res Int. 2021 May 22;2021:9919080. doi: 10.1155/2021/9919080. eCollection 2021.
7
A Review of Statistical Methods for Identifying Trait-Relevant Tissues and Cell Types.识别与性状相关的组织和细胞类型的统计方法综述
Front Genet. 2021 Jan 22;11:587887. doi: 10.3389/fgene.2020.587887. eCollection 2020.
8
Statistical Identification of Important Nodes in Biological Systems.生物系统中重要节点的统计识别
J Syst Sci Complex. 2021 Jan 12:1-17. doi: 10.1007/s11424-021-0001-2.
9
Protein-Protein interactions uncover candidate 'core genes' within omnigenic disease networks.蛋白质-蛋白质相互作用揭示了全基因组疾病网络中的候选“核心基因”。
PLoS Genet. 2020 Jul 17;16(7):e1008903. doi: 10.1371/journal.pgen.1008903. eCollection 2020 Jul.
10
Leveraging gene co-expression patterns to infer trait-relevant tissues in genome-wide association studies.利用基因共表达模式推断全基因组关联研究中与性状相关的组织。
PLoS Genet. 2020 Apr 20;16(4):e1008734. doi: 10.1371/journal.pgen.1008734. eCollection 2020 Apr.
比较多种标记物关联方法,以检测候选区域与疾病之间的关联。
Genet Epidemiol. 2010 Apr;34(3):201-12. doi: 10.1002/gepi.20448.
4
Gene and pathway-based second-wave analysis of genome-wide association studies.基于基因和通路的全基因组关联研究的二次分析。
Eur J Hum Genet. 2010 Jan;18(1):111-7. doi: 10.1038/ejhg.2009.115.
5
CANDID: a flexible method for prioritizing candidate genes for complex human traits.CANDID:一种用于对复杂人类性状的候选基因进行优先级排序的灵活方法。
Genet Epidemiol. 2008 Dec;32(8):779-90. doi: 10.1002/gepi.20346.
6
Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease.全基因组关联研究确定了30多个克罗恩病的不同易感基因座。
Nat Genet. 2008 Aug;40(8):955-62. doi: 10.1038/ng.175. Epub 2008 Jun 29.
7
Systematic biological prioritization after a genome-wide association study: an application to nicotine dependence.全基因组关联研究后的系统生物学优先级排序:在尼古丁依赖中的应用
Bioinformatics. 2008 Aug 15;24(16):1805-11. doi: 10.1093/bioinformatics/btn315. Epub 2008 Jun 19.
8
Network-based global inference of human disease genes.基于网络的人类疾病基因全局推断
Mol Syst Biol. 2008;4:189. doi: 10.1038/msb.2008.27. Epub 2008 May 6.
9
Walking the interactome for prioritization of candidate disease genes.遍历相互作用组以对候选疾病基因进行优先级排序。
Am J Hum Genet. 2008 Apr;82(4):949-58. doi: 10.1016/j.ajhg.2008.02.013. Epub 2008 Mar 27.
10
Incorporating gene networks into statistical tests for genomic data via a spatially correlated mixture model.通过空间相关混合模型将基因网络纳入基因组数据的统计测试。
Bioinformatics. 2008 Feb 1;24(3):404-11. doi: 10.1093/bioinformatics/btm612. Epub 2007 Dec 14.