• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在关联分析中利用基因组结构

Exploiting genome structure in association analysis.

作者信息

Kim Seyoung, Xing Eric P

机构信息

School of Computer Science, Carnegie Mellon University , Pittsburgh, Pennsylvania.

出版信息

J Comput Biol. 2014 Apr;21(4):345-60. doi: 10.1089/cmb.2009.0224. Epub 2011 May 6.

DOI:10.1089/cmb.2009.0224
PMID:21548809
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3962648/
Abstract

A genome-wide association study involves examining a large number of single-nucleotide polymorphisms (SNPs) to identify SNPs that are significantly associated with the given phenotype, while trying to reduce the false positive rate. Although haplotype-based association methods have been proposed to accommodate correlation information across nearby SNPs that are in linkage disequilibrium, none of these methods directly incorporated the structural information such as recombination events along chromosome. In this paper, we propose a new approach called stochastic block lasso for association mapping that exploits prior knowledge on linkage disequilibrium structure in the genome such as recombination rates and distances between adjacent SNPs in order to increase the power of detecting true associations while reducing false positives. Following a typical linear regression framework with the genotypes as inputs and the phenotype as output, our proposed method employs a sparsity-enforcing Laplacian prior for the regression coefficients, augmented by a first-order Markov process along the sequence of SNPs that incorporates the prior information on the linkage disequilibrium structure. The Markov-chain prior models the structural dependencies between a pair of adjacent SNPs, and allows us to look for association SNPs in a coupled manner, combining strength from multiple nearby SNPs. Our results on HapMap-simulated datasets and mouse datasets show that there is a significant advantage in incorporating the prior knowledge on linkage disequilibrium structure for marker identification under whole-genome association.

摘要

全基因组关联研究涉及检查大量单核苷酸多态性(SNP),以识别与给定表型显著相关的SNP,同时试图降低假阳性率。尽管已经提出了基于单倍型的关联方法来处理处于连锁不平衡状态的相邻SNP之间的相关信息,但这些方法都没有直接纳入诸如沿染色体的重组事件等结构信息。在本文中,我们提出了一种称为随机块套索的新方法用于关联定位,该方法利用基因组中连锁不平衡结构的先验知识,如重组率和相邻SNP之间的距离,以提高检测真实关联的能力,同时减少假阳性。遵循以基因型为输入、表型为输出的典型线性回归框架,我们提出的方法对回归系数采用了一种强制稀疏的拉普拉斯先验,并通过沿SNP序列的一阶马尔可夫过程进行增强,该过程纳入了连锁不平衡结构的先验信息。马尔可夫链先验对一对相邻SNP之间的结构依赖性进行建模,并允许我们以耦合的方式寻找关联SNP,结合来自多个相邻SNP的强度。我们在HapMap模拟数据集和小鼠数据集上的结果表明,在全基因组关联下纳入连锁不平衡结构的先验知识用于标记识别具有显著优势。

相似文献

1
Exploiting genome structure in association analysis.在关联分析中利用基因组结构
J Comput Biol. 2014 Apr;21(4):345-60. doi: 10.1089/cmb.2009.0224. Epub 2011 May 6.
2
A method combining a random forest-based technique with the modeling of linkage disequilibrium through latent variables, to run multilocus genome-wide association studies.一种结合基于随机森林的技术和通过潜在变量进行连锁不平衡建模的方法,用于进行多基因座全基因组关联研究。
BMC Bioinformatics. 2018 Mar 27;19(1):106. doi: 10.1186/s12859-018-2054-0.
3
A hidden Markov random field model for genome-wide association studies.基于隐马尔可夫随机场模型的全基因组关联研究。
Biostatistics. 2010 Jan;11(1):139-50. doi: 10.1093/biostatistics/kxp043. Epub 2009 Oct 12.
4
Performance of a blockwise approach in variable selection using linkage disequilibrium information.使用连锁不平衡信息进行变量选择时的分块方法性能。
BMC Bioinformatics. 2015 May 8;16:148. doi: 10.1186/s12859-015-0556-6.
5
Exploiting Linkage Disequilibrium for Ultrahigh-Dimensional Genome-Wide Data with an Integrated Statistical Approach.利用连锁不平衡和综合统计方法处理超高维全基因组数据
Genetics. 2016 Feb;202(2):411-26. doi: 10.1534/genetics.115.179507. Epub 2015 Dec 12.
6
Fine mapping of disease genes using tagging SNPs.利用标签单核苷酸多态性对疾病基因进行精细定位。
Ann Hum Genet. 2007 Nov;71(Pt 6):815-27. doi: 10.1111/j.1469-1809.2007.00379.x. Epub 2007 Jun 22.
7
Incorporating single-locus tests into haplotype cladistic analysis in case-control studies.在病例对照研究中,将单基因座检验纳入单倍型分支分析。
PLoS Genet. 2007 Mar 23;3(3):e46. doi: 10.1371/journal.pgen.0030046.
8
Bayesian estimates of linkage disequilibrium.连锁不平衡的贝叶斯估计。
BMC Genet. 2007 Jun 25;8:36. doi: 10.1186/1471-2156-8-36.
9
Selecting Closely-Linked SNPs Based on Local Epistatic Effects for Haplotype Construction Improves Power of Association Mapping.基于局部上位效应选择紧密连锁 SNPs 进行单倍型构建可提高关联作图的功效。
G3 (Bethesda). 2019 Dec 3;9(12):4115-4126. doi: 10.1534/g3.119.400451.
10
A Markov chain model for haplotype assembly from SNP fragments.一种用于从单核苷酸多态性(SNP)片段进行单倍型组装的马尔可夫链模型。
Genome Inform. 2006;17(2):162-71.

本文引用的文献

1
Genome-wide association analysis by lasso penalized logistic regression.基于套索惩罚逻辑回归的全基因组关联分析。
Bioinformatics. 2009 Mar 15;25(6):714-21. doi: 10.1093/bioinformatics/btp041. Epub 2009 Jan 28.
2
Bayesian LASSO for quantitative trait loci mapping.用于数量性状基因座定位的贝叶斯套索法
Genetics. 2008 Jun;179(2):1045-55. doi: 10.1534/genetics.107.085589. Epub 2008 May 27.
3
Detecting disease-causing genes by LASSO-Patternsearch algorithm.利用LASSO模式搜索算法检测致病基因。
BMC Proc. 2007;1 Suppl 1(Suppl 1):S60. doi: 10.1186/1753-6561-1-s1-s60. Epub 2007 Dec 18.
4
Accommodating linkage disequilibrium in genetic-association analyses via ridge regression.通过岭回归在基因关联分析中纳入连锁不平衡。
Am J Hum Genet. 2008 Feb;82(2):375-85. doi: 10.1016/j.ajhg.2007.10.012.
5
Imputation-based analysis of association studies: candidate regions and quantitative traits.基于归因的关联研究分析:候选区域和数量性状
PLoS Genet. 2007 Jul;3(7):e114. doi: 10.1371/journal.pgen.0030114. Epub 2007 May 30.
6
Spectrum: joint Bayesian inference of population structure and recombination events.频谱:群体结构与重组事件的联合贝叶斯推断
Bioinformatics. 2007 Jul 1;23(13):i479-89. doi: 10.1093/bioinformatics/btm171.
7
Forty mouse strain survey of water and sodium intake.对四十种小鼠品系的水和钠摄入量进行的调查。
Physiol Behav. 2007 Aug 15;91(5):620-31. doi: 10.1016/j.physbeh.2007.03.025. Epub 2007 Apr 1.
8
Association mapping via regularized regression analysis of single-nucleotide-polymorphism haplotypes in variable-sized sliding windows.通过对可变大小滑动窗口中的单核苷酸多态性单倍型进行正则化回归分析进行关联作图。
Am J Hum Genet. 2007 Apr;80(4):705-15. doi: 10.1086/513205. Epub 2007 Feb 19.
9
Leveraging the HapMap correlation structure in association studies.在关联研究中利用HapMap关联结构。
Am J Hum Genet. 2007 Apr;80(4):683-91. doi: 10.1086/513109. Epub 2007 Mar 2.
10
Multilocus association mapping using variable-length Markov chains.使用可变长度马尔可夫链的多位点关联作图
Am J Hum Genet. 2006 Jun;78(6):903-13. doi: 10.1086/503876. Epub 2006 Apr 7.