• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用密集单核苷酸多态性(SNP)或序列数据计算炎症性肠病(IBD)概率。

Calculation of IBD probabilities with dense SNP or sequence data.

作者信息

Keith Jonathan M, McRae Allan, Duffy David, Mengersen Kerrie, Visscher Peter M

机构信息

School of Mathematical Sciences, Queensland University of Technology, Brisbane, Qld. 4001, Australia.

出版信息

Genet Epidemiol. 2008 Sep;32(6):513-9. doi: 10.1002/gepi.20324.

DOI:10.1002/gepi.20324
PMID:18357613
Abstract

The probabilities that two individuals share 0, 1, or 2 alleles identical by descent (IBD) at a given genotyped marker locus are quantities of fundamental importance for disease gene and quantitative trait mapping and in family-based tests of association. Until recently, genotyped markers were sufficiently sparse that founder haplotypes could be modelled as having been drawn from a population in linkage equilibrium for the purpose of estimating IBD probabilities. However, with the advent of high-throughput single nucleotide polymorphism genotyping assays, this is no longer a reasonable assumption. Indeed, the imminent arrival of individual sequencing will enable high-density single nucleotide polymorphism genotyping on a scale for which current algorithms are not equipped. In this paper, we present a simple new model in which founder haplotypes are modelled as a Markov chain. Another important innovation is that genotyping errors are explicitly incorporated into the model. We compare results obtained using the new model to those obtained using the popular genetic linkage analysis package Merlin, with and without using the cluster model of linkage disequilibrium that is incorporated into that program. We find that the new model results in accuracy approaching that of Merlin with haplotype blocks, but achieves this with orders of magnitude faster run times. Moreover, the new algorithm scales linearly with number of markers, irrespective of density, whereas Merlin scales supralinearly. We also confirm a previous finding that ignoring linkage disequilibrium in founder haplotypes can cause errors in the calculation of IBD probabilities.

摘要

在给定的基因分型标记位点上,两个个体通过血缘共享0、1或2个相同等位基因(IBD)的概率,对于疾病基因和数量性状定位以及基于家系的关联检验来说,是至关重要的量。直到最近,基因分型标记还足够稀疏,以至于为了估计IBD概率,可以将奠基者单倍型建模为从处于连锁平衡的群体中抽取。然而,随着高通量单核苷酸多态性基因分型检测技术的出现,这不再是一个合理的假设。事实上,个体测序即将到来,将能够进行高密度单核苷酸多态性基因分型,而目前的算法还无法应对这样的规模。在本文中,我们提出了一个简单的新模型,其中奠基者单倍型被建模为一个马尔可夫链。另一个重要的创新是,基因分型错误被明确纳入模型。我们将使用新模型得到的结果与使用流行的遗传连锁分析软件Merlin得到的结果进行比较,包括使用和不使用该程序中纳入的连锁不平衡聚类模型的情况。我们发现,新模型的准确性接近使用单倍型模块的Merlin,但运行时间要快几个数量级。此外,新算法与标记数量呈线性比例关系,与密度无关,而Merlin呈超线性比例关系。我们还证实了之前的一个发现,即忽略奠基者单倍型中的连锁不平衡会导致IBD概率计算中的错误。

相似文献

1
Calculation of IBD probabilities with dense SNP or sequence data.利用密集单核苷酸多态性(SNP)或序列数据计算炎症性肠病(IBD)概率。
Genet Epidemiol. 2008 Sep;32(6):513-9. doi: 10.1002/gepi.20324.
2
A Markov chain model for haplotype assembly from SNP fragments.一种用于从单核苷酸多态性(SNP)片段进行单倍型组装的马尔可夫链模型。
Genome Inform. 2006;17(2):162-71.
3
A general method for linkage disequilibrium correction for multipoint linkage and association.一种用于多点连锁与关联分析中连锁不平衡校正的通用方法。
Genet Epidemiol. 2008 Nov;32(7):647-57. doi: 10.1002/gepi.20339.
4
Linkage disequilibrium assessment via log-linear modeling of SNP haplotype frequencies.通过单核苷酸多态性(SNP)单倍型频率的对数线性模型进行连锁不平衡评估。
Genet Epidemiol. 2003 Sep;25(2):106-14. doi: 10.1002/gepi.10254.
5
Optimal haplotype structure for linkage disequilibrium-based fine mapping of quantitative trait loci using identity by descent.基于家系同一性的数量性状基因座连锁不平衡精细定位的最佳单倍型结构
Genetics. 2006 Mar;172(3):1955-65. doi: 10.1534/genetics.105.048686. Epub 2005 Dec 1.
6
Does strong linkage disequilibrium guarantee redundant association results?强连锁不平衡是否保证关联结果冗余?
Genet Epidemiol. 2008 Sep;32(6):546-52. doi: 10.1002/gepi.20328.
7
Susceptibility of biallelic haplotype and genotype frequencies to genotyping error.双等位基因单倍型和基因型频率对基因分型错误的敏感性。
Biometrics. 2006 Dec;62(4):1116-23. doi: 10.1111/j.1541-0420.2006.00563.x.
8
Fine mapping functional sites or regions from case-control data using haplotypes of multiple linked SNPs.利用多个连锁单核苷酸多态性(SNP)的单倍型,从病例对照数据中精细定位功能位点或区域。
Ann Hum Genet. 2005 Jan;69(Pt 1):102-12. doi: 10.1046/j.1529-8817.2004.00140.x.
9
Fine mapping of disease genes via haplotype clustering.通过单倍型聚类对疾病基因进行精细定位。
Genet Epidemiol. 2006 Feb;30(2):170-9. doi: 10.1002/gepi.20134.
10
Accounting for haplotype phase uncertainty in linkage disequilibrium estimation.在连锁不平衡估计中考虑单倍型相位不确定性。
Genet Epidemiol. 2008 Feb;32(2):168-78. doi: 10.1002/gepi.20273.

引用本文的文献

1
Robust and Powerful Affected Sibpair Test for Rare Variant Association.用于罕见变异关联分析的稳健且强大的患病同胞对检验
Genet Epidemiol. 2015 Jul;39(5):325-33. doi: 10.1002/gepi.21903. Epub 2015 May 13.
2
Identity by descent estimation with dense genome-wide genotype data.基于全基因组高密度基因型数据的亲缘关系估计。
Genet Epidemiol. 2011 Sep;35(6):557-67. doi: 10.1002/gepi.20606. Epub 2011 Jul 18.
3
Efficient identification of identical-by-descent status in pedigrees with many untyped individuals.高效鉴定具有大量未分型个体的家系中同系同源状态。
Bioinformatics. 2010 Jun 15;26(12):i191-8. doi: 10.1093/bioinformatics/btq222.