Identity by descent estimation with dense genome-wide genotype data.

Affiliation

Department of Human Genetics, University of Chicago, Illinois, USA.

Publication information

Genet Epidemiol. 2011 Sep;35(6):557-67. doi: 10.1002/gepi.20606. Epub 2011 Jul 18.

Abstract

We present a novel method, IBDLD, for estimating the probability of identity by descent (IBD) for a pair of related individuals at a locus, given dense genotype data and a pedigree of arbitrary size and complexity. IBDLD overcomes the challenges of exact multipoint estimation of IBD in pedigrees of potentially large size and eliminates the difficulty of accommodating the background linkage disequilibrium (LD) that is present in high-density genotype data. We show that IBDLD is much more accurate at estimating the true IBD sharing than methods that remove LD by pruning SNPs and is highly robust to pedigree errors or other forms of misspecified relationships. The method is fast and can be used to estimate the probability for each possible IBD sharing state at every SNP from a high-density genotyping array for hundreds of thousands of pairs of individuals. We use it to estimate point-wise and genome-wide IBD sharing between 185,745 pairs of subjects, all of whom are related through a single, large and complex 13-generation pedigree and genotyped with the Affymetrix 500k chip. We find that we are able to identify the true pedigree relationship for individuals who were misidentified in the collected data and estimate empirical kinship coefficients that can be used in follow-up QTL mapping studies. IBDLD is implemented as an open-source software package and is freely available.
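To make the link between per-SNP IBD state probabilities and an empirical kinship coefficient concrete, here is a minimal sketch. It is not IBDLD's implementation. It assumes non-inbred individuals, so that only three sharing states (0, 1 or 2 alleles IBD) apply, and it assumes an unweighted average over SNPs; both are simplifications that may not match what the software does. The names `genomewide_ibd` and `kinship_coefficient` are hypothetical.

```python
from typing import Sequence, Tuple

# (P(IBD=0), P(IBD=1), P(IBD=2)) for one pair of individuals at one SNP.
IBDProbs = Tuple[float, float, float]


def genomewide_ibd(per_snp: Sequence[IBDProbs]) -> IBDProbs:
    """Average per-SNP IBD state probabilities over all SNPs.

    Assumption: every SNP gets equal weight. A weighting by genetic
    distance would be an equally plausible choice.
    """
    if not per_snp:
        raise ValueError("at least one SNP is required")
    n = len(per_snp)
    k0 = sum(p[0] for p in per_snp) / n
    k1 = sum(p[1] for p in per_snp) / n
    k2 = sum(p[2] for p in per_snp) / n
    return (k0, k1, k2)


def kinship_coefficient(k1: float, k2: float) -> float:
    """Kinship coefficient for a non-inbred pair: phi = k1/4 + k2/2."""
    return 0.25 * k1 + 0.5 * k2


if __name__ == "__main__":
    # Toy input (made up for illustration): two SNPs for one pair.
    toy = [(0.0, 1.0, 0.0), (1.0, 0.0, 0.0)]
    k0, k1, k2 = genomewide_ibd(toy)
    print(k0, k1, k2, kinship_coefficient(k1, k2))
```

Under these assumptions, the expected sharing for full siblings (k1 = 0.5, k2 = 0.25) and for a parent-offspring pair (k1 = 1, k2 = 0) both give a kinship coefficient of 0.25, which is why the full set of state probabilities, and not the kinship value alone, is needed to tell such relationships apart.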
