Accurate haplotype inference for multiple linked single-nucleotide polymorphisms using sibship data.

Authors

Liu Peng-Yuan, Lu Yan, Deng Hong-Wen

Affiliation

Osteoporosis Research Center, Creighton University, Omaha, Nebraska 68131, USA.

Publication information

Genetics. 2006 Sep;174(1):499-509. doi: 10.1534/genetics.105.054213. Epub 2006 Jun 18.

DOI:10.1534/genetics.105.054213

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1569787/

Abstract

Sibships are commonly used in genetic dissection of complex diseases, particularly for late-onset diseases. Haplotype-based association studies have been advocated as powerful tools for fine mapping and positional cloning of complex disease genes. Existing methods for haplotype inference using data from relatives were originally developed for pedigree data. In this study, we proposed a new statistical method for haplotype inference for multiple tightly linked single-nucleotide polymorphisms (SNPs), which is tailored for extensively accumulated sibship data. This new method was implemented via an expectation-maximization (EM) algorithm without the usual assumption of linkage equilibrium among markers. Our EM algorithm does not incur extra computational burden for haplotype inference using sibship data when compared with using unrelated parental data. Furthermore, its computational efficiency is not affected by increasing sibship size. We examined the robustness and statistical performance of our new method in simulated data created from an empirical haplotype data set of human growth hormone gene 1. The utility of our method was illustrated with an application to the analyses of haplotypes of three candidate genes for osteoporosis.
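
As a rough illustration of the EM idea mentioned in the abstract, the sketch below estimates haplotype frequencies from unphased SNP genotypes of unrelated individuals without assuming linkage equilibrium among markers. It is not the sibship method of the paper, whose details are not given here. The function names (`compatible_pairs`, `em_haplotype_frequencies`), the 0/1/2 genotype coding, and the Hardy-Weinberg assumption on haplotype pairs are all assumptions made for this example.

```python
from collections import defaultdict
from itertools import product


def compatible_pairs(genotype):
    """List the unordered haplotype pairs consistent with one unphased genotype.

    genotype: tuple with one entry per SNP, coded as the count (0, 1, or 2)
    of the '1' allele. This coding is an assumption of the sketch.
    """
    het = [i for i, g in enumerate(genotype) if g == 1]
    base = [g // 2 for g in genotype]  # homozygous sites are fixed; het sites filled below
    pairs = set()
    for bits in product((0, 1), repeat=len(het)):
        h1, h2 = list(base), list(base)
        for i, b in zip(het, bits):
            h1[i], h2[i] = b, 1 - b  # the two haplotypes carry opposite alleles
        pairs.add(tuple(sorted((tuple(h1), tuple(h2)))))
    return sorted(pairs)


def em_haplotype_frequencies(genotypes, n_iter=200, tol=1e-10):
    """EM estimate of haplotype frequencies for unrelated individuals."""
    pair_lists = [compatible_pairs(g) for g in genotypes]
    haps = sorted({h for pl in pair_lists for pair in pl for h in pair})
    freq = {h: 1.0 / len(haps) for h in haps}  # uniform starting values
    for _ in range(n_iter):
        counts = defaultdict(float)
        # E-step: posterior weight of each compatible pair given current frequencies
        for pl in pair_lists:
            weights = [freq[a] * freq[b] * (1 if a == b else 2) for a, b in pl]
            total = sum(weights)
            for (a, b), w in zip(pl, weights):
                counts[a] += w / total
                counts[b] += w / total
        # M-step: expected haplotype counts divided by the number of chromosomes
        new_freq = {h: counts[h] / (2 * len(genotypes)) for h in haps}
        delta = max(abs(new_freq[h] - freq[h]) for h in haps)
        freq = new_freq
        if delta < tol:
            break
    return freq


if __name__ == "__main__":
    # Two SNPs, three individuals; the last one is a double heterozygote
    # whose phase is ambiguous.
    data = [(0, 0), (2, 2), (1, 1)]
    for hap, f in em_haplotype_frequencies(data).items():
        print(hap, round(f, 4))
```

In this toy input the two homozygous individuals make the 0-0 and 1-1 haplotypes more plausible, so the EM iterations assign the ambiguous double heterozygote almost entirely to that pair. A sibship-based method would additionally use the constraint that siblings share parental haplotypes, which this sketch does not model.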
