Liu Peng-Yuan, Lu Yan, Deng Hong-Wen
Osteoporosis Research Center, Creighton University, Omaha, Nebraska 68131, USA.
Genetics. 2006 Sep;174(1):499-509. doi: 10.1534/genetics.105.054213. Epub 2006 Jun 18.
Sibships are commonly used in genetic dissection of complex diseases, particularly for late-onset diseases. Haplotype-based association studies have been advocated as powerful tools for fine mapping and positional cloning of complex disease genes. Existing methods for haplotype inference using data from relatives were originally developed for pedigree data. In this study, we proposed a new statistical method for haplotype inference for multiple tightly linked single-nucleotide polymorphisms (SNPs), which is tailored for extensively accumulated sibship data. This new method was implemented via an expectation-maximization (EM) algorithm without the usual assumption of linkage equilibrium among markers. Our EM algorithm does not incur extra computational burden for haplotype inference using sibship data when compared with using unrelated parental data. Furthermore, its computational efficiency is not affected by increasing sibship size. We examined the robustness and statistical performance of our new method in simulated data created from an empirical haplotype data set of human growth hormone gene 1. The utility of our method was illustrated with an application to the analyses of haplotypes of three candidate genes for osteoporosis.
同胞关系常用于复杂疾病的基因剖析,尤其是对于晚发性疾病。基于单倍型的关联研究已被倡导作为精细定位和克隆复杂疾病基因的有力工具。现有的利用亲属数据进行单倍型推断的方法最初是为系谱数据开发的。在本研究中,我们提出了一种新的统计方法,用于对多个紧密连锁的单核苷酸多态性(SNP)进行单倍型推断,该方法是针对广泛积累的同胞关系数据量身定制的。这种新方法通过期望最大化(EM)算法实现,无需通常关于标记间连锁平衡的假设。与使用无关亲代数据相比,我们的EM算法在使用同胞关系数据进行单倍型推断时不会带来额外的计算负担。此外,其计算效率不受同胞关系规模增加的影响。我们在由人类生长激素基因1的经验单倍型数据集创建的模拟数据中检验了我们新方法的稳健性和统计性能。通过将我们的方法应用于骨质疏松症三个候选基因的单倍型分析,说明了该方法的实用性。