Shibata Kyoko, Ito Toshikazu, Kitamura Yutaka, Iwasaki Naoko, Tanaka Hiroshi, Kamatani Naoyuki
Department of Bioinformatics, Graduate School of Tokyo Medical and Dental University, 113-8510, Japan.
Genetics. 2004 Sep;168(1):525-39. doi: 10.1534/genetics.104.029751.
The analysis of the haplotype-phenotype relationship has become more and more important. We have developed an algorithm, using individual genotypes at linked loci as well as their quantitative phenotypes, to estimate the parameters of the distribution of the phenotypes for subjects with and without a particular haplotype by an expectation-maximization (EM) algorithm. We assumed that the phenotype for a diplotype configuration follows a normal distribution. The algorithm simultaneously calculates the maximum likelihood (L0max) under the null hypothesis (i.e., nonassociation between the haplotype and phenotype), and the maximum likelihood (Lmax) under the alternative hypothesis (i.e., association between the haplotype and phenotype). Then we tested the association between the haplotype and the phenotype using a test statistic, -2 log(L0max/Lmax). The above algorithm along with some extensions for different modes of inheritance was implemented as a computer program, QTLHAPLO. Simulation studies using single-nucleotide polymorphism (SNP) genotypes have clarified that the estimation was very accurate when the linkage disequilibrium between linked loci was rather high. Empirical power using the simulated data was high enough. We applied QTLHAPLO for the analysis of the real data of the genotypes at the calpain 10 gene obtained from diabetic and control subjects in various laboratories.
单倍型与表型关系的分析变得越来越重要。我们开发了一种算法,利用连锁位点的个体基因型及其定量表型,通过期望最大化(EM)算法估计具有和不具有特定单倍型的受试者的表型分布参数。我们假设双倍型配置的表型呈正态分布。该算法同时计算零假设(即单倍型与表型之间无关联)下的最大似然值(L0max)和备择假设(即单倍型与表型之间有关联)下的最大似然值(Lmax)。然后我们使用检验统计量-2 log(L0max/Lmax)检验单倍型与表型之间的关联。上述算法以及针对不同遗传模式的一些扩展被实现为一个计算机程序QTLHAPLO。使用单核苷酸多态性(SNP)基因型的模拟研究表明,当连锁位点之间的连锁不平衡相当高时,估计非常准确。使用模拟数据的实证效能足够高。我们将QTLHAPLO应用于分析从各个实验室的糖尿病患者和对照受试者获得的钙蛋白酶10基因的基因型真实数据。