Becker Tim, Knapp Michael
Institute for Medical Biometry, Informatics and Epidemiology, University of Bonn, Bonn, Germany.
Am J Hum Genet. 2004 Oct;75(4):561-70. doi: 10.1086/424390. Epub 2004 Jul 30.
Haplotypes--that is, linear arrangements of alleles on the same chromosome that were inherited as a unit--are expected to carry important information in the context of association fine mapping of complex diseases. In consideration of a set of tightly linked markers, there is an enormous number of different marker combinations that can be analyzed. Therefore, a severe multiple-testing problem is introduced. One method to deal with this problem is Bonferroni correction by the number of combinations that are considered. Bonferroni correction is appropriate for independent tests but will result in a loss of power in the presence of linkage disequilibrium in the region. A second method is to perform simulations. It is unfortunate that most methods of haplotype analysis already require simulations to obtain an uncorrected P value for a specific marker combination. Thus, it seems that nested simulations are necessary to obtain P values that are corrected for multiple testing, which, apparently, limits the applicability of this approach because of computer running-time restrictions. Here, an algorithm is described that avoids such nested simulations. We check the validity of our approach under two disease models for haplotype analysis of family data. The true type I error rate of our algorithm corresponds to the nominal significance level. Furthermore, we observe a strong gain in power with our method to obtain the global P value, compared with the Bonferroni procedure to calculate the global P value. The method described here has been implemented in the latest update of our program FAMHAP.
单倍型,即作为一个单元遗传的同一染色体上等位基因的线性排列,有望在复杂疾病的关联精细定位中携带重要信息。考虑到一组紧密连锁的标记,存在大量可分析的不同标记组合。因此,引入了严重的多重检验问题。处理这个问题的一种方法是通过所考虑的组合数量进行Bonferroni校正。Bonferroni校正适用于独立检验,但在该区域存在连锁不平衡的情况下会导致检验效能的损失。第二种方法是进行模拟。不幸的是,大多数单倍型分析方法已经需要模拟来获得特定标记组合的未校正P值。因此,似乎需要进行嵌套模拟来获得针对多重检验校正的P值,显然,由于计算机运行时间的限制,这限制了该方法的适用性。在此,描述了一种避免此类嵌套模拟的算法。我们在两种疾病模型下检查了我们方法对家系数据进行单倍型分析的有效性。我们算法的实际I型错误率与名义显著性水平相对应。此外,与计算全局P值的Bonferroni程序相比,我们的方法在获得全局P值时观察到检验效能有显著提高。这里描述的方法已在我们的程序FAMHAP的最新版本中实现。