Valdes A M, Thomson G
Department of Integrative Biology, University of California at Berkeley, 94720-3140, USA.
Am J Hum Genet. 1997 Mar;60(3):703-16.
For many HLA-associated diseases, multiple alleles-- and, in some cases, multiple loci--have been suggested as the causative agents. The haplotype method for identifying disease-predisposing amino acids in a genetic region is a stratification analysis. We show that, for each haplotype combination containing all the amino acid sites involved in the disease process, the relative frequencies of amino acid variants at sites not involved in disease but in linkage disequilibrium with the disease-predisposing sites are expected to be the same in patients and controls. The haplotype method is robust to mode of inheritance and penetrance of the disease and can be used to determine unequivocally whether all amino acid sites involved in the disease have not been identified. Using a resampling technique, we developed a statistical test that takes account of the nonindependence of the sites sampled. Further, when multiple sites in the genetic region are involved in disease, the test statistic gives a closer fit to the null expectation when some--compared with none--of the true predisposing factors are included in the haplotype analysis. Although the haplotype method cannot distinguish between very highly correlated sites in one population, ethnic comparisons may help identify the true predisposing factors. The haplotype method was applied to insulin-dependent diabetes mellitus (IDDM) HLA class II DQA1-DQB1 data from Caucasian, African, and Japanese populations. Our results indicate that the combination DQA1#52 (Arg predisposing) DQB1#57 (Asp protective), which has been proposed as an important IDDM agent, does not include all the predisposing elements. With rheumatoid arthritis HLA class II DRB1 data, the results were consistent with the shared-epitope hypothesis.
对于许多与HLA相关的疾病,已提出多个等位基因——在某些情况下,还有多个基因座——作为致病因素。在遗传区域中识别疾病易感性氨基酸的单倍型方法是一种分层分析。我们表明,对于包含疾病过程中所有氨基酸位点的每种单倍型组合,在未参与疾病但与疾病易感性位点处于连锁不平衡状态的位点上,氨基酸变体在患者和对照中的相对频率预计是相同的。单倍型方法对疾病的遗传模式和外显率具有稳健性,可用于明确确定是否尚未识别出所有参与疾病的氨基酸位点。使用重采样技术,我们开发了一种统计检验方法,该方法考虑了所采样位点的非独立性。此外,当遗传区域中的多个位点参与疾病时,与单倍型分析中不包含任何真实易感性因素相比,当包含一些真实易感性因素时,检验统计量更符合零假设预期。尽管单倍型方法无法区分一个人群中高度相关的位点,但种族比较可能有助于识别真正的易感性因素。单倍型方法应用于来自白种人、非洲人和日本人群的胰岛素依赖型糖尿病(IDDM)HLA II类DQA1 - DQB1数据。我们的结果表明,已被提议作为重要IDDM致病因素的DQA1#52(精氨酸易感性)DQB1#57(天冬氨酸保护性)组合并不包含所有的致病因素。对于类风湿性关节炎HLA II类DRB1数据,结果与共享表位假说一致。