Svejgaard A, Ryder L P
Department of Clinical Immunology, National University Hospital (Rigshospitalet), Copenhagen, Denmark.
Tissue Antigens. 1994 Jan;43(1):18-27. doi: 10.1111/j.1399-0039.1994.tb02291.x.
A major aim of HLA and disease association studies is to identify the causative HLA factor truly responsible for the association. This is usually difficult due to the pronounced linkage disequilibrium between most HLA determinants. The causative factor must show the strongest association compared to all other factors. Here we describe a simple analysis which can be used to identify which of two factors, say A and B, shows the strongest association. The basic data for the analysis are the entries of the two-by-four table giving the four phenotypic combinations of A and B in patients and controls, respectively. These data are analyzed in various two-by-two tables involving stratification of each of the two factors against the other. A stronger increase of factor A is established if A is significantly associated with the condition both in B-positives and in B-negatives, when this is not true for B in A-positives and A-negatives. Using simulation with control data, it is demonstrated how linkage disequilibrium may influence secondary associations. The analysis may also be used to investigate interaction between HLA factors, but linkage disequilibrium complicates the interpretation in such cases. The method is exemplified using various published data. Finally, some statistical recommendations are given. Thus, we advise that phenotype (marker) frequencies are generally used instead of gene (i.e. allele, or haplotype) frequencies. The importance of correcting p-values, the levels of significance, and the power of Fisher's exact test are discussed.
HLA与疾病关联研究的一个主要目标是确定真正导致这种关联的HLA因素。由于大多数HLA决定簇之间存在明显的连锁不平衡,这通常很困难。与所有其他因素相比,致病因素必须显示出最强的关联性。在此,我们描述一种简单的分析方法,可用于确定两个因素(如A和B)中哪一个显示出最强的关联性。该分析的基础数据是二乘四表格中的数据项,分别给出了患者和对照中A和B的四种表型组合。这些数据在各种二乘二表格中进行分析,其中涉及将两个因素中的每一个与另一个进行分层。如果A在B阳性和B阴性个体中均与疾病显著相关,而B在A阳性和A阴性个体中并非如此,则可确定A有更强的增加趋势。通过对对照数据进行模拟,展示了连锁不平衡如何影响次级关联。该分析也可用于研究HLA因素之间的相互作用,但在这种情况下,连锁不平衡会使解释变得复杂。文中使用各种已发表的数据对该方法进行了举例说明。最后,给出了一些统计建议。因此,我们建议一般使用表型(标记)频率而非基因(即等位基因或单倍型)频率。讨论了校正p值、显著性水平以及Fisher精确检验功效的重要性。