Li Mingyao, Boehnke Michael, Abecasis Gonçalo R
Center for Clinical Epidemiology and Biostatistics, University of Pennsylvania School of Medicine, Philadelphia; and; Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor.
Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor.
Am J Hum Genet. 2006 May;78(5):778-792. doi: 10.1086/503711. Epub 2006 Mar 20.
Linkage mapping of complex diseases is often followed by association studies between phenotypes and marker genotypes through use of case-control or family-based designs. Given fixed genotyping resources, it is important to know which study designs are the most efficient. To address this problem, we extended the likelihood-based method of Li et al., which assesses whether there is linkage disequilibrium between a disease locus and a SNP, to accommodate sibships of arbitrary size and disease-phenotype configuration. A key advantage of our method is the ability to combine data from different family structures. We consider scenarios for which genotypes are available for unrelated cases, affected sib pairs (ASPs), or only one sibling per ASP. We construct designs that use cases only and others that use unaffected siblings or unrelated unaffected individuals as controls. Different combinations of cases and controls result in seven study designs. We compare the efficiency of these designs when the number of individuals to be genotyped is fixed. Our results suggest that (1) when the disease is influenced by a single gene, the one sibling per ASP-control design is the most efficient, followed by the ASP-control design, and familial cases contribute more association information than singleton cases; (2) when the disease is influenced by multiple genes, familial cases provide more association information than singleton cases, unless the effect of the locus being tested is much smaller than at least one other untested disease locus; and (3) the case-control design can be useful for detecting genes with small effect in the presence of genes with much larger effect. Our findings will be helpful for researchers designing and analyzing complex disease-association studies and will facilitate genotyping resource allocation.
复杂疾病的连锁图谱绘制之后,通常会通过病例对照设计或基于家系的设计,对表型与标记基因型之间进行关联研究。在基因分型资源固定的情况下,了解哪种研究设计效率最高很重要。为了解决这个问题,我们扩展了Li等人基于似然性的方法,该方法评估疾病位点与单核苷酸多态性(SNP)之间是否存在连锁不平衡,以适应任意大小的同胞组和疾病表型配置。我们方法的一个关键优势是能够合并来自不同家系结构的数据。我们考虑了以下几种情况:基因型可用于无关病例、患病同胞对(ASP),或者每个ASP仅有一个同胞。我们构建了仅使用病例的设计,以及使用未患病同胞或无关未患病个体作为对照的其他设计。病例与对照的不同组合产生了七种研究设计。当待基因分型的个体数量固定时,我们比较了这些设计的效率。我们的结果表明:(1)当疾病受单个基因影响时,每个ASP一个同胞作为对照的设计效率最高, 其次是ASP作为对照的设计,并且家系病例比散发病例贡献更多的关联信息;(2)当疾病受多个基因影响时,家系病例比散发病例提供更多的关联信息,除非所检测位点的效应远小于至少一个其他未检测的疾病位点;(3)在存在效应大得多的基因的情况下,病例对照设计对于检测效应小的基因可能有用。我们的研究结果将有助于研究人员设计和分析复杂疾病关联研究,并有助于基因分型资源的分配。