Department of Psychiatry, Washington University School of Medicine, St Louis, MO 63110, USA.
Genet Epidemiol. 2011;35 Suppl 1(0 1):S74-9. doi: 10.1002/gepi.20654.
Recent developments in sequencing technology have allowed the investigation of the common disease/rare variant hypothesis. In the Genetic Analysis Workshop 17 data set, we have sequence data on both unrelated individuals and eight large extended pedigrees with simulated quantitative and qualitative phenotypes. Group 11, whose focus was incorporating linkage information, considered several different ways to use the extended pedigrees to identify causal genes and variants. The first issue was the use of standard linkage or identity-by-descent information to identify regions containing causal rare variants. We found that rare variants of large effect segregating through pedigrees were precisely the bailiwick of linkage analysis. For a common disease, we anticipate many risk loci, so a heterogeneity linkage analysis or an analysis of a single pedigree at a time may be useful. The second issue was using pedigree data to identify individuals for sequencing. If one can identify linked regions and even carriers of risk haplotypes, the sequencing will be substantially more efficient. In fact, sequencing only 2.5% of the genome in carefully selected individuals can detect 52% of the risk variants that would be detected through whole-exome sequencing in a large number of unrelated individuals. Finally, we found that linkage information from pedigrees can provide weights for case-control association tests. We also found that pedigree-based association tests have the same issues of binning variants and variant counting as those in tests of unrelated individuals. Clearly, when pedigrees are available, they can provide great assistance in the search for rare variants that influence common disorders.
测序技术的最新进展使得对常见疾病/稀有变异假说的研究成为可能。在遗传分析研讨会 17 数据集,我们既有无关个体的序列数据,也有 8 个带有模拟定量和定性表型的大型扩展家系的序列数据。第 11 组的重点是整合连锁信息,他们考虑了几种不同的方法来利用大型家系识别因果基因和变异。第一个问题是使用标准连锁或同源性来识别包含因果稀有变异的区域。我们发现,通过家系分离的大效应稀有变异正是连锁分析的管辖范围。对于一种常见疾病,我们预计会有许多风险位点,因此进行异质性连锁分析或逐个分析单个家系可能会很有用。第二个问题是使用家系数据识别需要测序的个体。如果能够识别连锁区域甚至风险单倍型携带者,测序将大大提高效率。实际上,在精心挑选的个体中只对基因组的 2.5%进行测序,就可以检测到在大量无关个体中通过全外显子组测序检测到的 52%的风险变异。最后,我们发现家系的连锁信息可以为病例对照关联测试提供权重。我们还发现,基于家系的关联测试与对无关个体的测试一样,存在着对变异进行分组和计数的问题。显然,当家系可用时,它们可以极大地帮助寻找影响常见疾病的稀有变异。