Center for Cardiovascular Genetics, Brown Foundation Institute of Molecular Medicine, The University of Texas Health Science Center and Texas Heart Institute, Houston, TX 77030, USA.
Transl Res. 2012 Feb;159(2):64-79. doi: 10.1016/j.trsl.2011.08.001. Epub 2011 Aug 31.
The approach to molecular genetic studies of complex phenotypes evolved considerably during the recent years. The candidate gene approach, which is restricted to an analysis of a few single-nucleotide polymorphisms (SNPs) in a modest number of cases and controls, has been supplanted by the unbiased approach of genome-wide association studies (GWAS), wherein a large number of tagger SNPs are typed in many individuals. GWAS, which are designed on the common disease-common variant hypothesis (CD-CV), identified several SNPs and loci for complex phenotypes. However, the alleles identified through GWAS are typically not causative but rather in linkage disequilibrium (LD) with the true causal variants. The common alleles, which may not capture the uncommon and rare variants, account only for a fraction of heritability of the complex traits. Hence, the focus is being shifted to rare variants-common disease (RV-CD) hypothesis, surmising that rare variants exert large effect sizes on the phenotype. In conjunctional with this conceptual shift, technologic advances in DNA sequencing techniques have dramatically enhanced whole genome or whole exome sequencing capacity. The sequencing approach affords identification of not only the rare but also the common variants. The approach-whether used in complementation with GWAS or as a stand-alone approach-could define the genetic architecture of the complex phenotypes. Robust phenotyping and large-scale sequencing studies are essential to extract the information content of the vast number of DNA sequence variants (DSVs) in the genome. To garner meaningful clinical information and link the genotype to a phenotype, the identification and characterization of a large number of causal fields beyond the information content of DNA sequence variants would be necessary. This review provides an update on the current progress and limitations in identifying DSVs that are associated with phenotypic effects.
近年来,复杂表型的分子遗传学研究方法有了很大的发展。候选基因方法仅限于在少数病例和对照中分析少数几个单核苷酸多态性(SNP),已经被全基因组关联研究(GWAS)的无偏方法所取代,其中大量的标记 SNP 在许多个体中进行分型。基于常见疾病-常见变异假说(CD-CV)设计的 GWAS 确定了许多与复杂表型相关的 SNP 和基因座。然而,通过 GWAS 识别的等位基因通常不是致病的,而是与真正的致病变异处于连锁不平衡(LD)状态。常见等位基因可能无法捕获罕见和稀有变异,仅占复杂性状遗传力的一小部分。因此,研究重点正在转移到罕见变异-常见疾病(RV-CD)假说,推测罕见变异对表型有很大的影响。随着这一概念的转变,DNA 测序技术的技术进步极大地提高了全基因组或全外显子测序的能力。该测序方法不仅可以识别罕见变异,还可以识别常见变异。该方法——无论是与 GWAS 结合使用还是作为独立方法使用——都可以确定复杂表型的遗传结构。稳健的表型分析和大规模测序研究对于从基因组中大量的 DNA 序列变异(DSV)中提取信息内容至关重要。为了提取大量的遗传信息并将基因型与表型联系起来,有必要识别和描述除 DNA 序列变异信息内容之外的大量因果领域。这篇综述提供了有关鉴定与表型效应相关的 DSV 的最新进展和局限性的信息。