Dipartimento di Scienze Mediche, Università di Torino, Torino, Italy.
Environ Mol Mutagen. 2013 Aug;54(7):518-32. doi: 10.1002/em.21799. Epub 2013 Aug 6.
The allelic frequency spectrum emerging from several Next Generation Sequencing (NGS) projects is revealing important details about evolutionary and demographic forces that shaped the human genome. Herein, we discuss some of the achievements of the use of low-frequency and rare variants from NGS studies. The majority of variants that affect protein-coding regions are recent and rare. Often, the novel rare variants are enriched for deleterious alleles and are population-specific, making them suitable for the study of disease susceptibility. To investigate this kind of variation and its effects in association studies, very large sample sizes will be necessary to achieve sufficient statistical power. Moreover, as these variants are typically population-specific, the replication of disease associations across populations could be very difficult due to population stratification. Therefore, the design of experiments focusing on the identification of rare variants and their effects should be carefully planned. Although several successes have already been achieved through NGS for genetic epidemiology, pharmacogenetic and clinical purposes, with improvements of the sequencing technology and decreased costs, further advances are expected in the near future.
从几个下一代测序(NGS)项目中出现的等位基因频率谱揭示了关于塑造人类基因组的进化和人口统计学力量的重要细节。在这里,我们讨论了使用 NGS 研究中的低频和罕见变体的一些成就。影响蛋白质编码区域的大多数变体都是最近的和罕见的。通常,新的罕见变体富含有害等位基因,并且是特定于人群的,这使得它们适合研究疾病易感性。为了研究这种变异及其在关联研究中的影响,需要非常大的样本量才能达到足够的统计效力。此外,由于这些变体通常是特定于人群的,因此由于人群分层,跨人群复制疾病关联可能非常困难。因此,应仔细计划专注于识别罕见变体及其影响的实验设计。尽管通过 NGS 在遗传流行病学、药物遗传学和临床方面已经取得了一些成功,但随着测序技术的改进和成本的降低,预计在不久的将来还会有进一步的进展。