Alonso Nerea, Lucas Gavin, Hysi Pirro
Rheumatic Diseases Unit, MRC Institute of Genetics and Molecular Medicine, Centre for Genomic and Experimental Medicine, Western General Hospital, University of Edinburgh , Edinburgh, UK.
Clear Genetics SL , Barcelona, Spain.
Bonekey Rep. 2015 Feb 11;4:635. doi: 10.1038/bonekey.2015.2. eCollection 2015.
Genome-wide association studies (GWAS) have been developed as a practical method to identify genetic loci associated with disease by scanning multiple markers across the genome. Significant advances in the genetics of complex diseases have been made owing to advances in genotyping technologies, the progress of projects such as HapMap and 1000G and the emergence of genetics as a collaborative discipline. Because of its great potential to be used in parallel by multiple collaborators, it is important to adhere to strict protocols assuring data quality and analyses. Quality control analyses must be applied to each sample and each single-nucleotide polymorphism (SNP). The software package PLINK is capable of performing the whole range of necessary quality control tests. Genotype imputation has also been developed to substantially increase the power of GWAS methodology. Imputation permits the investigation of associations at genetic markers that are not directly genotyped. Results of individual GWAS reports can be combined through meta-analysis. Finally, next-generation sequencing (NGS) has gained popularity in recent years through its capacity to analyse a much greater number of markers across the genome. Although NGS platforms are capable of examining a higher number of SNPs compared with GWA studies, the results obtained by NGS require careful interpretation, as their biological correlation is incompletely understood. In this article, we will discuss the basic features of such protocols.
全基因组关联研究(GWAS)已发展成为一种实用方法,通过扫描基因组中的多个标记来识别与疾病相关的基因位点。由于基因分型技术的进步、国际人类基因组单体型图计划(HapMap)和千人基因组计划(1000G)等项目的进展以及遗传学作为一门合作学科的出现,复杂疾病遗传学取得了重大进展。由于其具有被多个合作者并行使用的巨大潜力,因此遵守确保数据质量和分析的严格方案非常重要。质量控制分析必须应用于每个样本和每个单核苷酸多态性(SNP)。软件包PLINK能够执行所有必要的质量控制测试。基因型填补技术也已得到发展,以大幅提高GWAS方法的效能。填补允许对未直接进行基因分型的遗传标记处的关联进行研究。各个GWAS报告的结果可以通过荟萃分析进行合并。最后,近年来,新一代测序(NGS)因其能够分析基因组中更多数量的标记而受到欢迎。尽管与全基因组关联研究相比,NGS平台能够检测更多的SNP,但由于其生物学相关性尚未完全了解,因此对NGS获得的结果需要仔细解读。在本文中,我们将讨论此类方案的基本特征。