Suppr超能文献

利用HapMap和千人基因组计划参考面板对拉丁裔进行基因型推算

Genotype Imputation for Latinos Using the HapMap and 1000 Genomes Project Reference Panels.

作者信息

Gao Xiaoyi, Haritunians Talin, Marjoram Paul, McKean-Cowdin Roberta, Torres Mina, Taylor Kent D, Rotter Jerome I, Gauderman William J, Varma Rohit

机构信息

Department of Ophthalmology, Keck School of Medicine, University of Southern California Los Angeles, CA, USA.

出版信息

Front Genet. 2012 Jun 27;3:117. doi: 10.3389/fgene.2012.00117. eCollection 2012.

Abstract

Genotype imputation is a vital tool in genome-wide association studies (GWAS) and meta-analyses of multiple GWAS results. Imputation enables researchers to increase genomic coverage and to pool data generated using different genotyping platforms. HapMap samples are often employed as the reference panel. More recently, the 1000 Genomes Project resource is becoming the primary source for reference panels. Multiple GWAS and meta-analyses are targeting Latinos, the most populous, and fastest growing minority group in the US. However, genotype imputation resources for Latinos are rather limited compared to individuals of European ancestry at present, largely because of the lack of good reference data. One choice of reference panel for Latinos is one derived from the population of Mexican individuals in Los Angeles contained in the HapMap Phase 3 project and the 1000 Genomes Project. However, a detailed evaluation of the quality of the imputed genotypes derived from the public reference panels has not yet been reported. Using simulation studies, the Illumina OmniExpress GWAS data from the Los Angles Latino Eye Study and the MACH software package, we evaluated the accuracy of genotype imputation in Latinos. Our results show that the 1000 Genomes Project AMR + CEU + YRI reference panel provides the highest imputation accuracy for Latinos, and that also including Asian samples in the panel can reduce imputation accuracy. We also provide the imputation accuracy for each autosomal chromosome using the 1000 Genomes Project panel for Latinos. Our results serve as a guide to future imputation based analysis in Latinos.

摘要

基因型填充是全基因组关联研究(GWAS)以及多个GWAS结果的荟萃分析中的一项重要工具。填充使研究人员能够扩大基因组覆盖范围,并整合使用不同基因分型平台生成的数据。HapMap样本常被用作参考面板。最近,千人基因组计划资源正成为参考面板的主要来源。多个GWAS和荟萃分析的目标是拉丁裔,他们是美国人口最多且增长最快的少数群体。然而,与欧洲血统的个体相比,目前拉丁裔的基因型填充资源相当有限,这主要是因为缺乏良好的参考数据。拉丁裔参考面板的一个选择是来自HapMap 3期项目和千人基因组计划中包含的洛杉矶墨西哥人群体。然而,尚未有关于从公共参考面板获得的填充基因型质量的详细评估报告。我们使用模拟研究、来自洛杉矶拉丁裔眼研究的Illumina OmniExpress GWAS数据以及MACH软件包,评估了拉丁裔基因型填充的准确性。我们的结果表明,千人基因组计划的AMR + CEU + YRI参考面板为拉丁裔提供了最高的填充准确性,并且在面板中纳入亚洲样本会降低填充准确性。我们还使用针对拉丁裔的千人基因组计划面板提供了每条常染色体的填充准确性。我们的结果为未来拉丁裔基于填充的分析提供了指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d08/3384355/a466dc8c2863/fgene-03-00117-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验