Analyzing the Korean reference genome with meta-imputation increased the imputation accuracy and spectrum of rare variants in the Korean population.

Authors

Hwang Mi Yeong, Choi Nak-Hyeon, Won Hong Hee, Kim Bong-Jo, Kim Young Jin

Affiliations

Division of Genome Science, Department of Precision Medicine, National Institute of Health, Cheongju-si, South Korea.

Department of Digital Health, Samsung Advanced Institute for Health Sciences and Technology (SAIHST), Samsung Medical Center, Sungkyunkwan University, Seoul, South Korea.

Publication information

Front Genet. 2022 Nov 24;13:1008646. doi: 10.3389/fgene.2022.1008646. eCollection 2022.

DOI:10.3389/fgene.2022.1008646

PMID:36506321

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9731225/

Abstract

Genotype imputation is essential for enhancing the power of association mapping and for discovering rare variants and indels that are missed by most genotyping arrays. Imputation can be more accurate with a population-specific reference panel or with a multi-ethnic reference panel containing numerous samples. The National Institute of Health, Republic of Korea, initiated the Korean Reference Genome (KRG) project to identify variants in whole-genome sequences of ∼20,000 Korean participants. In the pilot phase, we analyzed the data from 1,490 participants. The genetic characteristics and imputation performance of the KRG were compared with those of the 1000 Genomes Project Phase 3, GenomeAsia 100K Project, ChinaMAP, NARD, and TOPMed reference panels. For the comparison, genotype panels were artificially generated using whole-genome sequencing data from combinations of four different ancestries (Korean, Japanese, Chinese, and European) and two population-specific optimized microarrays (Korea Biobank Array and UK Biobank Array). The KRG reference panel performed best for the Korean population (R² = 0.78-0.84; 91.9% of variants with allele frequency >5% were well imputed), although the other reference panels comprised larger numbers of samples with genetically different backgrounds. Across multiple reference panels and multi-ethnic genotype panels, optimal imputation was obtained using reference panels from genetically related populations together with a population-optimized microarray. Indeed, the KRG and TOPMed reference panels showed the best performance when applied to the genotype panels of KBA (R² = 0.84) and UKB (R² = 0.87), respectively. Using a meta-imputation approach to merge imputation results from different reference panels increased the imputation accuracy for rare variants (∼7%) and provided additional well-imputed variants (∼20%) with imputation accuracy comparable to that of the KRG. Our results demonstrate the importance of using a population-specific reference panel and meta-imputation to assess a substantial number of accurately imputed rare variants.
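
The two accuracy summaries quoted in the abstract (a squared correlation and a percentage of well-imputed variants) can be illustrated with a minimal Python sketch. It is not the authors' pipeline. It assumes accuracy is the squared Pearson correlation between true genotypes (coded 0/1/2) and imputed dosages, and that "well imputed" means reaching a cutoff of 0.8; both the cutoff and the function names `squared_correlation` and `well_imputed_fraction` are hypothetical choices made for illustration, since the abstract does not state them.

```python
def squared_correlation(true_genotypes, imputed_dosages):
    """Squared Pearson correlation between true genotypes and imputed dosages."""
    n = len(true_genotypes)
    if n != len(imputed_dosages) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 samples")
    mean_t = sum(true_genotypes) / n
    mean_d = sum(imputed_dosages) / n
    cov = sum((t - mean_t) * (d - mean_d)
              for t, d in zip(true_genotypes, imputed_dosages))
    var_t = sum((t - mean_t) ** 2 for t in true_genotypes)
    var_d = sum((d - mean_d) ** 2 for d in imputed_dosages)
    if var_t == 0 or var_d == 0:
        # Correlation is undefined for a monomorphic variant;
        # returning 0.0 is an assumption of this sketch.
        return 0.0
    return (cov * cov) / (var_t * var_d)


def well_imputed_fraction(r2_values, threshold=0.8):
    """Fraction of variants whose accuracy meets the (assumed) cutoff."""
    if not r2_values:
        raise ValueError("r2_values must not be empty")
    return sum(r2 >= threshold for r2 in r2_values) / len(r2_values)


# Toy usage: one variant, six samples (made-up numbers, not study data).
truth = [0, 1, 2, 1, 0, 2]
dosage = [0.1, 0.9, 1.8, 1.2, 0.0, 2.0]
print(squared_correlation(truth, dosage))
print(well_imputed_fraction([0.9, 0.85, 0.5, 0.79]))
```

In practice such per-variant values would be computed within allele-frequency bins, which is how a figure like "allele frequency >5%" in the abstract would be stratified.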

Similar articles

Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations.

PLoS Genet. 2019 Dec 23;15(12):e1008500. doi: 10.1371/journal.pgen.1008500. eCollection 2019 Dec.

Improving imputation quality in Samoans through the integration of population-specific sequences into existing reference panels.

medRxiv. 2023 Oct 31:2023.10.31.23297835. doi: 10.1101/2023.10.31.23297835.

Rare variant genotype imputation with thousands of study-specific whole-genome sequences: implications for cost-effective study designs.

Eur J Hum Genet. 2015 Jul;23(7):975-83. doi: 10.1038/ejhg.2014.216. Epub 2014 Oct 8.

A diverse ancestrally-matched reference panel increases genotype imputation accuracy in a underrepresented population.

Sci Rep. 2023 Jul 31;13(1):12360. doi: 10.1038/s41598-023-39429-3.

Extent to which array genotyping and imputation with large reference panels approximate deep whole-genome sequencing.

Am J Hum Genet. 2022 Sep 1;109(9):1653-1666. doi: 10.1016/j.ajhg.2022.07.012. Epub 2022 Aug 17.

Comprehensive evaluation of imputation performance in African Americans.

J Hum Genet. 2012 Jul;57(7):411-21. doi: 10.1038/jhg.2012.43. Epub 2012 May 31.

NARD: whole-genome reference panel of 1779 Northeast Asians improves imputation accuracy of rare and low-frequency variants.

Genome Med. 2019 Oct 22;11(1):64. doi: 10.1186/s13073-019-0677-z.

Performance of genotype imputation for low frequency and rare variants from the 1000 genomes.

PLoS One. 2015 Jan 26;10(1):e0116487. doi: 10.1371/journal.pone.0116487. eCollection 2015.

Improving power of association tests using multiple sets of imputed genotypes from distributed reference panels.

Genet Epidemiol. 2017 Dec;41(8):744-755. doi: 10.1002/gepi.22067. Epub 2017 Sep 1.

Cited by

Toward a Kinh Vietnamese Reference Genome: Constructing a De Novo Genome Assembly Using Long-Read Sequencing and Optical Mapping.

Genes (Basel). 2025 Apr 29;16(5):536. doi: 10.3390/genes16050536.

Lessons from national biobank projects utilizing whole-genome sequencing for population-scale genomics.

Genomics Inform. 2025 Mar 6;23(1):8. doi: 10.1186/s44342-025-00040-9.

Effects of Genetic Risk and Lifestyle Habits on Gout: A Korean Cohort Study.

J Korean Med Sci. 2025 Jan 13;40(2):e1. doi: 10.3346/jkms.2025.40.e1.

Population-specific reference panel improves imputation quality for genome-wide association studies conducted on the Japanese population.

Commun Biol. 2024 Dec 19;7(1):1665. doi: 10.1038/s42003-024-07338-4.

Rare disease genomics and precision medicine.

Genomics Inform. 2024 Dec 3;22(1):28. doi: 10.1186/s44342-024-00032-1.

References

The sequences of 150,119 genomes in the UK Biobank.

Nature. 2022 Jul;607(7920):732-740. doi: 10.1038/s41586-022-04965-x. Epub 2022 Jul 20.

Meta-imputation: An efficient method to combine genotype data after imputation with multiple reference panels.

Am J Hum Genet. 2022 Jun 2;109(6):1007-1015. doi: 10.1016/j.ajhg.2022.04.002. Epub 2022 May 3.

The ChinaMAP reference panel for the accurate genotype imputation in Chinese populations.

Cell Res. 2021 Dec;31(12):1308-1310. doi: 10.1038/s41422-021-00564-z. Epub 2021 Sep 6.

Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.

Nature. 2021 Feb;590(7845):290-299. doi: 10.1038/s41586-021-03205-y. Epub 2021 Feb 10.

Korean Genome Project: 1094 Korean personal genomes with clinical information.

Sci Adv. 2020 May 27;6(22):eaaz7835. doi: 10.1126/sciadv.aaz7835. eCollection 2020 May.

The mutational constraint spectrum quantified from variation in 141,456 humans.

Nature. 2020 May;581(7809):434-443. doi: 10.1038/s41586-020-2308-7. Epub 2020 May 27.

The ChinaMAP analytics of deep whole genome sequences in 10,588 individuals.

Cell Res. 2020 Sep;30(9):717-731. doi: 10.1038/s41422-020-0322-9. Epub 2020 Apr 30.

The GenomeAsia 100K Project enables genetic discoveries across Asia.

Nature. 2019 Dec;576(7785):106-111. doi: 10.1038/s41586-019-1793-z. Epub 2019 Dec 4.

Accurate, scalable and integrative haplotype estimation.

Nat Commun. 2019 Nov 28;10(1):5436. doi: 10.1038/s41467-019-13225-y.

NARD: whole-genome reference panel of 1779 Northeast Asians improves imputation accuracy of rare and low-frequency variants.

Genome Med. 2019 Oct 22;11(1):64. doi: 10.1186/s13073-019-0677-z.
