
Improving accuracy of rare variant imputation with a two-step imputation approach.

Authors

Kreiner-Møller Eskil, Medina-Gomez Carolina, Uitterlinden André G, Rivadeneira Fernando, Estrada Karol

Affiliations

[1] Department of Internal Medicine, Erasmus University Medical Center, Genetic Laboratory of Internal Medicine, Rotterdam, The Netherlands [2] COPSAC; Copenhagen Prospective Studies on Asthma in Childhood; Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark [3] The Danish Pediatric Asthma Center; Copenhagen University Hospital, Ledreborg Alle 34, Gentofte, Denmark.

Department of Internal Medicine, Erasmus University Medical Center, Genetic Laboratory of Internal Medicine, Rotterdam, The Netherlands.

Publication information

Eur J Hum Genet. 2015 Mar;23(3):395-400. doi: 10.1038/ejhg.2014.91. Epub 2014 Jun 18.

DOI: 10.1038/ejhg.2014.91
PMID: 24939589
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4326719/
Abstract

Genotype imputation has been the pillar of the success of genome-wide association studies (GWAS) for identifying common variants associated with common diseases. However, most GWAS have been run using only 60 HapMap samples as reference for imputation, meaning less frequent and rare variants not being comprehensively scrutinized. Next-generation arrays ensuring sufficient coverage together with new reference panels, as the 1000 Genomes panel, are emerging to facilitate imputation of low frequent single-nucleotide polymorphisms (minor allele frequency (MAF) <5%). In this study, we present a two-step imputation approach improving the quality of the 1000 Genomes imputation by genotyping only a subset of samples to create a local reference population on a dense array with many low-frequency markers. In this approach, the study sample, genotyped with a first generation array, is imputed first to the local reference sample genotyped on a dense array and hereafter to the 1000 Genomes reference panel. We show that mean imputation quality, measured by the r² using this approach, increases by 28% for variants with a MAF between 1 and 5% as compared with direct imputation to 1000 Genomes reference. Similarly, the concordance rate between calls of imputed and true genotypes was found to be significantly higher for heterozygotes (P<1e-15) and rare homozygote calls (P<1e-15) in this low frequency range. The two-step approach in our setting improves imputation quality compared with traditional direct imputation noteworthy in the low-frequency spectrum and is a cost-effective strategy in large epidemiological studies.
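
The abstract reports two quality measures: the r² between imputed and true genotypes, and the concordance rate of imputed calls by genotype class. The Python sketch below shows one common way such measures can be computed for a single variant. It is an illustration only, not the authors' pipeline. It assumes genotypes are coded as minor-allele counts (0, 1, 2), that imputed values are dosages between 0 and 2, and that the best-guess call is the rounded dosage. The function names `imputation_r2` and `concordance_by_class` are hypothetical.

```python
import numpy as np


def imputation_r2(true_genotypes, imputed_dosages):
    """Squared Pearson correlation between true allele counts and imputed dosages."""
    t = np.asarray(true_genotypes, dtype=float)
    d = np.asarray(imputed_dosages, dtype=float)
    # The correlation is undefined when either vector has no variation.
    if t.std() == 0 or d.std() == 0:
        return float("nan")
    return float(np.corrcoef(t, d)[0, 1] ** 2)


def concordance_by_class(true_genotypes, imputed_dosages):
    """Fraction of samples whose rounded dosage equals the true genotype,
    reported separately for each true genotype class (0, 1, 2) that occurs."""
    t = np.asarray(true_genotypes, dtype=int)
    d = np.asarray(imputed_dosages, dtype=float)
    # Assumption: the best-guess call is the dosage rounded to the nearest count.
    calls = np.clip(np.rint(d), 0, 2).astype(int)
    return {
        g: float(np.mean(calls[t == g] == g))
        for g in (0, 1, 2)
        if np.any(t == g)
    }


if __name__ == "__main__":
    # Made-up example values for one variant in six samples.
    true_g = [0, 0, 1, 2, 1, 0]
    dosage = [0.1, 0.0, 0.9, 1.8, 0.4, 0.2]
    print(imputation_r2(true_g, dosage))
    print(concordance_by_class(true_g, dosage))
```

Reporting concordance separately for heterozygotes and rare homozygotes, as the abstract does, matters for low-frequency variants because overall concordance is dominated by the common homozygote class.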


Similar articles

1. Improving accuracy of rare variant imputation with a two-step imputation approach.
   Eur J Hum Genet. 2015 Mar;23(3):395-400. doi: 10.1038/ejhg.2014.91. Epub 2014 Jun 18.
2. Performance of genotype imputation for low frequency and rare variants from the 1000 genomes.
   PLoS One. 2015 Jan 26;10(1):e0116487. doi: 10.1371/journal.pone.0116487. eCollection 2015.
3. Effect of genome-wide genotyping and reference panels on rare variants imputation.
   J Genet Genomics. 2012 Oct 20;39(10):545-50. doi: 10.1016/j.jgg.2012.07.002. Epub 2012 Jul 24.
4. Comprehensive evaluation of imputation performance in African Americans.
   J Hum Genet. 2012 Jul;57(7):411-21. doi: 10.1038/jhg.2012.43. Epub 2012 May 31.
5. Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs.
   BMC Med Genomics. 2012 May 1;5:12. doi: 10.1186/1755-8794-5-12.
6. Assessment of genotype imputation performance using 1000 Genomes in African American studies.
   PLoS One. 2012;7(11):e50610. doi: 10.1371/journal.pone.0050610. Epub 2012 Nov 30.
7. Improving power of association tests using multiple sets of imputed genotypes from distributed reference panels.
   Genet Epidemiol. 2017 Dec;41(8):744-755. doi: 10.1002/gepi.22067. Epub 2017 Sep 1.
8. Evaluation of the accuracy of imputed sequence variant genotypes and their utility for causal variant detection in cattle.
   Genet Sel Evol. 2017 Feb 21;49(1):24. doi: 10.1186/s12711-017-0301-x.
9. Improved imputation accuracy of rare and low-frequency variants using population-specific high-coverage WGS-based imputation reference panel.
   Eur J Hum Genet. 2017 Jun;25(7):869-876. doi: 10.1038/ejhg.2017.51. Epub 2017 Apr 12.
10. Genotype imputation of Metabochip SNPs using a study-specific reference panel of ~4,000 haplotypes in African Americans from the Women's Health Initiative.
    Genet Epidemiol. 2012 Feb;36(2):107-17. doi: 10.1002/gepi.21603.

Cited by

1. Old vs. New Local Ancestry Inference in HCHS/SOL: A Comparative Study.
   bioRxiv. 2025 Feb 8:2025.02.04.636481. doi: 10.1101/2025.02.04.636481.
2. A cautionary tale of low-pass sequencing and imputation with respect to haplotype accuracy.
   Genet Sel Evol. 2024 Jan 12;56(1):6. doi: 10.1186/s12711-024-00875-w.
3. Polygenic risk score in comparison with C-reactive protein for predicting incident coronary heart disease.
   Atherosclerosis. 2023 Aug;379:117194. doi: 10.1016/j.atherosclerosis.2023.117194. Epub 2023 Jul 26.
4. Germline modifiers of the tumor immune microenvironment implicate drivers of cancer risk and immunotherapy response.
   Nat Commun. 2023 May 12;14(1):2744. doi: 10.1038/s41467-023-38271-5.
5. Imputation to whole-genome sequence and its use in genome-wide association studies for pork colour traits in crossbred and purebred pigs.
   Front Genet. 2022 Oct 11;13:1022681. doi: 10.3389/fgene.2022.1022681. eCollection 2022.
6. Comparison of Genotype Imputation for SNP Array and Low-Coverage Whole-Genome Sequencing Data.
   Front Genet. 2022 Jan 3;12:704118. doi: 10.3389/fgene.2021.704118. eCollection 2021.
7. Genetic and Metabolic Determinants of Atrial Fibrillation in a General Population Sample: The CHRIS Study.
   Biomolecules. 2021 Nov 9;11(11):1663. doi: 10.3390/biom11111663.
8. Impact of pre- and post-variant filtration strategies on imputation.
   Sci Rep. 2021 Mar 18;11(1):6214. doi: 10.1038/s41598-021-85333-z.
9. Medium-coverage DNA sequencing in the design of the genetic association study.
   Eur J Hum Genet. 2020 Oct;28(10):1459-1466. doi: 10.1038/s41431-020-0656-2. Epub 2020 May 26.
10. New Insights From Imputed Whole-Genome Sequence-Based Genome-Wide Association Analysis and Transcriptome Analysis: The Genetic Mechanisms Underlying Residual Feed Intake in Chickens.
    Front Genet. 2020 Apr 3;11:243. doi: 10.3389/fgene.2020.00243. eCollection 2020.
