Single versus multiple imputation for genotypic data.

Authors

Fridley Brooke L, McDonnell Shannon K, Rabe Kari G, Tang Rui, Biernacka Joanna M, Sinnwell Jason P, Rider David N, Goode Ellen L

Affiliation

Department of Health Sciences Research, Mayo Clinic, 200 First Street Southwest, Rochester, MN 55905, USA.

Publication information

BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S7. doi: 10.1186/1753-6561-3-s7-s7.

DOI:10.1186/1753-6561-3-s7-s7

PMID:20018064

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2795971/

Abstract

Due to the growing need to combine data across multiple studies and to impute untyped markers based on a reference sample, several analytical tools for imputation and analysis of missing genotypes have been developed. Current imputation methods rely on single imputation, which ignores the variation in estimation due to imputation. An alternative to single imputation is multiple imputation. In this paper, we assess the variation in imputation by completing both single and multiple imputations of genotypic data using MACH, a commonly used hidden Markov model imputation method. Using data from the North American Rheumatoid Arthritis Consortium genome-wide study, the use of single and multiple imputation was assessed in four regions of chromosome 1 with varying levels of linkage disequilibrium and association signals. Two scenarios for missing genotypic data were assessed: imputation of untyped markers and combination of genotypic data from two studies. This limited study involving four regions indicates that, contrary to expectations, multiple imputations may not be necessary.
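The abstract's point that single imputation "ignores the variation in estimation due to imputation" can be made concrete with a small sketch. The abstract does not say how results from multiple imputations were combined, so the code below uses Rubin's rules, the standard combining approach, purely as an assumption. The function name `pool_rubin` and the numbers are hypothetical and are not taken from the study.

```python
from statistics import mean, variance


def pool_rubin(estimates, variances):
    """Pool per-imputation results with Rubin's rules (an assumed method).

    estimates: one effect estimate (e.g. a log odds ratio) per imputed data set
    variances: the squared standard error of each of those estimates
    Returns (pooled estimate, within variance, between variance, total variance).
    """
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two imputations with matching variances")
    pooled = mean(estimates)            # average estimate across imputations
    within = mean(variances)            # average sampling variance
    between = variance(estimates)       # sample variance across imputations (divisor m - 1)
    total = within + (1 + 1 / m) * between
    return pooled, within, between, total


if __name__ == "__main__":
    # Hypothetical values for one SNP analysed in five imputed data sets.
    betas = [0.20, 0.30, 0.25, 0.35, 0.15]
    se_squared = [0.01] * 5
    pooled, within, between, total = pool_rubin(betas, se_squared)
    print(f"pooled={pooled:.4f} within={within:.4f} "
          f"between={between:.5f} total={total:.5f}")
```

A single imputation reports only the within-imputation variance; the between-imputation term is the part it leaves out. When that term is small relative to the within term, single and multiple imputation give similar inference, which is in line with the abstract's conclusion that multiple imputations may not be necessary.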
