SNP 基因分型置信分数在 IBD 推断中的整合。

Integration of SNP genotyping confidence scores in IBD inference.

机构信息

The Morris Kahn Laboratory of Human Genetics, Department of Virology and Developmental Genetics, NIBN, Ben Gurion University, Israel.

出版信息

Bioinformatics. 2011 Oct 15;27(20):2880-7. doi: 10.1093/bioinformatics/btr486. Epub 2011 Aug 23.

DOI:10.1093/bioinformatics/btr486

PMID:21862568

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3187655/

Abstract

MOTIVATION

High-throughput single nucleotide polymorphism (SNP) arrays have become the standard platform for linkage and association analyses. The high SNP density of these platforms allows high-resolution identification of ancestral recombination events even for distant relatives many generations apart. However, such inference is sensitive to marker mistyping and current error detection methods rely on the genotyping of additional close relatives. Genotyping algorithms provide a confidence score for each marker call that is currently not integrated in existing methods. There is a need for a model that incorporates this prior information within the standard identical by descent (IBD) and association analyses.

RESULTS

We propose a novel model that incorporates marker confidence scores within IBD methods based on the Lander-Green Hidden Markov Model. The novel parameter of this model is the joint distribution of confidence scores and error status per array. We estimate this probability distribution by applying a modified expectation-maximization (EM) procedure on data from nuclear families genotyped with Affymetrix 250K SNP arrays. The converged tables from two different genotyping algorithms are shown for a wide range of error rates. We demonstrate the efficacy of our method in refining the detection of IBD signals using nuclear pedigrees and distant relatives.

AVAILABILITY

Plinke, a new version of Plink with an extended pairwise IBD inference model allowing per marker error probabilities is freely available at: http://bioinfo.bgu.ac.il/bsu/software/plinke.

CONTACT

obirk@bgu.ac.il; markusb@bgu.ac.il

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

高通量单核苷酸多态性 (SNP) 阵列已成为连锁和关联分析的标准平台。这些平台的 SNP 密度很高，即使是相隔多代的远亲，也能高度精确地识别祖先重组事件。然而，这种推断对标记误配很敏感，目前的错误检测方法依赖于额外近亲的基因分型。基因分型算法为每个标记调用提供了置信度评分，但目前尚未集成到现有方法中。需要一种模型，在标准的同源（IBD）和关联分析中纳入这种先验信息。

结果

我们提出了一种新模型，该模型基于 Lander-Green 隐马尔可夫模型，将标记置信度评分纳入 IBD 方法中。该模型的新参数是每个数组的置信度评分和错误状态的联合分布。我们通过对用 Affymetrix 250K SNP 阵列基因分型的核家族数据应用修改后的期望最大化（EM）过程来估计这个概率分布。对于广泛的错误率，展示了来自两种不同基因分型算法的收敛表。我们展示了我们的方法在使用核家族和远亲细化 IBD 信号检测方面的有效性。

可用性

Plinke 是 Plink 的新版本，具有扩展的成对 IBD 推断模型，允许每个标记的错误概率，可在以下网址免费获得：http://bioinfo.bgu.ac.il/bsu/software/plinke。

联系方式

obirk@bgu.ac.il; markusb@bgu.ac.il

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

Integration of SNP genotyping confidence scores in IBD inference.

Bioinformatics. 2011 Oct 15;27(20):2880-7. doi: 10.1093/bioinformatics/btr486. Epub 2011 Aug 23.

Efficient identification of identical-by-descent status in pedigrees with many untyped individuals.

Bioinformatics. 2010 Jun 15;26(12):i191-8. doi: 10.1093/bioinformatics/btq222.

Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping.

Bioinformatics. 2010 Jun 15;26(12):i175-82. doi: 10.1093/bioinformatics/btq204.

A system for exact and approximate genetic linkage analysis of SNP data in large pedigrees.

Bioinformatics. 2013 Jan 15;29(2):197-205. doi: 10.1093/bioinformatics/bts658. Epub 2012 Nov 18.

Inference of relationships in population data using identity-by-descent and identity-by-state.

PLoS Genet. 2011 Sep;7(9):e1002287. doi: 10.1371/journal.pgen.1002287. Epub 2011 Sep 22.

Estimating genotyping error rates from Mendelian errors in SNP array genotypes and their impact on inference.

Genomics. 2007 Sep;90(3):291-6. doi: 10.1016/j.ygeno.2007.05.011. Epub 2007 Jun 27.

Linked region detection using high-density SNP genotype data via the minimum recombinant model of pedigree haplotype inference.

BMC Bioinformatics. 2009 Jul 15;10:216. doi: 10.1186/1471-2105-10-216.

A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays.

Bioinformatics. 2007 Jun 15;23(12):1459-67. doi: 10.1093/bioinformatics/btm131. Epub 2007 Apr 25.

Haplotype reconstruction in large pedigrees with untyped individuals through IBD inference.

J Comput Biol. 2011 Nov;18(11):1411-21. doi: 10.1089/cmb.2011.0167. Epub 2011 Sep 16.

Hot topic: performance of bovine high-density genotyping platforms in Holsteins and Jerseys.

J Dairy Sci. 2011 Dec;94(12):6116-21. doi: 10.3168/jds.2011-4764.

引用本文的文献

geck: trio-based comparative benchmarking of variant calls.

Bioinformatics. 2018 Oct 15;34(20):3488-3495. doi: 10.1093/bioinformatics/bty415.

Detection of Mendelian consistent genotyping errors in pedigrees.

Genet Epidemiol. 2014 May;38(4):291-9. doi: 10.1002/gepi.21806. Epub 2014 Apr 9.

Genome-wide patterns of identity-by-descent sharing in the French Canadian founder population.

Eur J Hum Genet. 2014 Jun;22(6):814-21. doi: 10.1038/ejhg.2013.227. Epub 2013 Oct 16.

Deciphering the fine-structure of tribal admixture in the Bedouin population using genomic data.

Heredity (Edinb). 2014 Feb;112(2):182-9. doi: 10.1038/hdy.2013.90. Epub 2013 Oct 2.

Isolated foveal hypoplasia with secondary nystagmus and low vision is associated with a homozygous SLC38A8 mutation.

Eur J Hum Genet. 2014 May;22(5):703-6. doi: 10.1038/ejhg.2013.212. Epub 2013 Sep 18.

Unlocking the bottleneck in forward genetics using whole-genome sequencing and identity by descent to isolate causative mutations.

PLoS Genet. 2013;9(1):e1003219. doi: 10.1371/journal.pgen.1003219. Epub 2013 Jan 31.

The role of large pedigrees in an era of high-throughput sequencing.

Hum Genet. 2012 Oct;131(10):1555-63. doi: 10.1007/s00439-012-1190-2. Epub 2012 Jun 20.

本文引用的文献

Pelizaeus-Merzbacher-like disease caused by AIMP1/p43 homozygous mutation.

Am J Hum Genet. 2010 Dec 10;87(6):820-8. doi: 10.1016/j.ajhg.2010.10.016. Epub 2010 Nov 18.

Estimating genome-wide IBD sharing from SNP data via an efficient hidden Markov model of LD with application to gene mapping.

Bioinformatics. 2010 Jun 15;26(12):i175-82. doi: 10.1093/bioinformatics/btq204.

High-resolution detection of identity by descent in unrelated individuals.

Am J Hum Genet. 2010 Apr 9;86(4):526-39. doi: 10.1016/j.ajhg.2010.02.021. Epub 2010 Mar 18.

OpenADAM: an open source genome-wide association data management system for Affymetrix SNP arrays.

BMC Genomics. 2008 Dec 31;9:636. doi: 10.1186/1471-2164-9-636.

Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs.

Nat Genet. 2008 Oct;40(10):1253-60. doi: 10.1038/ng.237. Epub 2008 Sep 7.

Genome-wide association studies for complex traits: consensus, uncertainty and challenges.

Nat Rev Genet. 2008 May;9(5):356-69. doi: 10.1038/nrg2344.

The IBD process along four chromosomes.

Theor Popul Biol. 2008 May;73(3):369-73. doi: 10.1016/j.tpb.2007.11.011. Epub 2007 Dec 31.

High-resolution mapping of crossovers reveals extensive variation in fine-scale recombination patterns among humans.

Science. 2008 Mar 7;319(5868):1395-8. doi: 10.1126/science.1151851. Epub 2008 Jan 31.

PLINK: a tool set for whole-genome association and population-based linkage analyses.

Am J Hum Genet. 2007 Sep;81(3):559-75. doi: 10.1086/519795. Epub 2007 Jul 25.

Estimating genotyping error rates from Mendelian errors in SNP array genotypes and their impact on inference.

Genomics. 2007 Sep;90(3):291-6. doi: 10.1016/j.ygeno.2007.05.011. Epub 2007 Jun 27.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

SNP 基因分型置信分数在 IBD 推断中的整合。

Integration of SNP genotyping confidence scores in IBD inference.

机构信息

The Morris Kahn Laboratory of Human Genetics, Department of Virology and Developmental Genetics, NIBN, Ben Gurion University, Israel.

出版信息

Bioinformatics. 2011 Oct 15;27(20):2880-7. doi: 10.1093/bioinformatics/btr486. Epub 2011 Aug 23.

DOI:10.1093/bioinformatics/btr486

PMID:21862568

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3187655/

Abstract

MOTIVATION

RESULTS

AVAILABILITY

Plinke, a new version of Plink with an extended pairwise IBD inference model allowing per marker error probabilities is freely available at: http://bioinfo.bgu.ac.il/bsu/software/plinke.

CONTACT

obirk@bgu.ac.il; markusb@bgu.ac.il

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

结果

可用性

Plinke 是 Plink 的新版本，具有扩展的成对 IBD 推断模型，允许每个标记的错误概率，可在以下网址免费获得：http://bioinfo.bgu.ac.il/bsu/software/plinke。

联系方式

obirk@bgu.ac.il; markusb@bgu.ac.il

补充信息

补充数据可在 Bioinformatics 在线获得。

SNP 基因分型置信分数在 IBD 推断中的整合。

Integration of SNP genotyping confidence scores in IBD inference.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

CONTACT

SUPPLEMENTARY INFORMATION

动机

结果

可用性

联系方式

补充信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

SNP 基因分型置信分数在 IBD 推断中的整合。

Integration of SNP genotyping confidence scores in IBD inference.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

CONTACT

SUPPLEMENTARY INFORMATION

动机

结果

可用性

联系方式

补充信息