高效精确的贝叶斯 SNP 基因型多倍体最大后验计算。

Efficient exact maximum a posteriori computation for bayesian SNP genotyping in polyploids.

机构信息

Department of Neurobiology, Harvard Medical School, Boston, Massachusetts, United States of America.

出版信息

PLoS One. 2012;7(2):e30906. doi: 10.1371/journal.pone.0030906. Epub 2012 Feb 17.

DOI:10.1371/journal.pone.0030906

PMID:22363513

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3281906/

Abstract

The problem of genotyping polyploids is extremely important for the creation of genetic maps and assembly of complex plant genomes. Despite its significance, polyploid genotyping still remains largely unsolved and suffers from a lack of statistical formality. In this paper a graphical bayesian model for SNP genotyping data is introduced. This model can infer genotypes even when the ploidy of the population is unknown. We also introduce an algorithm for finding the exact maximum a posteriori genotype configuration with this model. This algorithm is implemented in a freely available web-based software package SuperMASSA. We demonstrate the utility, efficiency, and flexibility of the model and algorithm by applying them to two different platforms, each of which is applied to a polyploid data set: Illumina GoldenGate data from potato and Sequenom MassARRAY data from sugarcane. Our method achieves state-of-the-art performance on both data sets and can be trivially adapted to use models that utilize prior information about any platform or species.

摘要

多倍体基因分型问题对于构建遗传图谱和组装复杂植物基因组至关重要。尽管其意义重大，但多倍体基因分型仍然在很大程度上尚未得到解决，并且缺乏统计形式。本文介绍了一种用于 SNP 基因分型数据的图形贝叶斯模型。该模型即使在未知群体倍性的情况下也可以推断基因型。我们还介绍了一种使用该模型找到精确最大后验基因型配置的算法。该算法在一个免费提供的基于网络的软件包 SuperMASSA 中实现。我们通过将其应用于两个不同的平台来证明模型和算法的实用性、效率和灵活性，每个平台都应用于一个多倍体数据集：来自马铃薯的 Illumina GoldenGate 数据和来自甘蔗的 Sequenom MassARRAY 数据。我们的方法在两个数据集上都达到了最先进的性能，并且可以轻而易举地适应使用任何平台或物种的先验信息的模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/331c/3281906/67bf75716c7f/pone.0030906.g001.jpg

相似文献

Efficient exact maximum a posteriori computation for bayesian SNP genotyping in polyploids.

PLoS One. 2012;7(2):e30906. doi: 10.1371/journal.pone.0030906. Epub 2012 Feb 17.

Quantitative SNP genotyping of polyploids with MassARRAY and other platforms.

Methods Mol Biol. 2015;1245:215-41. doi: 10.1007/978-1-4939-1966-6_17.

A fully automated pipeline for quantitative genotype calling from next generation sequencing data in autopolyploids.

BMC Bioinformatics. 2018 Nov 1;19(1):398. doi: 10.1186/s12859-018-2433-6.

UGbS-Flex, a novel bioinformatics pipeline for imputation-free SNP discovery in polyploids without a reference genome: finger millet as a case study.

BMC Plant Biol. 2018 Jun 15;18(1):117. doi: 10.1186/s12870-018-1316-3.

flopp: Extremely Fast Long-Read Polyploid Haplotype Phasing by Uniform Tree Partitioning.

J Comput Biol. 2022 Feb;29(2):195-211. doi: 10.1089/cmb.2021.0436. Epub 2022 Jan 17.

Role of NGS and SNP genotyping methods in sugarcane improvement programs.

Crit Rev Biotechnol. 2020 Sep;40(6):865-880. doi: 10.1080/07388551.2020.1765730. Epub 2020 Jun 7.

SNP genotyping allows an in-depth characterisation of the genome of sugarcane and other complex autopolyploids.

Sci Rep. 2013 Dec 2;3:3399. doi: 10.1038/srep03399.

Mining sequence variations in representative polyploid sugarcane germplasm accessions.

BMC Genomics. 2017 Aug 9;18(1):594. doi: 10.1186/s12864-017-3980-3.

Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT.

BMC Genomics. 2008 Jul 30;9:356. doi: 10.1186/1471-2164-9-356.

pSBVB: A Versatile Simulation Tool To Evaluate Genomic Selection in Polyploid Species.

G3 (Bethesda). 2019 Feb 7;9(2):327-334. doi: 10.1534/g3.118.200942.

引用本文的文献

Genetic linkage mapping in Megathyrsus maximus (Jacq.) with multiple dosage markers.

G3 (Bethesda). 2025 Sep 3;15(9). doi: 10.1093/g3journal/jkaf126.

Tests for segregation distortion in tetraploid F1 populations.

Theor Appl Genet. 2025 Jan 16;138(1):30. doi: 10.1007/s00122-025-04816-z.

Advances in genomic characterization of Urochloa humidicola: exploring polyploid inheritance and apomixis.

Theor Appl Genet. 2023 Nov 2;136(11):238. doi: 10.1007/s00122-023-04485-w.

Developing best practices for genotyping-by-sequencing analysis in the construction of linkage maps.

Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad092. Epub 2023 Oct 27.

Demographic history inference and the polyploid continuum.

Genetics. 2023 Aug 9;224(4). doi: 10.1093/genetics/iyad107.

Genetic Analysis of Potato Breeding Collection Using Single-Nucleotide Polymorphism (SNP) Markers.

Plants (Basel). 2023 May 6;12(9):1895. doi: 10.3390/plants12091895.

Smooth Descent: A ploidy-aware algorithm to improve linkage mapping in the presence of genotyping errors.

Front Genet. 2023 Mar 1;14:1049988. doi: 10.3389/fgene.2023.1049988. eCollection 2023.

Perspective for genomic-enabled prediction against black sigatoka disease and drought stress in polyploid species.

Front Plant Sci. 2022 Oct 28;13:953133. doi: 10.3389/fpls.2022.953133. eCollection 2022.

Development of a 135K SNP genotyping array for Actinidia arguta and its applications for genetic mapping and QTL analysis in kiwifruit.

Plant Biotechnol J. 2023 Feb;21(2):369-380. doi: 10.1111/pbi.13958. Epub 2022 Nov 29.

Analysis of genetic diversity and population structure among cultivated potato clones from Korea and global breeding programs.

Sci Rep. 2022 Jun 21;12(1):10462. doi: 10.1038/s41598-022-12874-2.

本文引用的文献

The detection and estimation of linkage in polyploids using single-dose restriction fragments.

Theor Appl Genet. 1992 Jan;83(3):294-300. doi: 10.1007/BF00224274.

Genotype calling in tetraploid species from bi-allelic marker data using mixture models.

BMC Bioinformatics. 2011 May 19;12:172. doi: 10.1186/1471-2105-12-172.

Genotype and SNP calling from next-generation sequencing data.

Nat Rev Genet. 2011 Jun;12(6):443-51. doi: 10.1038/nrg2986.

Efficient marginalization to compute protein posterior probabilities from shotgun mass spectrometry data.

J Proteome Res. 2010 Oct 1;9(10):5346-57. doi: 10.1021/pr100594k.

A pipeline for high throughput detection and mapping of SNPs from EST databases.

Mol Breed. 2010 Jun;26(1):65-75. doi: 10.1007/s11032-009-9377-5. Epub 2010 Jan 20.

Microcollinearity between autopolyploid sugarcane and diploid sorghum genomes.

BMC Genomics. 2010 Apr 23;11:261. doi: 10.1186/1471-2164-11-261.

Bayesian estimation of marker dosage in sugarcane and other autopolyploids.

Theor Appl Genet. 2010 May;120(8):1653-72. doi: 10.1007/s00122-010-1283-z. Epub 2010 Feb 25.

Qualitative and quantitative genotyping using single base primer extension coupled with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MassARRAY).

Methods Mol Biol. 2009;578:307-43. doi: 10.1007/978-1-60327-411-1_20.

Every genome sequence needs a good map.

Genome Res. 2009 Nov;19(11):1925-8. doi: 10.1101/gr.094557.109. Epub 2009 Jul 13.

Single nucleotide polymorphism genotyping in polyploid wheat with the Illumina GoldenGate assay.

Theor Appl Genet. 2009 Aug;119(3):507-17. doi: 10.1007/s00122-009-1059-5. Epub 2009 May 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

高效精确的贝叶斯 SNP 基因型多倍体最大后验计算。

Efficient exact maximum a posteriori computation for bayesian SNP genotyping in polyploids.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献