Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT.

Author information

Neigenfind Jost, Gyetvai Gabor, Basekow Rico, Diehl Svenja, Achenbach Ute, Gebhardt Christiane, Selbig Joachim, Kersten Birgit

Affiliation

Bioinformatics, GabiPD team, Max Planck Institute of Molecular Plant Physiology, 14424 Potsdam-Golm, Germany.

Publication information

BMC Genomics. 2008 Jul 30;9:356. doi: 10.1186/1471-2164-9-356.

Abstract

BACKGROUND

Haplotype inference based on unphased SNP markers is an important task in population genetics. Although there are different approaches to the inference of haplotypes in diploid species, the existing software is not suitable for inferring haplotypes from unphased SNP data in polyploid species, such as the cultivated potato (Solanum tuberosum). Potato species are tetraploid and highly heterozygous.

RESULTS

Here we present the software SATlotyper which is able to handle polyploid and polyallelic data. SATlotyper uses the Boolean satisfiability problem to formulate Haplotype Inference by Pure Parsimony. The software excludes existing haplotype inferences, thus allowing for calculation of alternative inferences. As it is not known which of the multiple haplotype inferences are best supported by the given unphased data set, we use a bootstrapping procedure that allows for scoring of alternative inferences. Finally, by means of the bootstrapping scores, it is possible to optimise the phased genotypes belonging to a given haplotype inference. The program is evaluated with simulated and experimental SNP data generated for heterozygous tetraploid populations of potato. We show that, instead of taking the first haplotype inference reported by the program, we can significantly improve the quality of the final result by applying additional methods that include scoring of the alternative haplotype inferences and genotype optimisation. For a sub-population of nineteen individuals, the predicted results computed by SATlotyper were directly compared with results obtained by experimental haplotype inference via sequencing of cloned amplicons. Prediction and experiment gave similar results regarding the inferred haplotypes and phased genotypes.
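The workflow described above (find a smallest haplotype set, list alternative inferences of the same size, then score them by resampling) can be illustrated on a toy scale. The Python sketch below is not SATlotyper's implementation and does not use a SAT solver: it swaps in exhaustive enumeration, which is only feasible for a handful of individuals and SNPs. All function names, the data layout (one tuple of alleles per SNP site), and the scoring rule (mean bootstrap frequency of the haplotypes in a set) are assumptions made for illustration.

```python
import random
from collections import Counter
from itertools import combinations, permutations, product


def phasings(genotype):
    """Return all distinct phasings of one unphased genotype.

    genotype: tuple of sites; each site is a tuple of `ploidy` alleles.
    Each phasing is a sorted tuple of `ploidy` haplotypes.
    """
    first, rest = genotype[0], genotype[1:]
    # Chromosome order is arbitrary, so fix it at the first site
    # and permute the alleles at every other site.
    orders = [set(permutations(site)) for site in rest]
    result = set()
    for combo in product(*orders):
        columns = (tuple(first),) + combo
        result.add(tuple(sorted(zip(*columns))))
    return result


def minimal_haplotype_sets(genotypes):
    """Brute-force pure parsimony: all smallest haplotype sets that
    explain every genotype (stand-in for the SAT-based search)."""
    options = [phasings(g) for g in genotypes]
    candidates = sorted({h for opts in options for p in opts for h in p})
    for size in range(1, len(candidates) + 1):
        found = []
        for subset in combinations(candidates, size):
            chosen = set(subset)
            # Every individual needs at least one phasing drawn from `chosen`.
            if all(any(set(p) <= chosen for p in opts) for opts in options):
                found.append(subset)
        if found:
            return found  # all alternative inferences of minimal size
    return []


def bootstrap_support(genotypes, replicates=50, seed=0):
    """Fraction of bootstrap replicates in which each haplotype occurs in
    at least one minimal set (assumed, simplified scoring rule)."""
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(replicates):
        sample = [rng.choice(genotypes) for _ in genotypes]
        counts.update({h for hs in minimal_haplotype_sets(sample) for h in hs})
    return {h: counts[h] / replicates for h in counts}


def score(hap_set, support):
    """Mean bootstrap support of the haplotypes in one inference."""
    return sum(support.get(h, 0.0) for h in hap_set) / len(hap_set)


# Two hypothetical tetraploid individuals typed at two SNP sites.
genotypes = [
    (("A", "A", "G", "G"), ("C", "C", "T", "T")),
    (("A", "A", "A", "A"), ("C", "C", "T", "T")),
]
alternatives = minimal_haplotype_sets(genotypes)
support = bootstrap_support(genotypes)
for hap_set in alternatives:
    print(["".join(h) for h in hap_set], round(score(hap_set, support), 2))
```

For these two toy genotypes the search finds two alternative inferences of three haplotypes each, which is the situation the bootstrapping step is meant to resolve. A real data set would need a SAT encoding, as the abstract describes, because the number of candidate subsets grows far too quickly for enumeration.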

CONCLUSION

Our results suggest that Haplotype Inference by Pure Parsimony can be solved efficiently by the SAT approach, even for unphased SNP data sets from heterozygous polyploids. SATlotyper is freeware and is distributed as a Java JAR file. The software can be downloaded from the webpage of the GABI Primary Database at http://www.gabipd.org/projects/satlotyper/. The application of SATlotyper will provide haplotype information, which can be used in haplotype association mapping studies of polyploid plants.
