Suppr超能文献

Approximation properties of haplotype tagging.

作者信息

Vinterbo Staal A, Dreiseitl Stephan, Ohno-Machado Lucila

机构信息

Decision Systems Group, Brigham and Women's Hospital, Boston, MA 02115, USA.

出版信息

BMC Bioinformatics. 2006 Jan 9;7:8. doi: 10.1186/1471-2105-7-8.

Abstract

BACKGROUND

Single nucleotide polymorphisms (SNPs) are locations at which the genomic sequences of population members differ. Since these differences are known to follow patterns, disease association studies are facilitated by identifying SNPs that allow the unique identification of such patterns. This process, known as haplotype tagging, is formulated as a combinatorial optimization problem and analyzed in terms of complexity and approximation properties.

RESULTS

It is shown that the tagging problem is NP-hard but approximable within 1 + ln((n2 - n)/2) for n haplotypes but not approximable within (1-epsilon) ln(n/2) for any epsilon > 0 unless NP subset DTIME(n(log log n)). A simple, very easily implementable algorithm that exhibits the above upper bound on solution quality is presented. This algorithm has running time O(np/2(2m-p+1)) < or = O(m(n2-n)/2) where p < or = min(n, m) for n haplotypes of size m. As we show that the approximation bound is asymptotically tight, the algorithm presented is optimal with respect to this asymptotic bound.

CONCLUSION

The haplotype tagging problem is hard, but approachable with a fast, practical, and surprisingly simple algorithm that cannot be significantly improved upon on a single processor machine. Hence, significant improvement in computational efforts expended can only be expected if the computational effort is distributed and done in parallel.

摘要

相似文献

1
Approximation properties of haplotype tagging.
BMC Bioinformatics. 2006 Jan 9;7:8. doi: 10.1186/1471-2105-7-8.
2
htSNPer1.0: software for haplotype block partition and htSNPs selection.
BMC Bioinformatics. 2005 Mar 1;6:38. doi: 10.1186/1471-2105-6-38.
3
An approximation algorithm for haplotype inference by maximum parsimony.
J Comput Biol. 2005 Dec;12(10):1261-74. doi: 10.1089/cmb.2005.12.1261.
4
Selecting additional tag SNPs for tolerating missing data in genotyping.
BMC Bioinformatics. 2005 Nov 1;6:263. doi: 10.1186/1471-2105-6-263.
6
Fast and cheap genome wide haplotype construction via optical mapping.
Pac Symp Biocomput. 2005:385-96. doi: 10.1142/9789812702456_0037.
7
Tag SNP selection via a genetic algorithm.
J Biomed Inform. 2010 Oct;43(5):800-4. doi: 10.1016/j.jbi.2010.05.011. Epub 2010 May 28.
8
On the Minimum Error Correction Problem for Haplotype Assembly in Diploid and Polyploid Genomes.
J Comput Biol. 2016 Sep;23(9):718-36. doi: 10.1089/cmb.2015.0220. Epub 2016 Jun 9.
9
A (1.5 + epsilon)-approximation algorithm for unsigned translocation distance.
IEEE/ACM Trans Comput Biol Bioinform. 2008 Jan-Mar;5(1):56-66. doi: 10.1109/TCBB.2007.70216.
10
A double classification tree search algorithm for index SNP selection.
BMC Bioinformatics. 2004 Jul 6;5:89. doi: 10.1186/1471-2105-5-89.

本文引用的文献

1
Mapping complex disease loci in whole-genome association studies.
Nature. 2004 May 27;429(6990):446-52. doi: 10.1038/nature02623.
2
Haplotype tagging single nucleotide polymorphisms and association studies.
Hum Hered. 2003;56(1-3):48-55. doi: 10.1159/000073732.
3
Some notes on the combinatorial properties of haplotype tagging.
Math Biosci. 2003 Oct;185(2):205-16. doi: 10.1016/s0025-5564(03)00089-0.
4
Minimal haplotype tagging.
Proc Natl Acad Sci U S A. 2003 Aug 19;100(17):9900-5. doi: 10.1073/pnas.1633613100. Epub 2003 Aug 4.
5
Efficient selective screening of haplotype tag SNPs.
Bioinformatics. 2003 Jan 22;19(2):287-8. doi: 10.1093/bioinformatics/19.2.287.
6
Haplotype tagging for the identification of common disease genes.
Nat Genet. 2001 Oct;29(2):233-7. doi: 10.1038/ng1001-233.
7
Population genomics: linkage disequilibrium holds the key.
Curr Biol. 2001 Jul 24;11(14):R576-9. doi: 10.1016/s0960-9822(01)00348-7.
8
Linkage disequilibrium in the human genome.
Nature. 2001 May 10;411(6834):199-204. doi: 10.1038/35075590.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验