利用基因型数据进行单倍型块划分和标签单核苷酸多态性选择及其在关联研究中的应用。

Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies.

作者信息

Zhang Kui, Qin Zhaohui S, Liu Jun S, Chen Ting, Waterman Michael S, Sun Fengzhu

机构信息

Molecular and Computational Biology Program, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089-1113, USA.

出版信息

Genome Res. 2004 May;14(5):908-16. doi: 10.1101/gr.1837404. Epub 2004 Apr 12.

DOI:10.1101/gr.1837404

PMID:15078859

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC479119/

Abstract

Recent studies have revealed that linkage disequilibrium (LD) patterns vary across the human genome with some regions of high LD interspersed by regions of low LD. A small fraction of SNPs (tag SNPs) is sufficient to capture most of the haplotype structure of the human genome. In this paper, we develop a method to partition haplotypes into blocks and to identify tag SNPs based on genotype data by combining a dynamic programming algorithm for haplotype block partitioning and tag SNP selection based on haplotype data with a variation of the expectation maximization (EM) algorithm for haplotype inference. We assess the effects of using either haplotype or genotype data in haplotype block identification and tag SNP selection as a function of several factors, including sample size, density or number of SNPs studied, allele frequencies, fraction of missing data, and genotyping error rate, using extensive simulations. We find that a modest number of haplotype or genotype samples will result in consistent block partitions and tag SNP selection. The power of association studies based on tag SNPs using genotype data is similar to that using haplotype data.

摘要

最近的研究表明，连锁不平衡（LD）模式在人类基因组中各不相同，一些高LD区域与低LD区域相间分布。一小部分单核苷酸多态性（标签SNP）足以捕获人类基因组的大部分单倍型结构。在本文中，我们开发了一种方法，通过将基于单倍型数据的单倍型块划分和标签SNP选择的动态规划算法与用于单倍型推断的期望最大化（EM）算法的变体相结合，根据基因型数据将单倍型划分为块并识别标签SNP。我们使用广泛的模拟，评估了在单倍型块识别和标签SNP选择中使用单倍型或基因型数据的效果，该效果是几个因素的函数，包括样本大小、研究的SNP的密度或数量、等位基因频率、缺失数据的比例以及基因分型错误率。我们发现，适量数量的单倍型或基因型样本将导致一致的块划分和标签SNP选择。基于标签SNP使用基因型数据的关联研究的效能与使用单倍型数据的相似。

相似文献

Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies.

Genome Res. 2004 May;14(5):908-16. doi: 10.1101/gr.1837404. Epub 2004 Apr 12.

Haplotype block structure and its applications to association studies: power and study designs.

Am J Hum Genet. 2002 Dec;71(6):1386-94. doi: 10.1086/344780. Epub 2002 Nov 18.

HapBlock: haplotype block partitioning and tag SNP selection software using a set of dynamic programming algorithms.

Bioinformatics. 2005 Jan 1;21(1):131-4. doi: 10.1093/bioinformatics/bth482. Epub 2004 Aug 27.

Haplotype and linkage disequilibrium architecture for human cancer-associated genes.

Genome Res. 2002 Dec;12(12):1846-53. doi: 10.1101/gr.483802.

Tag SNP selection for association studies.

Genet Epidemiol. 2004 Dec;27(4):365-74. doi: 10.1002/gepi.20028.

Inference of missing SNPs and information quantity measurements for haplotype blocks.

Bioinformatics. 2005 May 1;21(9):2001-7. doi: 10.1093/bioinformatics/bti261. Epub 2005 Feb 4.

Efficient haplotype block partitioning and tag SNP selection algorithms under various constraints.

Biomed Res Int. 2013;2013:984014. doi: 10.1155/2013/984014. Epub 2013 Nov 11.

The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests.

Hum Hered. 2006;61(1):31-44. doi: 10.1159/000092141. Epub 2006 Mar 23.

Haplotype structure, LD blocks, and uneven recombination within the LRP5 gene.

Genome Res. 2003 May;13(5):845-55. doi: 10.1101/gr.563703.

Selecting additional tag SNPs for tolerating missing data in genotyping.

BMC Bioinformatics. 2005 Nov 1;6:263. doi: 10.1186/1471-2105-6-263.

引用本文的文献

Clinical and Metabolic Signatures of - Haplotypes in a General Population Sample.

Kidney Int Rep. 2025 Feb 25;10(5):1495-1508. doi: 10.1016/j.ekir.2025.02.018. eCollection 2025 May.

Genome-Wide Association Mapping for Yield and Yield-Related Traits in Rice ( L.) Using SNPs Markers.

Genes (Basel). 2023 May 15;14(5):1089. doi: 10.3390/genes14051089.

Personality traits as mediators in the association between rs12415800 polymorphism and depressive symptoms among Chinese college students.

Front Psychiatry. 2023 Apr 14;14:1104664. doi: 10.3389/fpsyt.2023.1104664. eCollection 2023.

Polymorphisms and gene expression of Notch4 in pulmonary tuberculosis.

Front Immunol. 2023 Feb 2;14:1081483. doi: 10.3389/fimmu.2023.1081483. eCollection 2023.

Informative SNP Selection Based on a Fuzzy Clustering and Improved Binary Particle Swarm Optimization Algorithm.

Comput Math Methods Med. 2022 Jun 16;2022:3837579. doi: 10.1155/2022/3837579. eCollection 2022.

Globally Rare Variants With Founder Haplotypes in the South African Population: Implications for Point-of-Care Testing Based on a Single-Institution Next-Generation Sequencing Study.

Front Oncol. 2021 Feb 12;10:619469. doi: 10.3389/fonc.2020.619469. eCollection 2020.

Upscaling Statistical Patterns from Reduced Storage in Social and Life Science Big Datasets.

Entropy (Basel). 2020 Sep 26;22(10):1084. doi: 10.3390/e22101084.

Genetic association study of prolylcarboxypeptidase polymorphisms with susceptibility to essential hypertension in the Yi minority of China: A case-control study based on an isolated population.

J Renin Angiotensin Aldosterone Syst. 2020 Apr-Jun;21(2):1470320320919586. doi: 10.1177/1470320320919586.

Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset.

J Adv Res. 2019 Jan 18;18:113-126. doi: 10.1016/j.jare.2019.01.006. eCollection 2019 Jul.

Comparative study for haplotype block partitioning methods - Evidence from chromosome 6 of the North American Rheumatoid Arthritis Consortium (NARAC) dataset.

PLoS One. 2018 Dec 31;13(12):e0209603. doi: 10.1371/journal.pone.0209603. eCollection 2018.

本文引用的文献

The Interaction of Selection and Linkage. I. General Considerations; Heterotic Models.

Genetics. 1964 Jan;49(1):49-67. doi: 10.1093/genetics/49.1.49.

Haplotype tagging single nucleotide polymorphisms and association studies.

Hum Hered. 2003;56(1-3):48-55. doi: 10.1159/000073732.

Haplotype blocks and linkage disequilibrium in the human genome.

Nat Rev Genet. 2003 Aug;4(8):587-97. doi: 10.1038/nrg1123.

Finding haplotype block boundaries by using the minimum-description-length principle.

Am J Hum Genet. 2003 Aug;73(2):336-54. doi: 10.1086/377106. Epub 2003 Jul 11.

Haplotype block partition with limited resources and applications to human chromosome 21 haplotype data.

Am J Hum Genet. 2003 Jul;73(1):63-73. doi: 10.1086/376437. Epub 2003 Jun 10.

An MDL method for finding haplotype blocks and for estimating the strength of haplotype block boundaries.

Pac Symp Biocomput. 2003:502-13. doi: 10.1142/9789812776303_0047.

Chromosome-wide distribution of haplotype blocks and the role of recombination hot spots.

Nat Genet. 2003 Mar;33(3):382-7. doi: 10.1038/ng1100. Epub 2003 Feb 18.

On the use of DNA pooling to estimate haplotype frequencies.

Genet Epidemiol. 2003 Jan;24(1):74-82. doi: 10.1002/gepi.10195.

Partition-ligation-expectation-maximization algorithm for haplotype inference with single-nucleotide polymorphisms.

Am J Hum Genet. 2002 Nov;71(5):1242-7. doi: 10.1086/344207.

Haplotype block structure and its applications to association studies: power and study designs.

Am J Hum Genet. 2002 Dec;71(6):1386-94. doi: 10.1086/344780. Epub 2002 Nov 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用基因型数据进行单倍型块划分和标签单核苷酸多态性选择及其在关联研究中的应用。

Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies.

作者信息

Zhang Kui, Qin Zhaohui S, Liu Jun S, Chen Ting, Waterman Michael S, Sun Fengzhu

机构信息

Molecular and Computational Biology Program, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089-1113, USA.

出版信息

Genome Res. 2004 May;14(5):908-16. doi: 10.1101/gr.1837404. Epub 2004 Apr 12.

DOI:10.1101/gr.1837404

PMID:15078859

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC479119/

Abstract

摘要

利用基因型数据进行单倍型块划分和标签单核苷酸多态性选择及其在关联研究中的应用。

Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

利用基因型数据进行单倍型块划分和标签单核苷酸多态性选择及其在关联研究中的应用。

Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies.

作者信息

机构信息

出版信息