Suppr超能文献

使用可变大小滑动窗口策略对单倍型进行详尽分析时的功效提升:关联映射策略的比较。

Gains in power for exhaustive analyses of haplotypes using variable-sized sliding window strategy: a comparison of association-mapping strategies.

作者信息

Guo Yanfang, Li Jian, Bonham Aaron J, Wang Yuping, Deng Hongwen

机构信息

The Key Laboratory of Biomedical Information Engineering of Ministry of Education and Institute of Molecular Genetics, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, PR China.

出版信息

Eur J Hum Genet. 2009 Jun;17(6):785-92. doi: 10.1038/ejhg.2008.244. Epub 2008 Dec 17.

Abstract

Linkage disequilibrium (LD)-based association mapping is often performed by analyzing either individual SNPs or block-based multi-SNP haplotypes. Sliding windows of several fixed sizes (in terms of SNP numbers) were also applied to a few simulated or real data sets. In comparison, exhaustively testing based on variable-sized sliding windows (VSW) of all possible sizes of SNPs over a genomic region has the best chance to capture the optimum markers (single SNPs or haplotypes) that are most significantly associated with the traits under study. However, the cost is the increased number of multiple tests and computation. Here, a strategy of VSW of all possible sizes is proposed and its power is examined, in comparison with those using only haplotype blocks (BLK) or single SNP loci (SGL) tests. Critical values for statistical significance testing that account for multiple testing are simulated. We demonstrated that, over a wide range of parameters simulated, VSW increased power for the detection of disease variants by approximately 1-15% over the BLK and SGL approaches. The improved performance was more significant in regions with high recombination rates. In an empirical data set, VSW obtained the most significant signal and identified the LRP5 gene as strongly associated with osteoporosis. With the use of computational techniques such as parallel algorithms and clustering computing, it is feasible to apply VSW to large genomic regions or those regions preliminarily identified by traditional SGL/BLK methods.

摘要

基于连锁不平衡(LD)的关联作图通常通过分析单个单核苷酸多态性(SNP)或基于块的多SNP单倍型来进行。几种固定大小(以SNP数量计)的滑动窗口也被应用于一些模拟或真实数据集。相比之下,在基因组区域上对所有可能大小的SNP进行基于可变大小滑动窗口(VSW)的详尽测试,最有可能捕获与所研究性状最显著相关的最佳标记(单个SNP或单倍型)。然而,代价是多重检验和计算的数量增加。在此,我们提出了一种针对所有可能大小的VSW策略,并与仅使用单倍型块(BLK)或单个SNP位点(SGL)检验的策略相比,检验了其效能。模拟了考虑多重检验的统计显著性检验的临界值。我们证明,在广泛的模拟参数范围内,与BLK和SGL方法相比,VSW将疾病变异检测的效能提高了约1% - 15%。在重组率高的区域,性能的提升更为显著。在一个实证数据集中,VSW获得了最显著的信号,并确定低密度脂蛋白受体相关蛋白5(LRP5)基因与骨质疏松症密切相关。通过使用并行算法和聚类计算等计算技术,将VSW应用于大型基因组区域或那些通过传统SGL/BLK方法初步鉴定的区域是可行的。

相似文献

3
Detecting genome wide haplotype sharing using SNP or microsatellite haplotype data.
Hum Genet. 2006 Mar;119(1-2):38-50. doi: 10.1007/s00439-005-0114-9. Epub 2005 Dec 14.
4
Performance of a blockwise approach in variable selection using linkage disequilibrium information.
BMC Bioinformatics. 2015 May 8;16:148. doi: 10.1186/s12859-015-0556-6.
5
HapBoost: a fast approach to boosting haplotype association analyses in genome-wide association studies.
IEEE/ACM Trans Comput Biol Bioinform. 2013 Jan-Feb;10(1):207-12. doi: 10.1109/TCBB.2013.6.
8
Haplotype-based quantitative trait mapping using a clustering algorithm.
BMC Bioinformatics. 2006 May 18;7:258. doi: 10.1186/1471-2105-7-258.
9
Haplotype block structure and its applications to association studies: power and study designs.
Am J Hum Genet. 2002 Dec;71(6):1386-94. doi: 10.1086/344780. Epub 2002 Nov 18.

引用本文的文献

3
Analysis of dog breed diversity using a composite selection index.
Sci Rep. 2023 Jan 30;13(1):1674. doi: 10.1038/s41598-023-28826-3.
4
Haplotype-Based Single-Step GWAS for Yearling Temperament in American Angus Cattle.
Genes (Basel). 2021 Dec 22;13(1):17. doi: 10.3390/genes13010017.
5
A Genome-Wide Association Study on Feed Efficiency Related Traits in Landrace Pigs.
Front Genet. 2020 Jul 3;11:692. doi: 10.3389/fgene.2020.00692. eCollection 2020.
7
Exploring effective approaches for haplotype block phasing.
BMC Bioinformatics. 2019 Oct 30;20(1):540. doi: 10.1186/s12859-019-3095-8.
10
Comparison of haplotype-based statistical tests for disease association with rare and common variants.
Brief Bioinform. 2016 Jul;17(4):657-71. doi: 10.1093/bib/bbv072. Epub 2015 Sep 2.

本文引用的文献

1
PLINK: a tool set for whole-genome association and population-based linkage analyses.
Am J Hum Genet. 2007 Sep;81(3):559-75. doi: 10.1086/519795. Epub 2007 Jul 25.
2
Enriching the analysis of genomewide association studies with hierarchical modeling.
Am J Hum Genet. 2007 Aug;81(2):397-404. doi: 10.1086/519794. Epub 2007 Jun 26.
5
Significant variation in haplotype block structure but conservation in tagSNP patterns among global populations.
Eur J Hum Genet. 2007 Mar;15(3):302-12. doi: 10.1038/sj.ejhg.5201751. Epub 2007 Jan 3.
6
Fine mapping of the MAPT locus using quantitative trait analysis identifies possible causal variants in Alzheimer's disease.
Mol Psychiatry. 2007 May;12(5):510-7. doi: 10.1038/sj.mp.4001935. Epub 2006 Dec 19.
7
Test for interaction between two unlinked loci.
Am J Hum Genet. 2006 Nov;79(5):831-45. doi: 10.1086/508571. Epub 2006 Sep 21.
8
A tutorial on statistical methods for population association studies.
Nat Rev Genet. 2006 Oct;7(10):781-91. doi: 10.1038/nrg1916.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验