Suppr超能文献

一种用于识别全基因组关联研究染色体模式的扫描统计量的快速实现方法。

A Fast Implementation of a Scan Statistic for Identifying Chromosomal Patterns of Genome Wide Association Studies.

作者信息

Sun Yan V, Jacobsen Douglas M, Turner Stephen T, Boerwinkle Eric, Kardia Sharon L R

机构信息

Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, Michigan.

出版信息

Comput Stat Data Anal. 2009 Mar 15;53(5):1794-1801. doi: 10.1016/j.csda.2008.04.013.

Abstract

In order to take into account the complex genomic distribution of SNP variations when identifying chromosomal regions with significant SNP effects, a single nucleotide polymorphism (SNP) association scan statistic was developed. To address the computational needs of genome wide association (GWA) studies, a fast Java application, which combines single-locus SNP tests and a scan statistic for identifying chromosomal regions with significant clusters of significant SNP effects, was developed and implemented. To illustrate this application, SNP associations were analyzed in a pharmacogenomic study of the blood pressure lowering effect of thiazide-diuretics (N=195) using the Affymetrix Human Mapping 100K Set. 55,335 tagSNPs (pair-wise linkage disequilibrium R(2)<0.5) were selected to reduce the frequency correlation between SNPs. A typical workstation can complete the whole genome scan including 10,000 permutation tests within 3 hours. The most significant regions locate on chromosome 3, 6, 13 and 16, two of which contain candidate genes that may be involved in the underlying drug response mechanism. The computational performance of ChromoScan-GWA and its scalability were tested with up to 1,000,000 SNPs and up to 4,000 subjects. Using 10,000 permutations, the computation time grew linearly in these datasets. This scan statistic application provides a robust statistical and computational foundation for identifying genomic regions associated with disease and provides a method to compare GWA results even across different platforms.

摘要

为了在识别具有显著单核苷酸多态性(SNP)效应的染色体区域时考虑SNP变异的复杂基因组分布,开发了一种单核苷酸多态性(SNP)关联扫描统计量。为满足全基因组关联(GWA)研究的计算需求,开发并实现了一个快速Java应用程序,该程序结合了单基因座SNP测试和一种扫描统计量,用于识别具有显著SNP效应簇的染色体区域。为说明此应用程序,在一项使用Affymetrix Human Mapping 100K Set进行的噻嗪类利尿剂降压效果的药物基因组学研究(N = 195)中分析了SNP关联。选择了55,335个标签SNP(成对连锁不平衡R(2)<0.5)以降低SNP之间的频率相关性。一台典型的工作站可以在3小时内完成包括10,000次置换检验的全基因组扫描。最显著的区域位于3号、6号、13号和16号染色体上,其中两个区域包含可能参与潜在药物反应机制的候选基因。使用多达1,000,000个SNP和多达4,000名受试者对ChromoScan-GWA的计算性能及其可扩展性进行了测试。使用10,000次置换,在这些数据集中计算时间呈线性增长。这种扫描统计量应用程序为识别与疾病相关的基因组区域提供了强大的统计和计算基础,并提供了一种即使在不同平台之间比较GWA结果的方法。

相似文献

1
A Fast Implementation of a Scan Statistic for Identifying Chromosomal Patterns of Genome Wide Association Studies.
Comput Stat Data Anal. 2009 Mar 15;53(5):1794-1801. doi: 10.1016/j.csda.2008.04.013.
2
A scan statistic for identifying chromosomal patterns of SNP association.
Genet Epidemiol. 2006 Nov;30(7):627-35. doi: 10.1002/gepi.20173.
3
ChromoScan: a scan statistic application for identifying chromosomal regions in genomic studies.
Bioinformatics. 2006 Dec 1;22(23):2945-7. doi: 10.1093/bioinformatics/btl503. Epub 2006 Oct 10.
4
ParallABEL: an R library for generalized parallelization of genome-wide association studies.
BMC Bioinformatics. 2010 Apr 29;11:217. doi: 10.1186/1471-2105-11-217.
6
"Replicated" genome wide association for dependence on illegal substances: genomic regions identified by overlapping clusters of nominally positive SNPs.
Am J Med Genet B Neuropsychiatr Genet. 2011 Mar;156(2):125-38. doi: 10.1002/ajmg.b.31143. Epub 2010 Dec 16.
7
Uncovering networks from genome-wide association studies via circular genomic permutation.
G3 (Bethesda). 2012 Sep;2(9):1067-75. doi: 10.1534/g3.112.002618. Epub 2012 Sep 1.
8
RS-SNP: a random-set method for genome-wide association studies.
BMC Genomics. 2011 Mar 30;12:166. doi: 10.1186/1471-2164-12-166.
9
Genome-wide "pleiotropy scan" identifies HNF1A region as a novel pancreatic cancer susceptibility locus.
Cancer Res. 2011 Jul 1;71(13):4352-8. doi: 10.1158/0008-5472.CAN-11-0124. Epub 2011 Apr 15.
10
Fine mapping by composite genome-wide association analysis.
Genet Res (Camb). 2017 Jun 6;99:e4. doi: 10.1017/S0016672317000027.

引用本文的文献

本文引用的文献

2
Family-based association tests for genomewide association scans.
Am J Hum Genet. 2007 Nov;81(5):913-26. doi: 10.1086/521580. Epub 2007 Sep 18.
3
A new multipoint method for genome-wide association studies by imputation of genotypes.
Nat Genet. 2007 Jul;39(7):906-13. doi: 10.1038/ng2088. Epub 2007 Jun 17.
5
A common allele on chromosome 9 associated with coronary heart disease.
Science. 2007 Jun 8;316(5830):1488-91. doi: 10.1126/science.1142447. Epub 2007 May 3.
6
A common variant on chromosome 9p21 affects the risk of myocardial infarction.
Science. 2007 Jun 8;316(5830):1491-3. doi: 10.1126/science.1142842. Epub 2007 May 3.
7
A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants.
Science. 2007 Jun 1;316(5829):1341-5. doi: 10.1126/science.1142382. Epub 2007 Apr 26.
8
Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels.
Science. 2007 Jun 1;316(5829):1331-6. doi: 10.1126/science.1142358. Epub 2007 Apr 26.
9
A variant in CDKAL1 influences insulin response and risk of type 2 diabetes.
Nat Genet. 2007 Jun;39(6):770-5. doi: 10.1038/ng2043. Epub 2007 Apr 26.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验