Norris Cancer Center, University of Southern California, Los Angeles, CA 90033, USA.
Nucleic Acids Res. 2012 Oct;40(18):e139. doi: 10.1093/nar/gks542. Epub 2012 Jun 8.
Single nucleotide polymorphisms (SNPs) are increasingly used to tag genetic loci associated with phenotypes such as risk of complex diseases. Technically, this is done genome-wide without prior restriction or knowledge of biological feasibility in scans referred to as genome-wide association studies (GWAS). Depending on the linkage disequilibrium (LD) structure at a particular locus, such tagSNPs may be surrogates for many thousands of other SNPs, and it is difficult to distinguish those that may play a functional role in the phenotype from those simply genetically linked. Because a large proportion of tagSNPs have been identified within non-coding regions of the genome, distinguishing functional from non-functional SNPs has been an even greater challenge. A strategy was recently proposed that prioritizes surrogate SNPs based on non-coding chromatin and epigenomic mapping techniques that have become feasible with the advent of massively parallel sequencing. Here, we introduce an R/Bioconductor software package that enables the identification of candidate functional SNPs by integrating information from tagSNP locations, lists of linked SNPs from the 1000 genomes project and locations of chromatin features which may have functional significance.
FunciSNP is available from Bioconductor (bioconductor.org).
单核苷酸多态性(SNPs)越来越多地被用于标记与复杂疾病风险等表型相关的遗传基因座。从技术上讲,这是在没有事先限制或了解生物可行性的情况下,在全基因组范围内进行的扫描,称为全基因组关联研究(GWAS)。根据特定基因座的连锁不平衡(LD)结构,这些标签 SNP 可能是数千个其他 SNP 的替代物,并且很难区分那些可能在表型中发挥功能作用的 SNP 和那些仅仅在遗传上相关的 SNP。由于大多数标签 SNP 已在基因组的非编码区域中被识别,因此区分功能 SNP 和非功能 SNP 更是一项挑战。最近提出了一种策略,该策略基于非编码染色质和表观基因组图谱技术,优先考虑替代 SNP,这些技术随着大规模并行测序的出现而成为可能。在这里,我们引入了一个 R/Bioconductor 软件包,该软件包通过整合来自标签 SNP 位置、1000 个基因组项目的连锁 SNP 列表以及可能具有功能意义的染色质特征位置的信息,从而能够识别候选功能 SNP。
FunciSNP 可从 Bioconductor(bioconductor.org)获得。