PolyMAPr: programs for polymorphism database mining, annotation, and functional analysis.

Author information

Freimuth Robert R, Stormo Gary D, McLeod Howard L

Affiliations

Department of Medicine, Washington University School of Medicine, St. Louis, Missouri 63110, USA.

Publication information

Hum Mutat. 2005 Feb;25(2):110-7. doi: 10.1002/humu.20123.

DOI:10.1002/humu.20123

PMID:15643605

Abstract

Pharmacogenomic and disease-association studies rely on identifying a comprehensive set of polymorphisms within candidate genes. Public SNP databases are a rich source of polymorphism data, but mining them effectively requires overcoming at least four challenges: ensuring accurate annotations for genes and polymorphisms, eliminating both inter- and intra-database redundancy, integrating data from multiple public sources with data generated locally, and prioritizing the variants for further study. PolyMAPr (Polymorphism Mining and Annotation Programs) was developed to overcome these challenges and to improve the efficiency of database mining and polymorphism annotation. PolyMAPr takes as input a file containing a list of genes to be processed and files containing each annotated gene sequence. Polymorphic sequences obtained from public databases (dbSNP, CGAP, and JSNP) or through local SNP discovery efforts, as well as oligonucleotide sequences (e.g., PCR primers), are mapped to the annotated gene sequences and named according to suggested nomenclature guidelines. The functional effects of nonsynonymous coding-region SNPs (cSNPs) and any variants that might alter exon splicing enhancer (ESE) sites, putative transcription factor binding sites, or intron-exon splice sites are predicted. The output files are accessible through a browser interface. The results are also provided in Extensible Markup Language (XML) format to facilitate uploading them into a local relational database. PolyMAPr increases the efficiency of mining public databases for genetic variants within candidate genes and provides a mechanism by which data from multiple sources (both public and private) can be uniformly integrated, thereby significantly reducing the effort required to obtain a comprehensive set of polymorphisms for pharmacogenomic and disease-association studies. PolyMAPr can be obtained from http://pharmacogenomics.wustl.edu.
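The mapping and redundancy-elimination steps described in the abstract can be illustrated with a small sketch. The abstract does not say how PolyMAPr performs the mapping, so the code below is an assumption, not the authors' implementation: it locates each variant by exact matching of its flanking sequences, then groups records from different sources that land on the same position with the same alleles. The function names (`map_snp`, `merge_redundant`) and the record layout are hypothetical.

```python
# Illustrative sketch only; this is NOT PolyMAPr's code.
# Assumption: variants are located by exact matching of flanking sequence.
from collections import defaultdict


def map_snp(gene_seq, left_flank, right_flank):
    """Return the 0-based position of the variant base in gene_seq,
    or None if the flanks do not match at exactly one location."""
    hits = []
    start = gene_seq.find(left_flank)
    while start != -1:
        pos = start + len(left_flank)  # variant base follows the left flank
        if gene_seq[pos + 1: pos + 1 + len(right_flank)] == right_flank:
            hits.append(pos)
        start = gene_seq.find(left_flank, start + 1)
    return hits[0] if len(hits) == 1 else None


def merge_redundant(gene_seq, records):
    """Group (source_id, left, alleles, right) records by mapped position
    and allele set, so entries describing the same variant collapse."""
    merged = defaultdict(list)
    for source_id, left, alleles, right in records:
        pos = map_snp(gene_seq, left, right)
        if pos is not None:  # unmapped or ambiguous records are skipped
            merged[(pos, frozenset(alleles))].append(source_id)
    return dict(merged)
```

Keying on position plus allele set means two submissions with different flank lengths, or alleles listed in a different order, are still treated as one variant. Ambiguous matches are discarded rather than guessed, which is a conservative choice for this sketch.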

Similar articles

PolyMAPr: programs for polymorphism database mining, annotation, and functional analysis.

Hum Mutat. 2005 Feb;25(2):110-7. doi: 10.1002/humu.20123.

SNP mining porcine ESTs with MAVIANT, a novel tool for SNP evaluation and annotation.

Bioinformatics. 2007 Jul 1;23(13):i387-91. doi: 10.1093/bioinformatics/btm192.

AutoSNPdb: an annotated single nucleotide polymorphism database for crop plants.

Nucleic Acids Res. 2009 Jan;37(Database issue):D951-3. doi: 10.1093/nar/gkn650. Epub 2008 Oct 14.

New generation pharmacogenomic tools: a SNP linkage disequilibrium Map, validated SNP assay resource, and high-throughput instrumentation system for large-scale genetic studies.

Biotechniques. 2002 Jun;Suppl:48-50, 52, 54.

Data mining of public SNP databases for the selection of intragenic SNPs.

Hum Mutat. 2002 Sep;20(3):162-73. doi: 10.1002/humu.10107.

PromoLign: a database for upstream region analysis and SNPs.

Hum Mutat. 2004 Jun;23(6):534-9. doi: 10.1002/humu.20049.

SNP@Ethnos: a database of ethnically variant single-nucleotide polymorphisms.

Nucleic Acids Res. 2007 Jan;35(Database issue):D711-5. doi: 10.1093/nar/gkl962. Epub 2006 Nov 28.

Frequency Finder: a multi-source web application for collection of public allele frequencies of SNP markers.

Bioinformatics. 2004 Feb 12;20(3):439-43. doi: 10.1093/bioinformatics/btg446. Epub 2004 Jan 22.

SNP-PHAGE--High throughput SNP discovery pipeline.

BMC Bioinformatics. 2006 Oct 23;7:468. doi: 10.1186/1471-2105-7-468.

Go!Poly: A gene-oriented polymorphism database.

Hum Mutat. 2001 Nov;18(5):382-7. doi: 10.1002/humu.1209.

Cited by

SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.

BMC Bioinformatics. 2013;14 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-14-S1-S9. Epub 2013 Jan 14.

Genetic variation in the beta2 subunit of the voltage-gated calcium channel and pharmacogenetic association with adverse cardiovascular outcomes in the INternational VErapamil SR-Trandolapril STudy GENEtic Substudy (INVEST-GENES).

Circ Cardiovasc Genet. 2010 Dec;3(6):548-55. doi: 10.1161/CIRCGENETICS.110.957654.

Bioinformatic tools for identifying disease gene and SNP candidates.

Methods Mol Biol. 2010;628:307-19. doi: 10.1007/978-1-60327-367-1_17.

CACNA1C gene polymorphisms, cardiovascular disease outcomes, and treatment response.

Circ Cardiovasc Genet. 2009 Aug;2(4):362-70. doi: 10.1161/CIRCGENETICS.109.857839. Epub 2009 Jun 3.

Analytical methods for inferring functional effects of single base pair substitutions in human cancers.

Hum Genet. 2009 Oct;126(4):481-98. doi: 10.1007/s00439-009-0677-y. Epub 2009 May 12.

A survey of genomic properties for the detection of regulatory polymorphisms.

PLoS Comput Biol. 2007 Jun;3(6):e106. doi: 10.1371/journal.pcbi.0030106. Epub 2007 Apr 25.

Single nucleotide polymorphism discovery and haplotype analysis of Ca2+-dependent K+ channel beta-1 subunit.

Pharmacogenet Genomics. 2007 Apr;17(4):267-75. doi: 10.1097/FPC.0b013e3280105235.

Identification of NR1I2 genetic variation using resequencing.

Eur J Clin Pharmacol. 2007 Jun;63(6):547-54. doi: 10.1007/s00228-007-0295-3. Epub 2007 Apr 3.

PharmGED: Pharmacogenetic Effect Database.

Nucleic Acids Res. 2007 Jan;35(Database issue):D794-9. doi: 10.1093/nar/gkl853. Epub 2006 Dec 6.

FASTSNP: an always up-to-date and extendable service for SNP function analysis and prioritization.

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W635-41. doi: 10.1093/nar/gkl236.
