Suppr超能文献

一种用于研究单核苷酸多态性与疾病的集成数据库-管道系统。

An integrated database-pipeline system for studying single nucleotide polymorphisms and diseases.

作者信息

Yang Jin Ok, Hwang Sohyun, Oh Jeongsu, Bhak Jong, Sohn Tae-Kwon

机构信息

Korean BioInformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, 305-806, Korea.

出版信息

BMC Bioinformatics. 2008 Dec 12;9 Suppl 12(Suppl 12):S19. doi: 10.1186/1471-2105-9-S12-S19.

Abstract

BACKGROUND

Studies on the relationship between disease and genetic variations such as single nucleotide polymorphisms (SNPs) are important. Genetic variations can cause disease by influencing important biological regulation processes. Despite the needs for analyzing SNP and disease correlation, most existing databases provide information only on functional variants at specific locations on the genome, or deal with only a few genes associated with disease. There is no combined resource to widely support gene-, SNP-, and disease-related information, and to capture relationships among such data. Therefore, we developed an integrated database-pipeline system for studying SNPs and diseases.

RESULTS

To implement the pipeline system for the integrated database, we first unified complicated and redundant disease terms and gene names using the Unified Medical Language System (UMLS) for classification and noun modification, and the HUGO Gene Nomenclature Committee (HGNC) and NCBI gene databases. Next, we collected and integrated representative databases for three categories of information. For genes and proteins, we examined the NCBI mRNA, UniProt, UCSC Table Track and MitoDat databases. For genetic variants we used the dbSNP, JSNP, ALFRED, and HGVbase databases. For disease, we employed OMIM, GAD, and HGMD databases. The database-pipeline system provides a disease thesaurus, including genes and SNPs associated with disease. The search results for these categories are available on the web page http://diseasome.kobic.re.kr/, and a genome browser is also available to highlight findings, as well as to permit the convenient review of potentially deleterious SNPs among genes strongly associated with specific diseases and clinical phenotypes.

CONCLUSION

Our system is designed to capture the relationships between SNPs associated with disease and disease-causing genes. The integrated database-pipeline provides a list of candidate genes and SNP markers for evaluation in both epidemiological and molecular biological approaches to diseases-gene association studies. Furthermore, researchers then can decide semi-automatically the data set for association studies while considering the relationships between genetic variation and diseases. The database can also be economical for disease-association studies, as well as to facilitate an understanding of the processes which cause disease. Currently, the database contains 14,674 SNP records and 109,715 gene records associated with human diseases and it is updated at regular intervals.

摘要

背景

研究疾病与单核苷酸多态性(SNP)等基因变异之间的关系非常重要。基因变异可通过影响重要的生物调节过程导致疾病。尽管需要分析SNP与疾病的相关性,但大多数现有数据库仅提供基因组特定位置的功能变异信息,或仅处理少数与疾病相关的基因。目前尚无综合资源能广泛支持与基因、SNP和疾病相关的信息,并捕捉这些数据之间的关系。因此,我们开发了一个用于研究SNP与疾病的综合数据库 - 管道系统。

结果

为实现综合数据库的管道系统,我们首先使用统一医学语言系统(UMLS)进行分类和名词修饰,并借助HUGO基因命名委员会(HGNC)和NCBI基因数据库,统一复杂且冗余的疾病术语和基因名称。接下来,我们收集并整合了三类信息的代表性数据库。对于基因和蛋白质,我们研究了NCBI mRNA、UniProt、UCSC Table Track和MitoDat数据库。对于基因变异,我们使用了dbSNP、JSNP、ALFRED和HGVbase数据库。对于疾病,我们采用了OMIM、GAD和HGMD数据库。该数据库 - 管道系统提供了一个疾病词库,包括与疾病相关的基因和SNP。这些类别的搜索结果可在网页http://diseasome.kobic.re.kr/上获取,同时还提供了一个基因组浏览器,用于突出显示研究结果,并便于查看与特定疾病和临床表型密切相关的基因中潜在有害的SNP。

结论

我们的系统旨在捕捉与疾病相关的SNP和致病基因之间的关系。该综合数据库 - 管道为疾病 - 基因关联研究的流行病学和分子生物学方法评估提供了候选基因和SNP标记列表。此外,研究人员在考虑基因变异与疾病之间的关系时,可以半自动地确定关联研究的数据集。该数据库对于疾病关联研究也较为经济实惠,有助于理解导致疾病产生的过程。目前,该数据库包含14,674条与人类疾病相关的SNP记录和109,715条基因记录,并定期更新。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cb8/2638159/1baf6f14793c/1471-2105-9-S12-S19-1.jpg

相似文献

1
An integrated database-pipeline system for studying single nucleotide polymorphisms and diseases.
BMC Bioinformatics. 2008 Dec 12;9 Suppl 12(Suppl 12):S19. doi: 10.1186/1471-2105-9-S12-S19.
2
MedRefSNP: a database of medically investigated SNPs.
Hum Mutat. 2009 Mar;30(3):E460-6. doi: 10.1002/humu.20914.
5
JSNP: a database of common gene variations in the Japanese population.
Nucleic Acids Res. 2002 Jan 1;30(1):158-62. doi: 10.1093/nar/30.1.158.
6
SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.
BMC Bioinformatics. 2013;14 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-14-S1-S9. Epub 2013 Jan 14.
7
A community-based resource for automatic exome variant-calling and annotation in Mendelian disorders.
BMC Genomics. 2014;15 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2164-15-S3-S5. Epub 2014 May 6.
8
SNP2NMD: a database of human single nucleotide polymorphisms causing nonsense-mediated mRNA decay.
Bioinformatics. 2007 Feb 1;23(3):397-9. doi: 10.1093/bioinformatics/btl593. Epub 2006 Nov 22.
9
F-SNP: computationally predicted functional SNPs for disease association studies.
Nucleic Acids Res. 2008 Jan;36(Database issue):D820-4. doi: 10.1093/nar/gkm904. Epub 2007 Nov 5.

引用本文的文献

1
How to build personalized multi-omics comorbidity profiles.
Front Cell Dev Biol. 2015 Jun 24;3:28. doi: 10.3389/fcell.2015.00028. eCollection 2015.
2
Detection and analysis of disease-associated single nucleotide polymorphism influencing post-translational modification.
BMC Med Genomics. 2015;8 Suppl 2(Suppl 2):S7. doi: 10.1186/1755-8794-8-S2-S7. Epub 2015 May 29.
4
Drug repurposing: far beyond new targets for old drugs.
AAPS J. 2012 Dec;14(4):759-63. doi: 10.1208/s12248-012-9390-1. Epub 2012 Jul 24.
5
VnD: a structure-centric database of disease-related SNPs and drugs.
Nucleic Acids Res. 2011 Jan;39(Database issue):D939-44. doi: 10.1093/nar/gkq957. Epub 2010 Nov 4.
6
Mutation@A Glance: an integrative web application for analysing mutations from human genetic diseases.
DNA Res. 2010 Jun;17(3):197-208. doi: 10.1093/dnares/dsq010. Epub 2010 Apr 1.
7
PhosSNP for systematic analysis of genetic polymorphisms that influence protein phosphorylation.
Mol Cell Proteomics. 2010 Apr;9(4):623-34. doi: 10.1074/mcp.M900273-MCP200. Epub 2009 Dec 8.
8
PDbase: a database of Parkinson's disease-related genes and genetic variation using substantia nigra ESTs.
BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S32. doi: 10.1186/1471-2164-10-S3-S32.
9
Emerging strengths in Asia Pacific bioinformatics.
BMC Bioinformatics. 2008 Dec 12;9 Suppl 12(Suppl 12):S1. doi: 10.1186/1471-2105-9-S12-S1.

本文引用的文献

1
Identification of SNP markers for common CNV regions and association analysis of risk of subarachnoid aneurysmal hemorrhage in Japanese population.
Biochem Biophys Res Commun. 2008 Sep 5;373(4):593-6. doi: 10.1016/j.bbrc.2008.06.083. Epub 2008 Jul 2.
2
SNP@Promoter: a database of human SNPs (single nucleotide polymorphisms) within the putative promoter regions.
BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S2. doi: 10.1186/1471-2105-9-S1-S2.
3
Structural genomic variation in ischemic stroke.
Neurogenetics. 2008 May;9(2):101-8. doi: 10.1007/s10048-008-0119-3. Epub 2008 Feb 21.
4
Database resources of the National Center for Biotechnology Information.
Nucleic Acids Res. 2008 Jan;36(Database issue):D13-21. doi: 10.1093/nar/gkm1000. Epub 2007 Nov 27.
6
The UCSC genome browser database: update 2007.
Nucleic Acids Res. 2007 Jan;35(Database issue):D668-73. doi: 10.1093/nar/gkl928. Epub 2006 Nov 16.
7
Mapping of a gene causing brittle cornea syndrome in Tunisian jews to 16q24.
Invest Ophthalmol Vis Sci. 2006 Dec;47(12):5283-7. doi: 10.1167/iovs.06-0206.
8
Genomic rearrangements and gene copy-number alterations as a cause of nervous system disorders.
Neuron. 2006 Oct 5;52(1):103-21. doi: 10.1016/j.neuron.2006.09.027.
9
SNP@Domain: a web resource of single nucleotide polymorphisms (SNPs) within protein domain structures and sequences.
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W642-4. doi: 10.1093/nar/gkl323.
10
The HUGO Gene Nomenclature Database, 2006 updates.
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D319-21. doi: 10.1093/nar/gkj147.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验