Suppr超能文献

复杂疾病关联研究中的目标单核苷酸多态性选择

Target SNP selection in complex disease association studies.

作者信息

Wjst Matthias

机构信息

Gruppe Molekulare Epidemiologie, Institut für Epidemiologie, GSF - Forschungszentrum für Umwelt und Gesundheit, Ingolstädter Landstrasse 1, D-85758 Neuherberg/Munich, Germany.

出版信息

BMC Bioinformatics. 2004 Jul 12;5:92. doi: 10.1186/1471-2105-5-92.

Abstract

BACKGROUND

The massive amount of SNP data stored at public internet sites provides unprecedented access to human genetic variation. Selecting target SNP for disease-gene association studies is currently done more or less randomly as decision rules for the selection of functional relevant SNPs are not available.

RESULTS

We implemented a computational pipeline that retrieves the genomic sequence of target genes, collects information about sequence variation and selects functional motifs containing SNPs. Motifs being considered are gene promoter, exon-intron structure, AU-rich mRNA elements, transcription factor binding motifs, cryptic and enhancer splice sites together with expression in target tissue. As a case study, 396 genes on chromosome 6p21 in the extended HLA region were selected that contributed nearly 20,000 SNPs. By computer annotation ~2,500 SNPs in functional motifs could be identified. Most of these SNPs are disrupting transcription factor binding sites but only those introducing new sites had a significant depressing effect on SNP allele frequency. Other decision rules concern position within motifs, the validity of SNP database entries, the unique occurrence in the genome and conserved sequence context in other mammalian genomes.

CONCLUSION

Only 10% of all gene-based SNPs have sequence-predicted functional relevance making them a primary target for genotyping in association studies.

摘要

背景

存储在公共互联网站点上的大量单核苷酸多态性(SNP)数据为获取人类遗传变异提供了前所未有的途径。目前,在疾病基因关联研究中选择目标SNP或多或少是随机进行的,因为尚无选择功能相关SNP的决策规则。

结果

我们实施了一个计算流程,该流程检索目标基因的基因组序列,收集有关序列变异的信息,并选择包含SNP的功能基序。所考虑的基序包括基因启动子、外显子-内含子结构、富含AU的mRNA元件、转录因子结合基序、隐蔽和增强子剪接位点以及在目标组织中的表达。作为一个案例研究,我们选择了扩展的HLA区域中6号染色体p21上的396个基因,这些基因贡献了近20,000个SNP。通过计算机注释,可以识别出功能基序中的约2,500个SNP。这些SNP中的大多数破坏了转录因子结合位点,但只有那些引入新位点的SNP对SNP等位基因频率有显著的抑制作用。其他决策规则涉及基序内的位置、SNP数据库条目的有效性、在基因组中的独特出现以及其他哺乳动物基因组中的保守序列背景。

结论

所有基于基因的SNP中只有10%具有序列预测的功能相关性,这使其成为关联研究中基因分型的主要目标。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0e/487897/f04f476a5f95/1471-2105-5-92-1.jpg

相似文献

1
Target SNP selection in complex disease association studies.
BMC Bioinformatics. 2004 Jul 12;5:92. doi: 10.1186/1471-2105-5-92.
2
An efficient computational method for screening functional SNPs in plants.
J Theor Biol. 2010 Jul 7;265(1):55-62. doi: 10.1016/j.jtbi.2010.04.017. Epub 2010 Apr 18.
4
LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources.
Bioinformatics. 2005 Jun 15;21(12):2814-20. doi: 10.1093/bioinformatics/bti442. Epub 2005 Apr 12.
6
An SNP map of human chromosome 22.
Nature. 2000 Sep 28;407(6803):516-20. doi: 10.1038/35035089.
10
Selecting single-nucleotide polymorphisms for association studies with SNPbrowser software.
Methods Mol Biol. 2007;376:177-93. doi: 10.1007/978-1-59745-389-9_13.

引用本文的文献

1
Predictive and prognostic molecular markers for cholangiocarcinoma in Han Chinese population.
Int J Clin Exp Med. 2015 Aug 15;8(8):13680-9. eCollection 2015.
2
The possible role of EZH2 and DNMT1 polymorphisms in sporadic triple-negative breast carcinoma in southern Chinese females.
Tumour Biol. 2015 Dec;36(12):9849-55. doi: 10.1007/s13277-015-3754-y. Epub 2015 Jul 11.
4
Promoter polymorphisms of DNA methyltransferase 3B and risk of hepatocellular carcinoma.
Biomed Rep. 2013 Sep;1(5):771-775. doi: 10.3892/br.2013.142. Epub 2013 Jul 19.
5
DNMT3A -448A>G polymorphism and the risk for hepatocellular carcinoma.
Biomed Rep. 2013 Jul;1(4):664-668. doi: 10.3892/br.2013.121. Epub 2013 May 30.
6
SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.
BMC Bioinformatics. 2013;14 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-14-S1-S9. Epub 2013 Jan 14.
7
Risk-association of DNA methyltransferases polymorphisms with gastric cancer in the Southern Chinese population.
Int J Mol Sci. 2012;13(7):8364-8378. doi: 10.3390/ijms13078364. Epub 2012 Jul 5.
8
Mechanistic insights into the link between visfatin gene C-1535T polymorphism and coronary artery disease: an in vitro study.
Mol Cell Biochem. 2012 Apr;363(1-2):315-22. doi: 10.1007/s11010-011-1184-8. Epub 2011 Dec 7.
10
Domain altering SNPs in the human proteome and their impact on signaling pathways.
PLoS One. 2010 Sep 23;5(9):e12890. doi: 10.1371/journal.pone.0012890.

本文引用的文献

1
PARSESNP: A tool for the analysis of nucleotide polymorphisms.
Nucleic Acids Res. 2003 Jul 1;31(13):3808-11. doi: 10.1093/nar/gkg574.
2
High-resolution SNP scan of chromosome 6p21 in pooled samples from patients with complex diseases.
Genomics. 2003 May;81(5):510-8. doi: 10.1016/s0888-7543(02)00035-6.
3
Quality and completeness of SNP databases.
Nat Genet. 2003 Apr;33(4):457-8. doi: 10.1038/ng1133. Epub 2003 Mar 24.
4
TRANSFAC: transcriptional regulation, from patterns to profiles.
Nucleic Acids Res. 2003 Jan 1;31(1):374-8. doi: 10.1093/nar/gkg108.
5
6
SNPper: retrieval and analysis of human SNPs.
Bioinformatics. 2002 Dec;18(12):1681-5. doi: 10.1093/bioinformatics/18.12.1681.
7
SNP databases and pharmacogenetics: great start, but a long way to go.
Hum Mutat. 2002 Sep;20(3):174-9. doi: 10.1002/humu.10115.
8
Association of genes to genetically inherited diseases using data mining.
Nat Genet. 2002 Jul;31(3):316-9. doi: 10.1038/ng895. Epub 2002 May 13.
9
10
Comprehensive analysis of CpG islands in human chromosomes 21 and 22.
Proc Natl Acad Sci U S A. 2002 Mar 19;99(6):3740-5. doi: 10.1073/pnas.052410099. Epub 2002 Mar 12.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验