解读全基因组关联研究：利用表观基因组学和基因组工程来理解人类基因组非编码区单核苷酸多态性的功能相关性。

Making sense of GWAS: using epigenomics and genome engineering to understand the functional relevance of SNPs in non-coding regions of the human genome.

作者信息

Tak Yu Gyoung, Farnham Peggy J

机构信息

Department of Biochemistry and Molecular Biology, Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, CA 90089 USA.

出版信息

Epigenetics Chromatin. 2015 Dec 30;8:57. doi: 10.1186/s13072-015-0050-4. eCollection 2015.

DOI:10.1186/s13072-015-0050-4

PMID:26719772

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4696349/

Abstract

Considerable progress towards an understanding of complex diseases has been made in recent years due to the development of high-throughput genotyping technologies. Using microarrays that contain millions of single-nucleotide polymorphisms (SNPs), Genome Wide Association Studies (GWASs) have identified SNPs that are associated with many complex diseases or traits. For example, as of February 2015, 2111 association studies have identified 15,396 SNPs for various diseases and traits, with the number of identified SNP-disease/trait associations increasing rapidly in recent years. However, it has been difficult for researchers to understand disease risk from GWAS results. This is because most GWAS-identified SNPs are located in non-coding regions of the genome. It is important to consider that the GWAS-identified SNPs serve only as representatives for all SNPs in the same haplotype block, and it is equally likely that other SNPs in high linkage disequilibrium (LD) with the array-identified SNPs are causal for the disease. Because it was hoped that disease-associated coding variants would be identified if the true casual SNPs were known, investigators have expanded their analyses using LD calculation and fine-mapping. However, such analyses also identified risk-associated SNPs located in non-coding regions. Thus, the GWAS field has been left with the conundrum as to how a single-nucleotide change in a non-coding region could confer increased risk for a specific disease. One possible answer to this puzzle is that the variant SNPs cause changes in gene expression levels rather than causing changes in protein function. This review provides a description of (1) advances in genomic and epigenomic approaches that incorporate functional annotation of regulatory elements to prioritize the disease risk-associated SNPs that are located in non-coding regions of the genome for follow-up studies, (2) various computational tools that aid in identifying gene expression changes caused by the non-coding disease-associated SNPs, and (3) experimental approaches to identify target genes of, and study the biological phenotypes conferred by, non-coding disease-associated SNPs.

摘要

近年来，由于高通量基因分型技术的发展，在理解复杂疾病方面取得了相当大的进展。通过使用包含数百万个单核苷酸多态性（SNP）的微阵列，全基因组关联研究（GWAS）已经鉴定出与许多复杂疾病或性状相关的SNP。例如，截至2015年2月，2111项关联研究已经鉴定出15396个与各种疾病和性状相关的SNP，近年来鉴定出的SNP-疾病/性状关联数量迅速增加。然而，研究人员很难从GWAS结果中理解疾病风险。这是因为大多数GWAS鉴定出的SNP位于基因组的非编码区域。需要考虑的是，GWAS鉴定出的SNP仅作为同一单倍型块中所有SNP的代表，与阵列鉴定出的SNP处于高连锁不平衡（LD）状态的其他SNP同样有可能是该疾病的致病因素。由于人们希望如果知道真正的致病SNP，就能够鉴定出与疾病相关的编码变异，研究人员已经使用LD计算和精细定位扩展了他们的分析。然而，这些分析也鉴定出了位于非编码区域的风险相关SNP。因此，GWAS领域面临着一个难题，即非编码区域的单核苷酸变化如何能够增加特定疾病的风险。这个谜题的一个可能答案是，变异的SNP导致基因表达水平的变化，而不是导致蛋白质功能的变化。本综述描述了：（1）基因组和表观基因组方法的进展，这些方法结合了调控元件的功能注释，以便对位于基因组非编码区域的疾病风险相关SNP进行优先级排序，用于后续研究；（2）各种有助于识别由非编码疾病相关SNP引起的基因表达变化的计算工具；（3）识别非编码疾病相关SNP的靶基因并研究其赋予的生物学表型的实验方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc4/4696349/564e8f789a85/13072_2015_50_Fig1_HTML.jpg

相似文献

Making sense of GWAS: using epigenomics and genome engineering to understand the functional relevance of SNPs in non-coding regions of the human genome.

Epigenetics Chromatin. 2015 Dec 30;8:57. doi: 10.1186/s13072-015-0050-4. eCollection 2015.

Endometrial vezatin and its association with endometriosis risk.

Hum Reprod. 2016 May;31(5):999-1013. doi: 10.1093/humrep/dew047. Epub 2016 Mar 22.

From GWAS to Gene: Transcriptome-Wide Association Studies and Other Methods to Functionally Understand GWAS Discoveries.

Front Genet. 2021 Sep 30;12:713230. doi: 10.3389/fgene.2021.713230. eCollection 2021.

Structural variants in linkage disequilibrium with GWAS-significant SNPs.

Heliyon. 2024 May 28;10(11):e32053. doi: 10.1016/j.heliyon.2024.e32053. eCollection 2024 Jun 15.

Human-genome single nucleotide polymorphisms affecting transcription factor binding and their role in pathogenesis.

Vavilovskii Zhurnal Genet Selektsii. 2023 Oct;27(6):662-675. doi: 10.18699/VJGB-23-77.

Predicting cell types and genetic variations contributing to disease by combining GWAS and epigenetic data.

PLoS One. 2013;8(1):e54359. doi: 10.1371/journal.pone.0054359. Epub 2013 Jan 30.

SNPinfo: integrating GWAS and candidate gene information into functional SNP selection for genetic association studies.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W600-5. doi: 10.1093/nar/gkp290. Epub 2009 May 5.

Identification of potential genetic causal variants for obesity-related traits using statistical fine mapping.

Mol Genet Genomics. 2023 Nov;298(6):1309-1319. doi: 10.1007/s00438-023-02055-9. Epub 2023 Jul 27.

Characterisation of non-coding genetic variation in histamine receptors using AnNCR-SNP.

Amino Acids. 2016 Oct;48(10):2433-42. doi: 10.1007/s00726-016-2265-5. Epub 2016 Jun 6.

Selecting Closely-Linked SNPs Based on Local Epistatic Effects for Haplotype Construction Improves Power of Association Mapping.

G3 (Bethesda). 2019 Dec 3;9(12):4115-4126. doi: 10.1534/g3.119.400451.

引用本文的文献

The Importance of Regulatory Network Structure for Complex Trait Heritability and Evolution.

Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf174.

In Silico Analysis of Post-COVID-19 Condition (PCC) Associated SNP rs9367106 Predicts the Molecular Basis of Abnormalities in the Lungs and Brain Functions.

Int J Mol Sci. 2025 Jul 11;26(14):6680. doi: 10.3390/ijms26146680.

Genetics of growth rate in induced pluripotent stem cells.

bioRxiv. 2025 Jul 3:2025.07.02.662844. doi: 10.1101/2025.07.02.662844.

A genome-wide association study integrated with single-cell and bulk profiles uncovers susceptibility genes for nasopharyngeal carcinoma involved in tumorigenesis via regulation of T cells.

Genome Biol. 2025 Jul 7;26(1):195. doi: 10.1186/s13059-025-03657-9.

Application of human iPSC-derived white, beige, and brown adipocytes for metabolic disease modeling and transplantation therapy.

Cell Transplant. 2025 Jan-Dec;34:9636897251346599. doi: 10.1177/09636897251346599. Epub 2025 Jun 19.

In Silico Prioritization of STAT1 3' UTR SNPs Identifies rs190542524 as a miRNA-Linked Variant with Potential Oncogenic Impact.

Noncoding RNA. 2025 Apr 29;11(3):32. doi: 10.3390/ncrna11030032.

Genome-wide association study of biological nitrogen fixation traits in mini-core cowpea germplasm.

PLoS One. 2025 May 9;20(5):e0322203. doi: 10.1371/journal.pone.0322203. eCollection 2025.

CRISPR genome editing using a combined positive and negative selection system.

PLoS One. 2025 May 6;20(5):e0321881. doi: 10.1371/journal.pone.0321881. eCollection 2025.

Regulatory element map of sheep reproductive tissues: functional annotation of tissue-specific strong active enhancers.

Front Vet Sci. 2025 Apr 16;12:1564148. doi: 10.3389/fvets.2025.1564148. eCollection 2025.

An inflammation-associated lncRNA induces neuronal damage via mitochondrial dysfunction.

Mol Ther Nucleic Acids. 2025 Apr 2;36(2):102533. doi: 10.1016/j.omtn.2025.102533. eCollection 2025 Jun 10.

本文引用的文献

Demystifying the secret mission of enhancers: linking distal regulatory elements to target genes.

Crit Rev Biochem Mol Biol. 2015;50(6):550-73. doi: 10.3109/10409238.2015.1087961. Epub 2015 Oct 8.

A global reference for human genetic variation.

Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.

Partitioning heritability by functional annotation using genome-wide association summary statistics.

Nat Genet. 2015 Nov;47(11):1228-35. doi: 10.1038/ng.3404. Epub 2015 Sep 28.

BCL11A enhancer dissection by Cas9-mediated in situ saturating mutagenesis.

Nature. 2015 Nov 12;527(7577):192-7. doi: 10.1038/nature15521. Epub 2015 Sep 16.

Functional footprinting of regulatory DNA.

Nat Methods. 2015 Oct;12(10):927-30. doi: 10.1038/nmeth.3554. Epub 2015 Aug 31.

Genome-wide mapping of promoter-anchored interactions with close to single-enhancer resolution.

Genome Biol. 2015 Aug 3;16(1):156. doi: 10.1186/s13059-015-0727-9.

FTO Obesity Variant Circuitry and Adipocyte Browning in Humans.

N Engl J Med. 2015 Sep 3;373(10):895-907. doi: 10.1056/NEJMoa1502214. Epub 2015 Aug 19.

CRISPR Inversion of CTCF Sites Alters Genome Topology and Enhancer/Promoter Function.

Cell. 2015 Aug 13;162(4):900-10. doi: 10.1016/j.cell.2015.07.038.

A CTCF Code for 3D Genome Architecture.

Cell. 2015 Aug 13;162(4):703-5. doi: 10.1016/j.cell.2015.07.053.

motifbreakR: an R/Bioconductor package for predicting variant effects at transcription factor binding sites.

Bioinformatics. 2015 Dec 1;31(23):3847-9. doi: 10.1093/bioinformatics/btv470. Epub 2015 Aug 12.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

解读全基因组关联研究：利用表观基因组学和基因组工程来理解人类基因组非编码区单核苷酸多态性的功能相关性。

Making sense of GWAS: using epigenomics and genome engineering to understand the functional relevance of SNPs in non-coding regions of the human genome.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献