人类心脏增强子的全基因组发现。

Genome-wide discovery of human heart enhancers.

机构信息

Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health (NIH), Bethesda, Maryland 20894, USA.

出版信息

Genome Res. 2010 Mar;20(3):381-92. doi: 10.1101/gr.098657.109. Epub 2010 Jan 14.

DOI:10.1101/gr.098657.109

PMID:20075146

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2840982/

Abstract

The various organogenic programs deployed during embryonic development rely on the precise expression of a multitude of genes in time and space. Identifying the cis-regulatory elements responsible for this tightly orchestrated regulation of gene expression is an essential step in understanding the genetic pathways involved in development. We describe a strategy to systematically identify tissue-specific cis-regulatory elements that share combinations of sequence motifs. Using heart development as an experimental framework, we employed a combination of Gibbs sampling and linear regression to build a classifier that identifies heart enhancers based on the presence and/or absence of various sequence features, including known and putative transcription factor (TF) binding specificities. In distinguishing heart enhancers from a large pool of random noncoding sequences, the performance of our classifier is vastly superior to four commonly used methods, with an accuracy reaching 92% in cross-validation. Furthermore, most of the binding specificities learned by our method resemble the specificities of TFs widely recognized as key players in heart development and differentiation, such as SRF, MEF2, ETS1, SMAD, and GATA. Using our classifier as a predictor, a genome-wide scan identified over 40,000 novel human heart enhancers. Although the classifier used no gene expression information, these novel enhancers are strongly associated with genes expressed in the heart. Finally, in vivo tests of our predictions in mouse and zebrafish achieved a validation rate of 62%, significantly higher than what is expected by chance. These results support the existence of underlying cis-regulatory codes dictating tissue-specific transcription in mammalian genomes and validate our enhancer classifier strategy as a method to uncover these regulatory codes.

摘要

胚胎发育过程中各种器官发生程序依赖于众多基因在时间和空间上的精确表达。鉴定负责这种基因表达精确调控的顺式调控元件是理解参与发育的遗传途径的关键步骤。我们描述了一种系统识别具有组合序列基序的组织特异性顺式调控元件的策略。我们以心脏发育为实验框架，结合 Gibbs 抽样和线性回归来构建一个分类器，该分类器基于各种序列特征（包括已知和假定的转录因子 [TF] 结合特异性）的存在与否来识别心脏增强子。在将心脏增强子与大量随机非编码序列区分开来时，我们的分类器的性能远远优于四种常用方法，交叉验证的准确率达到 92%。此外，我们方法中学习到的大多数结合特异性与广泛认为是心脏发育和分化关键参与者的 TF 特异性相似，例如 SRF、MEF2、ETS1、SMAD 和 GATA。使用我们的分类器作为预测器，对人类基因组进行了全基因组扫描，鉴定出超过 40,000 个新的人类心脏增强子。尽管分类器没有使用基因表达信息，但这些新的增强子与在心脏中表达的基因强烈相关。最后，在小鼠和斑马鱼中的体内预测测试达到了 62%的验证率，明显高于随机预期。这些结果支持在哺乳动物基因组中存在决定组织特异性转录的潜在顺式调控代码，并验证了我们的增强子分类器策略作为揭示这些调控代码的方法的有效性。

相似文献

Genome-wide discovery of human heart enhancers.

Genome Res. 2010 Mar;20(3):381-92. doi: 10.1101/gr.098657.109. Epub 2010 Jan 14.

Sequence signatures extracted from proximal promoters can be used to predict distal enhancers.

Genome Biol. 2013;14(10):R117. doi: 10.1186/gb-2013-14-10-r117.

Experimental validation of predicted mammalian erythroid cis-regulatory modules.

Genome Res. 2006 Dec;16(12):1480-92. doi: 10.1101/gr.5353806. Epub 2006 Oct 12.

De novo prediction of cis-regulatory elements and modules through integrative analysis of a large number of ChIP datasets.

BMC Genomics. 2014 Dec 2;15:1047. doi: 10.1186/1471-2164-15-1047.

cis-regulatory analysis of the Drosophila pdm locus reveals a diversity of neural enhancers.

BMC Genomics. 2015 Sep 16;16(1):700. doi: 10.1186/s12864-015-1897-2.

Novel mRNAs 3' end-associated -regulatory elements with epigenomic signatures of mammalian enhancers in the genome.

RNA. 2019 Oct;25(10):1242-1258. doi: 10.1261/rna.071209.119. Epub 2019 Jul 16.

Inferring dynamic gene regulatory networks in cardiac differentiation through the integration of multi-dimensional data.

BMC Bioinformatics. 2015 Mar 7;16:74. doi: 10.1186/s12859-015-0460-0.

Interrogating transcriptional regulatory sequences in Tol2-mediated Xenopus transgenics.

PLoS One. 2013 Jul 16;8(7):e68548. doi: 10.1371/journal.pone.0068548. Print 2013.

DNA specificity determinants associate with distinct transcription factor functions.

PLoS Genet. 2009 Dec;5(12):e1000778. doi: 10.1371/journal.pgen.1000778. Epub 2009 Dec 18.

Homotypic clusters of transcription factor binding sites are a key component of human promoters and enhancers.

Genome Res. 2010 May;20(5):565-77. doi: 10.1101/gr.104471.109. Epub 2010 Apr 2.

引用本文的文献

The combinatorial binding syntax of transcription factors in forebrain-specific enhancers.

Biol Open. 2025 Feb 15;14(2). doi: 10.1242/bio.061751. Epub 2025 Feb 19.

Cardiac Transcription Factors and Regulatory Networks.

Adv Exp Med Biol. 2024;1441:295-311. doi: 10.1007/978-3-031-44087-8_16.

Cell-type-directed design of synthetic enhancers.

Nature. 2024 Feb;626(7997):212-220. doi: 10.1038/s41586-023-06936-2. Epub 2023 Dec 12.

RefSeq Functional Elements as experimentally assayed nongenic reference standards and functional interactions in human and mouse.

Genome Res. 2022 Jan;32(1):175-188. doi: 10.1101/gr.275819.121. Epub 2021 Dec 7.

Genomic enhancers in cardiac development and disease.

Nat Rev Cardiol. 2022 Jan;19(1):7-25. doi: 10.1038/s41569-021-00597-2. Epub 2021 Aug 11.

Fish-Ing for Enhancers in the Heart.

Int J Mol Sci. 2021 Apr 10;22(8):3914. doi: 10.3390/ijms22083914.

Heart Enhancers: Development and Disease Control at a Distance.

Front Genet. 2021 Mar 10;12:642975. doi: 10.3389/fgene.2021.642975. eCollection 2021.

SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models.

BMC Res Notes. 2021 Mar 19;14(1):104. doi: 10.1186/s13104-021-05518-7.

Supervised enhancer prediction with epigenetic pattern recognition and targeted validation.

Nat Methods. 2020 Aug;17(8):807-814. doi: 10.1038/s41592-020-0907-8. Epub 2020 Jul 29.

Epigenetic and Transcriptional Networks Underlying Atrial Fibrillation.

Circ Res. 2020 Jun 19;127(1):34-50. doi: 10.1161/CIRCRESAHA.120.316574. Epub 2020 Jun 18.

本文引用的文献

Histone modifications at human enhancers reflect global cell-type-specific gene expression.

Nature. 2009 May 7;459(7243):108-12. doi: 10.1038/nature07829. Epub 2009 Mar 18.

Uncoupling time and space in the collinear regulation of Hox genes.

PLoS Genet. 2009 Mar;5(3):e1000398. doi: 10.1371/journal.pgen.1000398. Epub 2009 Mar 6.

ChIP-seq accurately predicts tissue-specific activity of enhancers.

Nature. 2009 Feb 12;457(7231):854-8. doi: 10.1038/nature07730.

The transcription factor LMO2 is a robust marker of vascular endothelium and vascular neoplasms and selected other entities.

Am J Clin Pathol. 2009 Feb;131(2):264-78. doi: 10.1309/AJCP5FP3NAXAXRJE.

Asymmetrical distribution of non-conserved regulatory sequences at PHOX2B is reflected at the ENCODE loci and illuminates a possible genome-wide trend.

BMC Genomics. 2009 Jan 7;10:8. doi: 10.1186/1471-2164-10-8.

Combinatorial regulation of endothelial gene expression by ets and forkhead transcription factors.

Cell. 2008 Dec 12;135(6):1053-64. doi: 10.1016/j.cell.2008.10.049.

The developmental genetics of congenital heart disease.

Nature. 2008 Feb 21;451(7181):943-8. doi: 10.1038/nature06801.

Predicting expression patterns from regulatory sequence in Drosophila segmentation.

Nature. 2008 Jan 31;451(7178):535-40. doi: 10.1038/nature06496. Epub 2008 Jan 2.

A nucleosome-guided map of transcription factor binding sites in yeast.

PLoS Comput Biol. 2007 Nov;3(11):e215. doi: 10.1371/journal.pcbi.0030215. Epub 2007 Sep 24.

A high-resolution atlas of nucleosome occupancy in yeast.

Nat Genet. 2007 Oct;39(10):1235-44. doi: 10.1038/ng2117. Epub 2007 Sep 16.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

人类心脏增强子的全基因组发现。

Genome-wide discovery of human heart enhancers.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献