In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences.

Affiliations

Agricultural Bioinformatics Key Laboratory of Hubei Province, Huazhong Agricultural University, Wuhan 430070, China.

College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.

Publication information

Sci Rep. 2016 Sep 1;6:32476. doi: 10.1038/srep32476.

DOI:10.1038/srep32476

PMID:27582178

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5007594/

Abstract

Enhancers interact with gene promoters to form chromatin loops that serve important functions in biological processes such as the regulation of gene transcription and cell differentiation. However, enhancers are difficult to identify because they generally lack fixed positions and consensus sequence features, and experimental identification is costly in both labor and expense. In this work, several enhancer prediction models were built from various sequence-based feature sets and their combinations. After feature selection by recursive feature elimination, the model using a combination of 141 transcription factor binding motif occurrences, selected from 1,422 transcription factor position weight matrices, achieved a prediction accuracy higher than that of other reported methods. The models showed good prediction accuracy on enhancer datasets from different cell lines and tissues, and accuracy improved further when chromatin state features were integrated. Our method complements wet-lab experimental methods and provides an additional way to identify enhancers.
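
To make the feature type concrete, the following is a minimal sketch of how motif occurrences from position weight matrices could be turned into a per-sequence feature vector. It is not the authors' pipeline: the abstract does not state which scanning tool, score threshold, or background model was used, so the function names, the toy motifs (`TGAC_like`, `GATA_like`), the pseudocount, the uniform background, and the threshold of 6.0 are all assumptions made for illustration. Sequences are assumed to contain only uppercase A, C, G, and T.

```python
import math

BASES = "ACGT"
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def consensus_pfm(consensus, count=10):
    """Hypothetical helper: build a toy frequency matrix from a consensus string."""
    return [{b: (count if b == base else 0) for b in BASES} for base in consensus]


def to_log_odds(pfm, background=0.25, pseudocount=0.01):
    """Convert per-position base counts into log2 odds against a uniform background."""
    pwm = []
    for column in pfm:
        total = sum(column[b] for b in BASES) + 4 * pseudocount
        pwm.append({
            b: math.log2(((column[b] + pseudocount) / total) / background)
            for b in BASES
        })
    return pwm


def reverse_complement(seq):
    return "".join(COMPLEMENT[b] for b in reversed(seq))


def score_window(pwm, window):
    """Sum the per-position log-odds scores for one window of motif width."""
    return sum(column[base] for column, base in zip(pwm, window))


def count_occurrences(pwm, seq, threshold):
    """Count windows on either strand whose score reaches the threshold."""
    width = len(pwm)
    hits = 0
    for strand in (seq, reverse_complement(seq)):
        for i in range(len(strand) - width + 1):
            if score_window(pwm, strand[i:i + width]) >= threshold:
                hits += 1
    return hits


def motif_features(pwms, seq, threshold):
    """One occurrence count per motif, ordered by motif name."""
    return [count_occurrences(pwms[name], seq, threshold) for name in sorted(pwms)]


# Toy motif set (assumed); a real run would load the full matrix collection.
pwms = {
    "TGAC_like": to_log_odds(consensus_pfm("TGAC")),
    "GATA_like": to_log_odds(consensus_pfm("GATA")),
}
features = motif_features(pwms, "AATGACGGTCAAT", threshold=6.0)
print(features)
```

Each candidate region would yield one such vector, with one entry per matrix, and a feature selection step such as recursive feature elimination would then retain only the most informative motifs before training a classifier. Note that this sketch scans both strands, so a palindromic motif is counted once per strand at the same site.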

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf46/5007594/022f06a4bf91/srep32476-f1.jpg

Similar articles

In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences.

Sci Rep. 2016 Sep 1;6:32476. doi: 10.1038/srep32476.

Enhancer prediction in the human genome by probabilistic modelling of the chromatin feature patterns.

BMC Bioinformatics. 2020 Jul 20;21(1):317. doi: 10.1186/s12859-020-03621-3.

Genome-wide prediction of transcription factor binding sites using an integrated model.

Genome Biol. 2010 Jan 22;11(1):R7. doi: 10.1186/gb-2010-11-1-r7.

Role of chromatin and transcriptional co-regulators in mediating p63-genome interactions in keratinocytes.

BMC Genomics. 2014 Nov 29;15(1):1042. doi: 10.1186/1471-2164-15-1042.

Identification of Transcribed Enhancers by Genome-Wide Chromatin Immunoprecipitation Sequencing.

Methods Mol Biol. 2017;1468:91-109. doi: 10.1007/978-1-4939-4035-6_8.

Evaluating Enhancer Function and Transcription.

Annu Rev Biochem. 2020 Jun 20;89:213-234. doi: 10.1146/annurev-biochem-011420-095916. Epub 2020 Mar 20.

RFECS: a random-forest based algorithm for enhancer identification from chromatin state.

PLoS Comput Biol. 2013;9(3):e1002968. doi: 10.1371/journal.pcbi.1002968. Epub 2013 Mar 14.

DELTA: A Distal Enhancer Locating Tool Based on AdaBoost Algorithm and Shape Features of Chromatin Modifications.

PLoS One. 2015 Jun 19;10(6):e0130622. doi: 10.1371/journal.pone.0130622. eCollection 2015.

HSA21 Single-Minded 2 (Sim2) Binding Sites Co-Localize with Super-Enhancers and Pioneer Transcription Factors in Pluripotent Mouse ES Cells.

PLoS One. 2015 May 8;10(5):e0126475. doi: 10.1371/journal.pone.0126475. eCollection 2015.

TFAP2C regulates transcription in human naive pluripotency by opening enhancers.

Nat Cell Biol. 2018 May;20(5):553-564. doi: 10.1038/s41556-018-0089-0. Epub 2018 Apr 25.

Cited by

Integrative analysis of transcriptomic and epigenomic data reveals distinct patterns for developmental and housekeeping gene regulation.

BMC Biol. 2024 Apr 10;22(1):78. doi: 10.1186/s12915-024-01869-2.

Epigenomic landscape of enhancer elements during Hydra head organizer formation.

Epigenetics Chromatin. 2020 Oct 12;13(1):43. doi: 10.1186/s13072-020-00364-6.

iEnhancer-ECNN: identifying enhancers and their strength using ensembles of convolutional neural networks.

BMC Genomics. 2019 Dec 24;20(Suppl 9):951. doi: 10.1186/s12864-019-6336-3.

DNA Methylation of Enhancer Elements in Myeloid Neoplasms: Think Outside the Promoters?

Cancers (Basel). 2019 Sep 24;11(10):1424. doi: 10.3390/cancers11101424.

Three-dimensional texture features from intensity and high-order derivative maps for the discrimination between bladder tumors and wall tissues via MRI.

Int J Comput Assist Radiol Surg. 2017 Apr;12(4):645-656. doi: 10.1007/s11548-017-1522-8. Epub 2017 Jan 21.
