• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

活性增强子的位置可以根据染色质标记和集体序列基序数据准确预测。

Active enhancer positions can be accurately predicted from chromatin marks and collective sequence motif data.

作者信息

Podsiadło Agnieszka, Wrzesień Mariusz, Paja Wiesław, Rudnicki Witold, Wilczyński Bartek

出版信息

BMC Syst Biol. 2013;7 Suppl 6(Suppl 6):S16. doi: 10.1186/1752-0509-7-S6-S16. Epub 2013 Dec 13.

DOI:10.1186/1752-0509-7-S6-S16
PMID:24565409
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4029456/
Abstract

BACKGROUND

Transcriptional regulation in multi-cellular organisms is a complex process involving multiple modular regulatory elements for each gene. Building whole-genome models of transcriptional networks requires mapping all relevant enhancers and then linking them to target genes. Previous methods of enhancer identification based either on sequence information or on epigenetic marks have different limitations stemming from incompleteness of each of these datasets taken separately.

RESULTS

In this work we present a new approach for discovery of regulatory elements based on the combination of sequence motifs and epigenetic marks measured with ChIP-Seq. Our method uses supervised learning approaches to train a model describing the dependence of enhancer activity on sequence features and histone marks. Our results indicate that using combination of features provides superior results to previous approaches based on either one of the datasets. While histone modifications remain the dominant feature for accurate predictions, the models based on sequence motifs have advantages in their general applicability to different tissues. Additionally, we assess the relevance of different sequence motifs in prediction accuracy showing that even tissue-specific enhancer activity depends on multiple motifs.

CONCLUSIONS

Based on our results, we conclude that it is worthwhile to include sequence motif data into computational approaches to active enhancer prediction and also that classifiers trained on a specific set of enhancers can generalize with significant accuracy beyond the training set.

摘要

背景

多细胞生物中的转录调控是一个复杂的过程,每个基因都涉及多个模块化调控元件。构建转录网络的全基因组模型需要绘制所有相关增强子,然后将它们与靶基因连接起来。以前基于序列信息或表观遗传标记的增强子识别方法,由于单独使用的每个数据集都不完整,存在不同的局限性。

结果

在这项工作中,我们提出了一种基于序列基序和通过ChIP-Seq测量的表观遗传标记相结合的调控元件发现新方法。我们的方法使用监督学习方法来训练一个模型,该模型描述增强子活性对序列特征和组蛋白标记的依赖性。我们的结果表明,使用特征组合比基于单个数据集的先前方法提供了更好的结果。虽然组蛋白修饰仍然是准确预测的主要特征,但基于序列基序的模型在对不同组织的普遍适用性方面具有优势。此外,我们评估了不同序列基序在预测准确性中的相关性,表明即使是组织特异性增强子活性也依赖于多个基序。

结论

基于我们的结果,我们得出结论,将序列基序数据纳入活跃增强子预测的计算方法是值得的,并且在一组特定增强子上训练的分类器可以以显著的准确性推广到训练集之外。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec97/4029456/f3d7480123bf/1752-0509-7-S6-S16-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec97/4029456/af9bfcbf5bbe/1752-0509-7-S6-S16-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec97/4029456/9a55570940e3/1752-0509-7-S6-S16-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec97/4029456/24f7cee4ce9d/1752-0509-7-S6-S16-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec97/4029456/f3d7480123bf/1752-0509-7-S6-S16-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec97/4029456/af9bfcbf5bbe/1752-0509-7-S6-S16-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec97/4029456/9a55570940e3/1752-0509-7-S6-S16-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec97/4029456/24f7cee4ce9d/1752-0509-7-S6-S16-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec97/4029456/f3d7480123bf/1752-0509-7-S6-S16-4.jpg

相似文献

1
Active enhancer positions can be accurately predicted from chromatin marks and collective sequence motif data.活性增强子的位置可以根据染色质标记和集体序列基序数据准确预测。
BMC Syst Biol. 2013;7 Suppl 6(Suppl 6):S16. doi: 10.1186/1752-0509-7-S6-S16. Epub 2013 Dec 13.
2
Opening up the blackbox: an interpretable deep neural network-based classifier for cell-type specific enhancer predictions.打开黑箱:一种基于可解释深度神经网络的细胞类型特异性增强子预测分类器。
BMC Syst Biol. 2016 Aug 1;10 Suppl 2(Suppl 2):54. doi: 10.1186/s12918-016-0302-3.
3
Enhancer prediction with histone modification marks using a hybrid neural network model.基于组蛋白修饰标记的增强子预测的混合神经网络模型。
Methods. 2019 Aug 15;166:48-56. doi: 10.1016/j.ymeth.2019.03.014. Epub 2019 Mar 21.
4
RFECS: a random-forest based algorithm for enhancer identification from chromatin state.RFECS:一种基于随机森林的算法,用于从染色质状态中识别增强子。
PLoS Comput Biol. 2013;9(3):e1002968. doi: 10.1371/journal.pcbi.1002968. Epub 2013 Mar 14.
5
Prediction of regulatory elements in mammalian genomes using chromatin signatures.利用染色质特征预测哺乳动物基因组中的调控元件。
BMC Bioinformatics. 2008 Dec 18;9:547. doi: 10.1186/1471-2105-9-547.
6
Enhancer prediction in the human genome by probabilistic modelling of the chromatin feature patterns.通过对染色质特征模式的概率建模来预测人类基因组中的增强子。
BMC Bioinformatics. 2020 Jul 20;21(1):317. doi: 10.1186/s12859-020-03621-3.
7
Integrating diverse datasets improves developmental enhancer prediction.整合多种数据集可提高发育增强子预测的准确性。
PLoS Comput Biol. 2014 Jun 26;10(6):e1003677. doi: 10.1371/journal.pcbi.1003677. eCollection 2014 Jun.
8
Sequence based prediction of enhancer regions from DNA random walk.基于 DNA 随机游走的增强子区域的序列预测。
Sci Rep. 2018 Oct 29;8(1):15912. doi: 10.1038/s41598-018-33413-y.
9
Prediction accuracy of regulatory elements from sequence varies by functional sequencing technique.从序列预测调控元件的准确性因功能测序技术而异。
Front Cell Infect Microbiol. 2023 Aug 2;13:1182567. doi: 10.3389/fcimb.2023.1182567. eCollection 2023.
10
Short DNA sequence patterns accurately identify broadly active human enhancers.短DNA序列模式可准确识别广泛活跃的人类增强子。
BMC Genomics. 2017 Jul 17;18(1):536. doi: 10.1186/s12864-017-3934-9.

引用本文的文献

1
Prediction of target genes and functional types of cis-regulatory modules in the human genome reveals their distinct properties.人类基因组中顺式调控模块的靶基因预测及其功能类型揭示了它们的独特性质。
BMC Biol. 2025 Jul 15;23(1):211. doi: 10.1186/s12915-025-02313-9.
2
A map of cis-regulatory modules and constituent transcription factor binding sites in 80% of the mouse genome.在 80%的小鼠基因组中顺式调控模块和组成转录因子结合位点的图谱。
BMC Genomics. 2022 Oct 19;23(1):714. doi: 10.1186/s12864-022-08933-7.
3
Accurate prediction of functional states of cis-regulatory modules reveals common epigenetic rules in humans and mice.

本文引用的文献

1
Predicting spatial and temporal gene expression using an integrative model of transcription factor occupancy and chromatin state.利用转录因子占据和染色质状态的综合模型预测时空基因表达。
PLoS Comput Biol. 2012;8(12):e1002798. doi: 10.1371/journal.pcbi.1002798. Epub 2012 Dec 6.
2
A transcription factor collective defines cardiac cell fate and reflects lineage history.转录因子集体定义了心脏细胞的命运,并反映了谱系历史。
Cell. 2012 Feb 3;148(3):473-86. doi: 10.1016/j.cell.2012.01.030.
3
Tissue-specific analysis of chromatin state identifies temporal signatures of enhancer activity during embryonic development.
准确预测顺式调控模块的功能状态揭示了人类和小鼠中常见的表观遗传规律。
BMC Biol. 2022 Oct 5;20(1):221. doi: 10.1186/s12915-022-01426-9.
4
Genome-wide identification of associations between enhancer and alternative splicing in human and mouse.在人类和小鼠中,全基因组鉴定增强子与可变剪接之间的关联。
BMC Genomics. 2022 May 9;22(Suppl 5):919. doi: 10.1186/s12864-022-08537-1.
5
Taking promoters out of enhancers in sequence based predictions of tissue-specific mammalian enhancers.在基于序列的组织特异性哺乳动物增强子预测中,将启动子从增强子中去除。
BMC Med Genomics. 2017 May 24;10(Suppl 1):34. doi: 10.1186/s12920-017-0264-3.
6
EP-DNN: A Deep Neural Network-Based Global Enhancer Prediction Algorithm.EP-DNN:一种基于深度神经网络的全局增强子预测算法。
Sci Rep. 2016 Dec 8;6:38433. doi: 10.1038/srep38433.
7
In silico identification of enhancers on the basis of a combination of transcription factor binding motif occurrences.基于转录因子结合基序出现情况的组合对增强子进行计算机识别。
Sci Rep. 2016 Sep 1;6:32476. doi: 10.1038/srep32476.
8
An Annotation Agnostic Algorithm for Detecting Nascent RNA Transcripts in GRO-Seq.一种在 GRO-Seq 中检测新生 RNA 转录本的无注释算法。
IEEE/ACM Trans Comput Biol Bioinform. 2017 Sep-Oct;14(5):1070-1081. doi: 10.1109/TCBB.2016.2520919. Epub 2016 Jan 26.
9
Computational schemes for the prediction and annotation of enhancers from epigenomic assays.基于表观基因组分析预测和注释增强子的计算方案。
Methods. 2015 Jan 15;72:86-94. doi: 10.1016/j.ymeth.2014.10.008. Epub 2014 Oct 15.
组织特异性分析染色质状态可确定胚胎发育过程中增强子活性的时间特征。
Nat Genet. 2012 Jan 8;44(2):148-56. doi: 10.1038/ng.1064.
4
Dynamic CRM occupancy reflects a temporal map of developmental progression.动态 CRM 占有率反映了发展进程的时间图谱。
Mol Syst Biol. 2010 Jun 22;6:383. doi: 10.1038/msb.2010.35.
5
Combinatorial binding predicts spatio-temporal cis-regulatory activity.组合结合预测时空顺式调控活性。
Nature. 2009 Nov 5;462(7269):65-70. doi: 10.1038/nature08531.
6
Challenges for modeling global gene regulatory networks during development: insights from Drosophila.发育过程中全球基因调控网络建模面临的挑战:来自果蝇的见解。
Dev Biol. 2010 Apr 15;340(2):161-9. doi: 10.1016/j.ydbio.2009.10.032. Epub 2009 Oct 27.
7
Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs.利用一组通用基序寻找进化上保守的顺式调控模块。
BMC Bioinformatics. 2009 Mar 10;10:82. doi: 10.1186/1471-2105-10-82.
8
ChIP-seq accurately predicts tissue-specific activity of enhancers.染色质免疫沉淀测序(ChIP-seq)能准确预测增强子的组织特异性活性。
Nature. 2009 Feb 12;457(7231):854-8. doi: 10.1038/nature07730.
9
BNFinder: exact and efficient method for learning Bayesian networks.BNFinder:用于学习贝叶斯网络的精确且高效的方法。
Bioinformatics. 2009 Jan 15;25(2):286-7. doi: 10.1093/bioinformatics/btn505. Epub 2008 Sep 30.
10
Evolution of the mammalian transcription factor binding repertoire via transposable elements.通过转座元件实现的哺乳动物转录因子结合序列的进化
Genome Res. 2008 Nov;18(11):1752-62. doi: 10.1101/gr.080663.108. Epub 2008 Aug 5.