• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用进化隐马尔可夫模型对启动子语法进行建模。

Modeling promoter grammars with evolving hidden Markov models.

作者信息

Won Kyoung-Jae, Sandelin Albin, Marstrand Troels Torben, Krogh Anders

机构信息

The Bioinformatics Centre, Department of Biology & Biotech Research and Innovation Centre, University of Copenhagen, Ole Maaloes Vej 5, 2200 Copenhagen N, Denmark.

出版信息

Bioinformatics. 2008 Aug 1;24(15):1669-75. doi: 10.1093/bioinformatics/btn254. Epub 2008 Jun 5.

DOI:10.1093/bioinformatics/btn254
PMID:18535083
Abstract

MOTIVATION

Describing and modeling biological features of eukaryotic promoters remains an important and challenging problem within computational biology. The promoters of higher eukaryotes in particular display a wide variation in regulatory features, which are difficult to model. Often several factors are involved in the regulation of a set of co-regulated genes. If so, promoters can be modeled with connected regulatory features, where the network of connections is characteristic for a particular mode of regulation.

RESULTS

With the goal of automatically deciphering such regulatory structures, we present a method that iteratively evolves an ensemble of regulatory grammars using a hidden Markov Model (HMM) architecture composed of interconnected blocks representing transcription factor binding sites (TFBSs) and background regions of promoter sequences. The ensemble approach reduces the risk of overfitting and generally improves performance. We apply this method to identify TFBSs and to classify promoters preferentially expressed in macrophages, where it outperforms other methods due to the increased predictive power given by the grammar.

AVAILABILITY

The software and the datasets are available from http://modem.ucsd.edu/won/eHMM.tar.gz

摘要

动机

在计算生物学中,描述和建模真核生物启动子的生物学特征仍然是一个重要且具有挑战性的问题。特别是高等真核生物的启动子在调控特征方面表现出广泛的差异,难以进行建模。通常,一组共同调控基因的调控涉及多个因素。如果是这样,启动子可以用相互关联的调控特征来建模,其中连接网络是特定调控模式的特征。

结果

为了自动破解此类调控结构,我们提出了一种方法,该方法使用由代表转录因子结合位点(TFBS)和启动子序列背景区域的相互连接的模块组成的隐马尔可夫模型(HMM)架构,迭代地演化出一组调控语法。这种集成方法降低了过拟合的风险并总体上提高了性能。我们将此方法应用于识别TFBS并对在巨噬细胞中优先表达的启动子进行分类,由于语法赋予的预测能力增强,该方法在这方面优于其他方法。

可用性

软件和数据集可从http://modem.ucsd.edu/won/eHMM.tar.gz获取

相似文献

1
Modeling promoter grammars with evolving hidden Markov models.使用进化隐马尔可夫模型对启动子语法进行建模。
Bioinformatics. 2008 Aug 1;24(15):1669-75. doi: 10.1093/bioinformatics/btn254. Epub 2008 Jun 5.
2
A mixture model-based discriminate analysis for identifying ordered transcription factor binding site pairs in gene promoters directly regulated by estrogen receptor-alpha.基于混合模型的判别分析,用于识别由雌激素受体α直接调控的基因启动子中的有序转录因子结合位点对。
Bioinformatics. 2006 Sep 15;22(18):2210-6. doi: 10.1093/bioinformatics/btl329. Epub 2006 Jun 29.
3
A hidden Markov model for analyzing ChIP-chip experiments on genome tiling arrays and its application to p53 binding sequences.一种用于分析基因组平铺阵列上的染色质免疫沉淀芯片实验的隐马尔可夫模型及其在p53结合序列中的应用。
Bioinformatics. 2005 Jun;21 Suppl 1:i274-82. doi: 10.1093/bioinformatics/bti1046.
4
Finding cis-regulatory modules in Drosophila using phylogenetic hidden Markov models.使用系统发育隐马尔可夫模型在果蝇中寻找顺式调控模块。
Bioinformatics. 2007 Aug 15;23(16):2031-7. doi: 10.1093/bioinformatics/btm299. Epub 2007 Jun 5.
5
Context-specific independence mixture modeling for positional weight matrices.针对位置权重矩阵的上下文特定独立混合建模
Bioinformatics. 2006 Jul 15;22(14):e166-73. doi: 10.1093/bioinformatics/btl249.
6
Finding motifs from all sequences with and without binding sites.从所有具有和不具有结合位点的序列中寻找基序。
Bioinformatics. 2006 Sep 15;22(18):2217-23. doi: 10.1093/bioinformatics/btl371. Epub 2006 Jul 26.
7
Functional inference from non-random distributions of conserved predicted transcription factor binding sites.从保守预测转录因子结合位点的非随机分布进行功能推断
Bioinformatics. 2004 Aug 4;20 Suppl 1:i109-15. doi: 10.1093/bioinformatics/bth908.
8
A graph-based approach to systematically reconstruct human transcriptional regulatory modules.一种基于图形的方法来系统地重建人类转录调控模块。
Bioinformatics. 2007 Jul 1;23(13):i577-86. doi: 10.1093/bioinformatics/btm227.
9
A mammalian promoter model links cis elements to genetic networks.一种哺乳动物启动子模型将顺式元件与基因网络联系起来。
Biochem Biophys Res Commun. 2006 Aug 18;347(1):166-77. doi: 10.1016/j.bbrc.2006.06.062. Epub 2006 Jun 21.
10
Computational modeling of oligonucleotide positional densities for human promoter prediction.用于人类启动子预测的寡核苷酸位置密度的计算建模。
Artif Intell Med. 2005 Sep-Oct;35(1-2):107-19. doi: 10.1016/j.artmed.2005.02.005.

引用本文的文献

1
Interpretable prediction of mRNA abundance from promoter sequence using contextual regression models.使用上下文回归模型从启动子序列对mRNA丰度进行可解释预测。
NAR Genom Bioinform. 2024 May 28;6(2):lqae055. doi: 10.1093/nargab/lqae055. eCollection 2024 Jun.
2
Efficient algorithms for training the parameters of hidden Markov models using stochastic expectation maximization (EM) training and Viterbi training.使用随机期望最大化(EM)训练和维特比训练来训练隐马尔可夫模型参数的高效算法。
Algorithms Mol Biol. 2010 Dec 9;5:38. doi: 10.1186/1748-7188-5-38.
3
An intuitionistic approach to scoring DNA sequences against transcription factor binding site motifs.
一种基于直觉的方法,用于对 DNA 序列进行评分,以对抗转录因子结合位点基序。
BMC Bioinformatics. 2010 Nov 8;11:551. doi: 10.1186/1471-2105-11-551.
4
Multivariate Hawkes process models of the occurrence of regulatory elements.调控元件出现的多元 Hawkes 过程模型。
BMC Bioinformatics. 2010 Sep 9;11:456. doi: 10.1186/1471-2105-11-456.
5
The construction and use of log-odds substitution scores for multiple sequence alignment.多序列比对中对对数几率替换评分的构建和使用。
PLoS Comput Biol. 2010 Jul 15;6(7):e1000852. doi: 10.1371/journal.pcbi.1000852.
6
Evolutionary mirages: selection on binding site composition creates the illusion of conserved grammars in Drosophila enhancers.进化的幻象:结合位点组成上的选择在果蝇增强子中产生了保守语法的错觉。
PLoS Genet. 2010 Jan 22;6(1):e1000829. doi: 10.1371/journal.pgen.1000829.
7
Computational models in plant-pathogen interactions: the case of Phytophthora infestans.植物-病原体相互作用中的计算模型:以致病疫霉为例。
Theor Biol Med Model. 2009 Nov 12;6:24. doi: 10.1186/1742-4682-6-24.
8
Modeling tissue-specific structural patterns in human and mouse promoters.对人类和小鼠启动子中的组织特异性结构模式进行建模。
Nucleic Acids Res. 2010 Jan;38(1):17-25. doi: 10.1093/nar/gkp866. Epub 2009 Oct 22.