Posch Stefan, Grau Jan, Gohr Andre, Ben-Gal Irad, Kel Alexander E, Grosse Ivo
Institute of Computer Science, University Halle, 06099 Halle (Saale), Germany.
J Bioinform Comput Biol. 2007 Apr;5(2B):561-77. doi: 10.1142/s0219720007002886.
Variable order Markov models and variable order Bayesian trees have been proposed for the recognition of cis-regulatory elements, and it has been demonstrated that they outperform traditional models such as position weight matrices, Markov models, and Bayesian trees for the recognition of binding sites in prokaryotes. Here, we study to which degree variable order models can improve the recognition of eukaryotic cis-regulatory elements. We find that variable order models can improve the recognition of binding sites of all the studied transcription factors. To ease a systematic evaluation of different model combinations based on problem-specific data sets and allow genomic scans of cis-regulatory elements based on fixed and variable order Markov models and Bayesian trees, we provide the VOMBATserver to the public community.
可变阶马尔可夫模型和可变阶贝叶斯树已被提出用于识别顺式调控元件,并且已经证明,在识别原核生物中的结合位点方面,它们优于传统模型,如位置权重矩阵、马尔可夫模型和贝叶斯树。在这里,我们研究可变阶模型在多大程度上可以提高对真核生物顺式调控元件的识别能力。我们发现可变阶模型可以提高对所有研究的转录因子结合位点的识别能力。为了便于基于特定问题的数据集对不同模型组合进行系统评估,并允许基于固定阶和可变阶马尔可夫模型以及贝叶斯树对顺式调控元件进行基因组扫描,我们向公共社区提供了VOMBAT服务器。