Suppr超能文献

VOMBAT:使用可变阶贝叶斯树预测转录因子结合位点

VOMBAT: prediction of transcription factor binding sites using variable order Bayesian trees.

作者信息

Grau Jan, Ben-Gal Irad, Posch Stefan, Grosse Ivo

机构信息

Institute of Computer Science, University Halle, 06099 Halle, Saale, Germany.

出版信息

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W529-33. doi: 10.1093/nar/gkl212.

Abstract

Variable order Markov models and variable order Bayesian trees have been proposed for the recognition of transcription factor binding sites, and it could be demonstrated that they outperform traditional models, such as position weight matrices, Markov models and Bayesian trees. We develop a web server for the recognition of DNA binding sites based on variable order Markov models and variable order Bayesian trees offering the following functionality: (i) given datasets with annotated binding sites and genomic background sequences, variable order Markov models and variable order Bayesian trees can be trained; (ii) given a set of trained models, putative DNA binding sites can be predicted in a given set of genomic sequences and (iii) given a dataset with annotated binding sites and a dataset with genomic background sequences, cross-validation experiments for different model combinations with different parameter settings can be performed. Several of the offered services are computationally demanding, such as genome-wide predictions of DNA binding sites in mammalian genomes or sets of 10(4)-fold cross-validation experiments for different model combinations based on problem-specific data sets. In order to execute these jobs, and in order to serve multiple users at the same time, the web server is attached to a Linux cluster with 150 processors. VOMBAT is available at http://pdw-24.ipk-gatersleben.de:8080/VOMBAT/.

摘要

可变阶马尔可夫模型和可变阶贝叶斯树已被用于识别转录因子结合位点,并且可以证明它们优于传统模型,如位置权重矩阵、马尔可夫模型和贝叶斯树。我们基于可变阶马尔可夫模型和可变阶贝叶斯树开发了一个用于识别DNA结合位点的网络服务器,它具有以下功能:(i)给定带有注释的结合位点和基因组背景序列的数据集,可以训练可变阶马尔可夫模型和可变阶贝叶斯树;(ii)给定一组训练好的模型,可以在给定的基因组序列集中预测潜在的DNA结合位点;(iii)给定一个带有注释的结合位点的数据集和一个带有基因组背景序列的数据集,可以针对不同参数设置的不同模型组合进行交叉验证实验。所提供的一些服务对计算要求很高,例如在哺乳动物基因组中进行全基因组范围的DNA结合位点预测,或基于特定问题数据集对不同模型组合进行10(4)倍交叉验证实验。为了执行这些任务,并为多个用户同时提供服务,该网络服务器连接到一个拥有150个处理器的Linux集群。VOMBAT可在http://pdw-24.ipk-gatersleben.de:8080/VOMBAT/获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a3c/1538886/d6d0a75cc591/gkl212f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验