MotifLab：用于基序发现和调控序列分析的工具和数据集成工作台。

MotifLab: a tools and data integration workbench for motif discovery and regulatory sequence analysis.

机构信息

Department of Cancer Research and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway.

出版信息

BMC Bioinformatics. 2013 Jan 16;14:9. doi: 10.1186/1471-2105-14-9.

DOI:10.1186/1471-2105-14-9

PMID:23323883

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3556059/

Abstract

BACKGROUND

Traditional methods for computational motif discovery often suffer from poor performance. In particular, methods that search for sequence matches to known binding motifs tend to predict many non-functional binding sites because they fail to take into consideration the biological state of the cell. In recent years, genome-wide studies have generated a lot of data that has the potential to improve our ability to identify functional motifs and binding sites, such as information about chromatin accessibility and epigenetic states in different cell types. However, it is not always trivial to make use of this data in combination with existing motif discovery tools, especially for researchers who are not skilled in bioinformatics programming.

RESULTS

Here we present MotifLab, a general workbench for analysing regulatory sequence regions and discovering transcription factor binding sites and cis-regulatory modules. MotifLab supports comprehensive motif discovery and analysis by allowing users to integrate several popular motif discovery tools as well as different kinds of additional information, including phylogenetic conservation, epigenetic marks, DNase hypersensitive sites, ChIP-Seq data, positional binding preferences of transcription factors, transcription factor interactions and gene expression. MotifLab offers several data-processing operations that can be used to create, manipulate and analyse data objects, and complete analysis workflows can be constructed and automatically executed within MotifLab, including graphical presentation of the results.

CONCLUSIONS

We have developed MotifLab as a flexible workbench for motif analysis in a genomic context. The flexibility and effectiveness of this workbench has been demonstrated on selected test cases, in particular two previously published benchmark data sets for single motifs and modules, and a realistic example of genes responding to treatment with forskolin. MotifLab is freely available at http://www.motiflab.org.

摘要

背景

传统的计算基序发现方法通常性能不佳。特别是，搜索与已知结合基序序列匹配的方法往往会预测许多非功能结合位点，因为它们没有考虑到细胞的生物学状态。近年来，全基因组研究产生了大量的数据，这些数据有可能提高我们识别功能基序和结合位点的能力，例如不同细胞类型中染色质可及性和表观遗传状态的信息。然而，将这些数据与现有的基序发现工具结合使用并不总是一件简单的事情，特别是对于不擅长生物信息学编程的研究人员来说。

结果

在这里，我们提出了 MotifLab，这是一个用于分析调控序列区域和发现转录因子结合位点和顺式调控模块的通用工作台。MotifLab 通过允许用户集成几个流行的基序发现工具以及不同类型的附加信息，包括系统发育保守性、表观遗传标记、DNase 超敏位点、ChIP-Seq 数据、转录因子的位置结合偏好、转录因子相互作用和基因表达，来支持全面的基序发现和分析。MotifLab 提供了几种数据处理操作，可以用于创建、操作和分析数据对象，并且可以在 MotifLab 内构建和自动执行完整的分析工作流程，包括结果的图形表示。

结论

我们已经开发了 MotifLab 作为基因组背景下基序分析的灵活工作台。该工作台的灵活性和有效性已在选定的测试用例中得到证明，特别是两个以前发表的用于单基序和模块的基准数据集，以及一个使用 forskolin 处理基因的现实示例。MotifLab 可在 http://www.motiflab.org 免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ea5/3556059/f4388ecef693/1471-2105-14-9-1.jpg

相似文献

MotifLab: a tools and data integration workbench for motif discovery and regulatory sequence analysis.MotifLab：用于基序发现和调控序列分析的工具和数据集成工作台。

BMC Bioinformatics. 2013 Jan 16;14:9. doi: 10.1186/1471-2105-14-9.

TrawlerWeb: an online de novo motif discovery tool for next-generation sequencing datasets.拖网生物：下一代测序数据集的在线从头基序发现工具。

BMC Genomics. 2018 Apr 5;19(1):238. doi: 10.1186/s12864-018-4630-0.

Comparative analysis of regulatory motif discovery tools for transcription factor binding sites.用于转录因子结合位点的调控基序发现工具的比较分析。

Genomics Proteomics Bioinformatics. 2007 May;5(2):131-42. doi: 10.1016/S1672-0229(07)60023-0.

Assessment of composite motif discovery methods.复合基序发现方法的评估。

BMC Bioinformatics. 2008 Feb 26;9:123. doi: 10.1186/1471-2105-9-123.

EXTREME: an online EM algorithm for motif discovery.极端：一种用于基序发现的在线 EM 算法。

Bioinformatics. 2014 Jun 15;30(12):1667-73. doi: 10.1093/bioinformatics/btu093. Epub 2014 Feb 14.

COPS: detecting co-occurrence and spatial arrangement of transcription factor binding motifs in genome-wide datasets.COPS：在全基因组数据集中检测转录因子结合基序的共现和空间排列。

PLoS One. 2012;7(12):e52055. doi: 10.1371/journal.pone.0052055. Epub 2012 Dec 18.

A Monte Carlo-based framework enhances the discovery and interpretation of regulatory sequence motifs.基于蒙特卡罗的框架增强了调控序列基序的发现和解释。

BMC Bioinformatics. 2012 Nov 27;13:317. doi: 10.1186/1471-2105-13-317.

MEME-ChIP: motif analysis of large DNA datasets.MEME-ChIP：大 DNA 数据集的基序分析。

Bioinformatics. 2011 Jun 15;27(12):1696-7. doi: 10.1093/bioinformatics/btr189. Epub 2011 Apr 12.

MEDEA: analysis of transcription factor binding motifs in accessible chromatin.Medea：分析可及染色质中转录因子结合基序。

Genome Res. 2020 May;30(5):736-748. doi: 10.1101/gr.260877.120. Epub 2020 May 18.

CompMoby: comparative MobyDick for detection of cis-regulatory motifs.CompMoby：用于检测顺式调控基序的比较型《白鲸记》工具

BMC Bioinformatics. 2008 Oct 27;9:455. doi: 10.1186/1471-2105-9-455.

引用本文的文献

Genomic structure and transcript analysis of the Rapid Alkalinization Factor (RALF) gene family during host-pathogen crosstalk in Fragaria vesca and Fragaria x ananassa strawberry.基因组结构和转录分析快速碱化因子（RALF）基因家族在草莓 Fragaria vesca 和 Fragaria x ananassa 与病原菌互作过程中的作用。

PLoS One. 2020 Mar 26;15(3):e0226448. doi: 10.1371/journal.pone.0226448. eCollection 2020.

MODSIDE: a motif discovery pipeline and similarity detector.MODSIDE：一种基序发现管道和相似度探测器。

BMC Genomics. 2018 Oct 19;19(1):755. doi: 10.1186/s12864-018-5148-1.

Prediction and Validation of Transcription Factors Modulating the Expression of Sestrin3 Gene Using an Integrated Computational and Experimental Approach.使用综合计算和实验方法对调控Sestrin3基因表达的转录因子进行预测和验证

PLoS One. 2016 Jul 28;11(7):e0160228. doi: 10.1371/journal.pone.0160228. eCollection 2016.

DynaMIT: the dynamic motif integration toolkit.DynaMIT：动态基序整合工具包。

Nucleic Acids Res. 2016 Jan 8;44(1):e2. doi: 10.1093/nar/gkv807. Epub 2015 Aug 7.

c-Myb Binding Sites in Haematopoietic Chromatin Landscapes.造血染色质景观中的c-Myb结合位点

PLoS One. 2015 Jul 24;10(7):e0133280. doi: 10.1371/journal.pone.0133280. eCollection 2015.

GUDM: Automatic Generation of Unified Datasets for Learning and Reasoning in Healthcare.GUDM：用于医疗保健领域学习与推理的统一数据集自动生成

Sensors (Basel). 2015 Jul 2;15(7):15772-98. doi: 10.3390/s150715772.

Genome Wide Binding Site Analysis Reveals Transcriptional Coactivation of Cytokinin-Responsive Genes by DELLA Proteins.全基因组结合位点分析揭示DELLA蛋白对细胞分裂素响应基因的转录共激活作用。

PLoS Genet. 2015 Jul 2;11(7):e1005337. doi: 10.1371/journal.pgen.1005337. eCollection 2015 Jul.

Large-scale identification of gibberellin-related transcription factors defines group VII ETHYLENE RESPONSE FACTORS as functional DELLA partners.赤霉素相关转录因子的大规模鉴定将VII组乙烯响应因子定义为功能性DELLA蛋白互作伙伴。

Plant Physiol. 2014 Oct;166(2):1022-32. doi: 10.1104/pp.114.244723. Epub 2014 Aug 12.

The catecholamine biosynthetic enzyme dopamine β-hydroxylase (DBH): first genome-wide search positions trait-determining variants acting additively in the proximal promoter.儿茶酚胺生物合成酶多巴胺β-羟化酶（DBH）：首次全基因组搜索确定在近端启动子中以加性方式起作用的性状决定变体的位置。

Hum Mol Genet. 2014 Dec 1;23(23):6375-84. doi: 10.1093/hmg/ddu332. Epub 2014 Jun 30.

Otx2 ChIP-seq reveals unique and redundant functions in the mature mouse retina.Otx2染色质免疫沉淀测序揭示了成熟小鼠视网膜中的独特功能和冗余功能。

PLoS One. 2014 Feb 18;9(2):e89110. doi: 10.1371/journal.pone.0089110. eCollection 2014.

本文引用的文献

An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。

Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.

i-cisTarget: an integrative genomics method for the prediction of regulatory features and cis-regulatory modules.i-cisTarget：一种综合基因组学方法，用于预测调控特征和顺式调控模块。

Nucleic Acids Res. 2012 Aug;40(15):e114. doi: 10.1093/nar/gks543. Epub 2012 Jun 20.

ScerTF: a comprehensive database of benchmarked position weight matrices for Saccharomyces species.ScerTF：酿酒酵母属物种基准化位置权重矩阵的综合数据库。

Nucleic Acids Res. 2012 Jan;40(Database issue):D162-8. doi: 10.1093/nar/gkr1180. Epub 2011 Dec 2.

The UCSC Genome Browser database: extensions and updates 2011.UCSC 基因组浏览器数据库：扩展和更新 2011 年版。

Nucleic Acids Res. 2012 Jan;40(Database issue):D918-23. doi: 10.1093/nar/gkr1055. Epub 2011 Nov 15.

Epigenetic priors for identifying active transcription factor binding sites.用于识别活性转录因子结合位点的表观遗传先验

Bioinformatics. 2012 Jan 1;28(1):56-62. doi: 10.1093/bioinformatics/btr614. Epub 2011 Nov 8.

Decoding the genome with an integrative analysis tool: combinatorial CRM Decoder.利用整合分析工具解码基因组：组合 CRM 解码器。

Nucleic Acids Res. 2011 Sep 1;39(17):e116. doi: 10.1093/nar/gkr516. Epub 2011 Jun 30.

RSAT 2011: regulatory sequence analysis tools.RSAT 2011：调控序列分析工具。

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W86-91. doi: 10.1093/nar/gkr377.

GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge.GRISOTTO：一种利用先验知识改进用于基序发现的组合算法的贪心方法。

Algorithms Mol Biol. 2011 Apr 22;6:13. doi: 10.1186/1748-7188-6-13.

FIMO: scanning for occurrences of a given motif.FIMO：扫描给定基序的出现情况。

Bioinformatics. 2011 Apr 1;27(7):1017-8. doi: 10.1093/bioinformatics/btr064. Epub 2011 Feb 16.

CompleteMOTIFs: DNA motif discovery platform for transcription factor binding experiments.CompleteMOTIFs：用于转录因子结合实验的 DNA motif 发现平台。

Bioinformatics. 2011 Mar 1;27(5):715-7. doi: 10.1093/bioinformatics/btq707. Epub 2010 Dec 23.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

MotifLab：用于基序发现和调控序列分析的工具和数据集成工作台。

MotifLab: a tools and data integration workbench for motif discovery and regulatory sequence analysis.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献