Finding regulatory DNA motifs using alignment-free evolutionary conservation information.

Affiliation

Department of Computer Science, Duke University, Box 90129, Durham, NC 27708, USA.

Publication information

Nucleic Acids Res. 2010 Apr;38(6):e90. doi: 10.1093/nar/gkp1166. Epub 2010 Jan 4.

DOI:10.1093/nar/gkp1166

PMID:20047961

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2847231/

Abstract

As an increasing number of eukaryotic genomes are being sequenced, comparative studies aimed at detecting regulatory elements in intergenic sequences are becoming more prevalent. Most comparative methods for transcription factor (TF) binding site discovery make use of global or local alignments of orthologous regulatory regions to assess whether a particular DNA site is conserved across related organisms, and thus more likely to be functional. Since binding sites are usually short, sometimes degenerate, and often independent of orientation, alignment algorithms may not align them correctly. Here, we present a novel, alignment-free approach for using conservation information for TF binding site discovery. We relax the definition of conserved sites: we consider a DNA site within a regulatory region to be conserved in an orthologous sequence if it occurs anywhere in that sequence, irrespective of orientation. We use this definition to derive informative priors over DNA sequence positions, and incorporate these priors into a Gibbs sampling algorithm for motif discovery. Our approach is simple and fast. It requires neither sequence alignments nor the phylogenetic relationships between the orthologous sequences, yet it is more effective on real biological data than methods that do.
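The relaxed definition of conservation in the abstract can be illustrated with a minimal Python sketch. It scores each candidate site in a reference regulatory region by the fraction of orthologous sequences that contain the site, or its reverse complement, anywhere, and then normalizes those scores into a prior over start positions. The function names, the exact-match criterion, the fixed site width, and the pseudocount are all assumptions made for illustration; the abstract does not specify how the published method matches sites or computes its priors.

```python
from typing import List

# Translation table for complementing DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(site: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return site.translate(_COMPLEMENT)[::-1]


def conservation_scores(reference: str, orthologs: List[str], width: int) -> List[float]:
    """Score every width-long site in `reference`.

    A site counts as conserved in an ortholog if the site or its reverse
    complement occurs anywhere in that ortholog (exact match; this matching
    rule is an assumption of the sketch). The score is the fraction of
    orthologs in which the site is conserved.
    """
    if not orthologs:
        raise ValueError("at least one orthologous sequence is required")
    scores = []
    for start in range(len(reference) - width + 1):
        site = reference[start:start + width]
        rc = reverse_complement(site)
        hits = sum(1 for seq in orthologs if site in seq or rc in seq)
        scores.append(hits / len(orthologs))
    return scores


def positional_prior(scores: List[float], pseudocount: float = 0.1) -> List[float]:
    """Normalize scores into a probability distribution over start positions.

    The pseudocount (a hypothetical choice) keeps unconserved positions
    from receiving zero prior probability.
    """
    weights = [s + pseudocount for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


if __name__ == "__main__":
    reference = "ACGTTTGA"
    orthologs = ["GGACGTCC", "TTCAAACC"]
    scores = conservation_scores(reference, orthologs, width=4)
    print(scores)
    print(positional_prior(scores))
```

In a Gibbs sampler for motif discovery, a prior of this kind would typically multiply the likelihood-based weight of each candidate start position before a new position is sampled, so that conserved sites are favored without any alignment or phylogenetic tree.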
