• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CSMET:通过多分辨率系统发育影子进行比较基因组基序检测

CSMET: comparative genomic motif detection via multi-resolution phylogenetic shadowing.

作者信息

Ray Pradipta, Shringarpure Suyash, Kolar Mladen, Xing Eric P

机构信息

School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America.

出版信息

PLoS Comput Biol. 2008 Jun 6;4(6):e1000090. doi: 10.1371/journal.pcbi.1000090.

DOI:10.1371/journal.pcbi.1000090
PMID:18535663
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2396503/
Abstract

Functional turnover of transcription factor binding sites (TFBSs), such as whole-motif loss or gain, are common events during genome evolution. Conventional probabilistic phylogenetic shadowing methods model the evolution of genomes only at nucleotide level, and lack the ability to capture the evolutionary dynamics of functional turnover of aligned sequence entities. As a result, comparative genomic search of non-conserved motifs across evolutionarily related taxa remains a difficult challenge, especially in higher eukaryotes, where the cis-regulatory regions containing motifs can be long and divergent; existing methods rely heavily on specialized pattern-driven heuristic search or sampling algorithms, which can be difficult to generalize and hard to interpret based on phylogenetic principles. We propose a new method: Conditional Shadowing via Multi-resolution Evolutionary Trees, or CSMET, which uses a context-dependent probabilistic graphical model that allows aligned sites from different taxa in a multiple alignment to be modeled by either a background or an appropriate motif phylogeny conditioning on the functional specifications of each taxon. The functional specifications themselves are the output of a phylogeny which models the evolution not of individual nucleotides, but of the overall functionality (e.g., functional retention or loss) of the aligned sequence segments over lineages. Combining this method with a hidden Markov model that autocorrelates evolutionary rates on successive sites in the genome, CSMET offers a principled way to take into consideration lineage-specific evolution of TFBSs during motif detection, and a readily computable analytical form of the posterior distribution of motifs under TFBS turnover. On both simulated and real Drosophila cis-regulatory modules, CSMET outperforms other state-of-the-art comparative genomic motif finders.

摘要

转录因子结合位点(TFBSs)的功能转换,如整个基序的丢失或获得,是基因组进化过程中的常见事件。传统的概率系统发育阴影方法仅在核苷酸水平上对基因组进化进行建模,缺乏捕捉比对序列实体功能转换的进化动态的能力。因此,在进化相关的分类群中对非保守基序进行比较基因组搜索仍然是一项艰巨的挑战,尤其是在高等真核生物中,其中包含基序的顺式调控区域可能很长且存在差异;现有方法严重依赖专门的模式驱动启发式搜索或采样算法,这些方法可能难以推广且难以根据系统发育原理进行解释。我们提出了一种新方法:通过多分辨率进化树进行条件阴影法(CSMET),该方法使用一种上下文相关的概率图形模型,该模型允许通过背景或基于每个分类单元功能规范的适当基序系统发育来对多重比对中不同分类单元的比对位点进行建模。功能规范本身是一个系统发育的输出,该系统发育不是对单个核苷酸的进化进行建模,而是对谱系中比对序列片段的整体功能(例如功能保留或丧失)进行建模。将该方法与一个隐藏马尔可夫模型相结合,该模型使基因组中连续位点的进化速率自相关,CSMET提供了一种有原则的方法,在基序检测过程中考虑TFBSs的谱系特异性进化,以及在TFBS转换下基序后验分布的易于计算的分析形式。在模拟的和真实的果蝇顺式调控模块上,CSMET均优于其他最先进的比较基因组基序发现工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/f63942b9a732/pcbi.1000090.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/9afbb51b4bb1/pcbi.1000090.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/02942cdac82e/pcbi.1000090.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/3ced84329634/pcbi.1000090.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/c68fd39d854e/pcbi.1000090.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/cec292a86408/pcbi.1000090.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/759273e35fbb/pcbi.1000090.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/0b791610bdf4/pcbi.1000090.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/189b5aeff342/pcbi.1000090.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/d322497ec77c/pcbi.1000090.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/b78d6cfcac92/pcbi.1000090.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/107d5339bde8/pcbi.1000090.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/cc0b8dffc07c/pcbi.1000090.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/f63942b9a732/pcbi.1000090.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/9afbb51b4bb1/pcbi.1000090.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/02942cdac82e/pcbi.1000090.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/3ced84329634/pcbi.1000090.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/c68fd39d854e/pcbi.1000090.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/cec292a86408/pcbi.1000090.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/759273e35fbb/pcbi.1000090.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/0b791610bdf4/pcbi.1000090.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/189b5aeff342/pcbi.1000090.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/d322497ec77c/pcbi.1000090.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/b78d6cfcac92/pcbi.1000090.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/107d5339bde8/pcbi.1000090.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/cc0b8dffc07c/pcbi.1000090.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/364b/2396503/f63942b9a732/pcbi.1000090.g013.jpg

相似文献

1
CSMET: comparative genomic motif detection via multi-resolution phylogenetic shadowing.CSMET:通过多分辨率系统发育影子进行比较基因组基序检测
PLoS Comput Biol. 2008 Jun 6;4(6):e1000090. doi: 10.1371/journal.pcbi.1000090.
2
Phylogenetic simulation of promoter evolution: estimation and modeling of binding site turnover events and assessment of their impact on alignment tools.启动子进化的系统发育模拟:结合位点周转事件的估计与建模及其对序列比对工具影响的评估
Genome Biol. 2007;8(10):R225. doi: 10.1186/gb-2007-8-10-r225.
3
Multiple-sequence functional annotation and the generalized hidden Markov phylogeny.多序列功能注释与广义隐马尔可夫系统发育
Bioinformatics. 2004 Aug 12;20(12):1850-60. doi: 10.1093/bioinformatics/bth153. Epub 2004 Feb 26.
4
Alignment and prediction of cis-regulatory modules based on a probabilistic model of evolution.基于进化概率模型的顺式调控模块的比对与预测。
PLoS Comput Biol. 2009 Mar;5(3):e1000299. doi: 10.1371/journal.pcbi.1000299. Epub 2009 Mar 13.
5
PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.PhyloGibbs:一种整合了系统发育的吉布斯采样基序查找器。
PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9.
6
Searching for statistically significant regulatory modules.寻找具有统计学意义的调控模块。
Bioinformatics. 2003 Oct;19 Suppl 2:ii16-25. doi: 10.1093/bioinformatics/btg1054.
7
Comparative genomics reveals unusually long motifs in mammalian genomes.比较基因组学揭示了哺乳动物基因组中异常长的基序。
Bioinformatics. 2006 Jul 15;22(14):e236-42. doi: 10.1093/bioinformatics/btl265.
8
Gene finding with a hidden Markov model of genome structure and evolution.基于基因组结构与进化的隐马尔可夫模型进行基因发现。
Bioinformatics. 2003 Jan 22;19(2):219-27. doi: 10.1093/bioinformatics/19.2.219.
9
MotifCut: regulatory motifs finding with maximum density subgraphs.MotifCut:通过最大密度子图寻找调控基序
Bioinformatics. 2006 Jul 15;22(14):e150-7. doi: 10.1093/bioinformatics/btl243.
10
Statistical significance of cis-regulatory modules.顺式调控模块的统计学显著性
BMC Bioinformatics. 2007 Jan 22;8:19. doi: 10.1186/1471-2105-8-19.

引用本文的文献

1
Towards an evolutionary model of transcription networks.转录网络的进化模型研究。
PLoS Comput Biol. 2011 Jun;7(6):e1002064. doi: 10.1371/journal.pcbi.1002064. Epub 2011 Jun 9.
2
Evolution of gene regulation of pluripotency--the case for wiki tracks at genome browsers.多能性基因调控的进化——基因组浏览器中 wiki 轨迹的案例。
Biol Direct. 2010 Dec 29;5:67. doi: 10.1186/1745-6150-5-67.
3
Modeling the evolution of regulatory elements by simultaneous detection and alignment with phylogenetic pair HMMs.通过同时检测和与系统发生对 HMM 对齐来模拟调控元件的进化。

本文引用的文献

1
Web-based identification of evolutionary conserved DNA cis-regulatory elements.基于网络的进化保守DNA顺式调控元件鉴定
Methods Mol Biol. 2007;395:425-36. doi: 10.1007/978-1-59745-514-5_26.
2
Phylogenetic simulation of promoter evolution: estimation and modeling of binding site turnover events and assessment of their impact on alignment tools.启动子进化的系统发育模拟:结合位点周转事件的估计与建模及其对序列比对工具影响的评估
Genome Biol. 2007;8(10):R225. doi: 10.1186/gb-2007-8-10-r225.
3
A statistical method for alignment-free comparison of regulatory sequences.
PLoS Comput Biol. 2010 Dec 16;6(12):e1001037. doi: 10.1371/journal.pcbi.1001037.
4
Evolution of conserved non-coding sequences within the vertebrate Hox clusters through the two-round whole genome duplications revealed by phylogenetic footprinting analysis.通过系统发育足迹分析揭示脊椎动物 Hox 簇中保守非编码序列在两轮全基因组复制过程中的演化。
J Mol Evol. 2010 Dec;71(5-6):427-36. doi: 10.1007/s00239-010-9396-1. Epub 2010 Oct 28.
5
Thermodynamics-based models of transcriptional regulation by enhancers: the roles of synergistic activation, cooperative binding and short-range repression.基于热力学的增强子转录调控模型:协同激活、合作结合和短程抑制的作用。
PLoS Comput Biol. 2010 Sep 16;6(9):e1000935. doi: 10.1371/journal.pcbi.1000935.
6
An effective model for natural selection in promoters.启动子中自然选择的有效模型。
Genome Res. 2010 May;20(5):685-92. doi: 10.1101/gr.096719.109. Epub 2010 Mar 1.
7
The effect of orthology and coregulation on detecting regulatory motifs.同源性和共调控对检测调控基序的影响。
PLoS One. 2010 Feb 3;5(2):e8938. doi: 10.1371/journal.pone.0008938.
8
Mechanisms and evolution of control logic in prokaryotic transcriptional regulation.原核生物转录调控中控制逻辑的机制与进化
Microbiol Mol Biol Rev. 2009 Sep;73(3):481-509, Table of Contents. doi: 10.1128/MMBR.00037-08.
9
DISCOVER: a feature-based discriminative method for motif search in complex genomes.DISCOVER:一种用于在复杂基因组中进行基序搜索的基于特征的判别方法。
Bioinformatics. 2009 Jun 15;25(12):i321-9. doi: 10.1093/bioinformatics/btp230.
10
Alignment and prediction of cis-regulatory modules based on a probabilistic model of evolution.基于进化概率模型的顺式调控模块的比对与预测。
PLoS Comput Biol. 2009 Mar;5(3):e1000299. doi: 10.1371/journal.pcbi.1000299. Epub 2009 Mar 13.
一种用于调控序列无比对比较的统计方法。
Bioinformatics. 2007 Jul 1;23(13):i249-55. doi: 10.1093/bioinformatics/btm211.
4
Large-scale turnover of functional transcription factor binding sites in Drosophila.果蝇中功能性转录因子结合位点的大规模更替
PLoS Comput Biol. 2006 Oct;2(10):e130. doi: 10.1371/journal.pcbi.0020130. Epub 2006 Aug 21.
5
VISTA family of computational tools for comparative analysis of DNA sequences and whole genomes.用于DNA序列和全基因组比较分析的VISTA计算工具家族。
Methods Mol Biol. 2006;338:69-89. doi: 10.1385/1-59745-097-9:69.
6
LOGOS: a modular Bayesian model for de novo motif detection.LOGOS:一种用于从头基序检测的模块化贝叶斯模型。
Proc IEEE Comput Soc Bioinform Conf. 2003;2:266-76.
7
ORegAnno: an open access database and curation system for literature-derived promoters, transcription factor binding sites and regulatory variation.ORegAnno:一个用于文献衍生启动子、转录因子结合位点和调控变异的开放获取数据库及注释系统。
Bioinformatics. 2006 Mar 1;22(5):637-40. doi: 10.1093/bioinformatics/btk027. Epub 2006 Jan 5.
8
Functional evolution of a cis-regulatory module.顺式调控模块的功能进化
PLoS Biol. 2005 Apr;3(4):e93. doi: 10.1371/journal.pbio.0030093. Epub 2005 Mar 15.
9
Assessing computational tools for the discovery of transcription factor binding sites.评估用于发现转录因子结合位点的计算工具。
Nat Biotechnol. 2005 Jan;23(1):137-44. doi: 10.1038/nbt1053.
10
MONKEY: identifying conserved transcription-factor binding sites in multiple alignments using a binding site-specific evolutionary model.MONKEY:使用结合位点特异性进化模型在多序列比对中识别保守转录因子结合位点。
Genome Biol. 2004;5(12):R98. doi: 10.1186/gb-2004-5-12-r98. Epub 2004 Nov 30.