• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PhyloGibbs-MP:通过吉布斯采样进行模块预测和判别基序查找。

PhyloGibbs-MP: module prediction and discriminative motif-finding by Gibbs sampling.

作者信息

Siddharthan Rahul

机构信息

The Institute of Mathematical Sciences, Chennai, India.

出版信息

PLoS Comput Biol. 2008 Aug 29;4(8):e1000156. doi: 10.1371/journal.pcbi.1000156.

DOI:10.1371/journal.pcbi.1000156
PMID:18769735
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2518514/
Abstract

PhyloGibbs, our recent Gibbs-sampling motif-finder, takes phylogeny into account in detecting binding sites for transcription factors in DNA and assigns posterior probabilities to its predictions obtained by sampling the entire configuration space. Here, in an extension called PhyloGibbs-MP, we widen the scope of the program, addressing two major problems in computational regulatory genomics. First, PhyloGibbs-MP can localise predictions to small, undetermined regions of a large input sequence, thus effectively predicting cis-regulatory modules (CRMs) ab initio while simultaneously predicting binding sites in those modules-tasks that are usually done by two separate programs. PhyloGibbs-MP's performance at such ab initio CRM prediction is comparable with or superior to dedicated module-prediction software that use prior knowledge of previously characterised transcription factors. Second, PhyloGibbs-MP can predict motifs that differentiate between two (or more) different groups of regulatory regions, that is, motifs that occur preferentially in one group over the others. While other "discriminative motif-finders" have been published in the literature, PhyloGibbs-MP's implementation has some unique features and flexibility. Benchmarks on synthetic and actual genomic data show that this algorithm is successful at enhancing predictions of differentiating sites and suppressing predictions of common sites and compares with or outperforms other discriminative motif-finders on actual genomic data. Additional enhancements include significant performance and speed improvements, the ability to use "informative priors" on known transcription factors, and the ability to output annotations in a format that can be visualised with the Generic Genome Browser. In stand-alone motif-finding, PhyloGibbs-MP remains competitive, outperforming PhyloGibbs-1.0 and other programs on benchmark data.

摘要

我们最新的吉布斯采样基序查找工具PhyloGibbs在检测DNA中转录因子的结合位点时会考虑系统发育,并为通过对整个配置空间进行采样获得的预测赋予后验概率。在此,在一个名为PhyloGibbs-MP的扩展版本中,我们拓宽了该程序的范围,解决了计算调控基因组学中的两个主要问题。首先,PhyloGibbs-MP可以将预测定位到大型输入序列的小的未确定区域,从而有效地从头预测顺式调控模块(CRM),同时预测这些模块中的结合位点——这些任务通常由两个单独的程序完成。PhyloGibbs-MP在这种从头CRM预测方面的性能与使用先前表征的转录因子的先验知识的专用模块预测软件相当或更优。其次,PhyloGibbs-MP可以预测区分两个(或更多)不同调控区域组的基序,即优先在一组中而非其他组中出现的基序。虽然文献中已经发表了其他“判别性基序查找工具”,但PhyloGibbs-MP的实现具有一些独特的特性和灵活性。对合成和实际基因组数据的基准测试表明,该算法在增强区分位点的预测和抑制常见位点的预测方面是成功的,并且在实际基因组数据上与其他判别性基序查找工具相当或更优。其他改进包括显著的性能和速度提升、对已知转录因子使用“信息先验”的能力以及以可通过通用基因组浏览器可视化的格式输出注释的能力。在独立的基序查找中,PhyloGibbs-MP仍然具有竞争力,在基准数据上优于PhyloGibbs-1.0和其他程序。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/1a338567a95e/pcbi.1000156.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/926b1ad6db55/pcbi.1000156.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/bf1911eab7bd/pcbi.1000156.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/56d71752158a/pcbi.1000156.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/77dcca703083/pcbi.1000156.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/c087e16f4d71/pcbi.1000156.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/b641073b8b73/pcbi.1000156.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/f2ca01b526ed/pcbi.1000156.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/287e2e006c5c/pcbi.1000156.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/e56c8c713e53/pcbi.1000156.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/1a338567a95e/pcbi.1000156.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/926b1ad6db55/pcbi.1000156.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/bf1911eab7bd/pcbi.1000156.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/56d71752158a/pcbi.1000156.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/77dcca703083/pcbi.1000156.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/c087e16f4d71/pcbi.1000156.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/b641073b8b73/pcbi.1000156.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/f2ca01b526ed/pcbi.1000156.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/287e2e006c5c/pcbi.1000156.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/e56c8c713e53/pcbi.1000156.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc0a/2518514/1a338567a95e/pcbi.1000156.g010.jpg

相似文献

1
PhyloGibbs-MP: module prediction and discriminative motif-finding by Gibbs sampling.PhyloGibbs-MP:通过吉布斯采样进行模块预测和判别基序查找。
PLoS Comput Biol. 2008 Aug 29;4(8):e1000156. doi: 10.1371/journal.pcbi.1000156.
2
PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.PhyloGibbs:一种整合了系统发育的吉布斯采样基序查找器。
PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9.
3
Parsing regulatory DNA: general tasks, techniques, and the PhyloGibbs approach.解析调控DNA:一般任务、技术及系统发育吉布斯方法
J Biosci. 2007 Aug;32(5):863-70. doi: 10.1007/s12038-007-0086-0.
4
De novo prediction of cis-regulatory elements and modules through integrative analysis of a large number of ChIP datasets.通过对大量染色质免疫沉淀数据集进行综合分析,从头预测顺式调控元件和模块。
BMC Genomics. 2014 Dec 2;15:1047. doi: 10.1186/1471-2164-15-1047.
5
Computational detection of genomic cis-regulatory modules applied to body patterning in the early Drosophila embryo.应用于早期果蝇胚胎身体模式形成的基因组顺式调控模块的计算检测。
BMC Bioinformatics. 2002 Oct 24;3:30. doi: 10.1186/1471-2105-3-30.
6
CisMiner: genome-wide in-silico cis-regulatory module prediction by fuzzy itemset mining.CisMiner:通过模糊项集挖掘进行全基因组的计算机模拟顺式调控模块预测
PLoS One. 2014 Sep 30;9(9):e108065. doi: 10.1371/journal.pone.0108065. eCollection 2014.
7
Detecting regulatory sites using PhyloGibbs.使用系统发育吉布斯抽样法检测调控位点。
Methods Mol Biol. 2007;395:381-402. doi: 10.1007/978-1-59745-514-5_24.
8
Identifying cis-regulatory modules by combining comparative and compositional analysis of DNA.通过结合DNA的比较分析和组成分析来识别顺式调控模块。
Bioinformatics. 2006 Dec 1;22(23):2858-64. doi: 10.1093/bioinformatics/btl499. Epub 2006 Oct 10.
9
Discriminative motif discovery in DNA and protein sequences using the DEME algorithm.使用DEME算法在DNA和蛋白质序列中发现鉴别性基序。
BMC Bioinformatics. 2007 Oct 15;8:385. doi: 10.1186/1471-2105-8-385.
10
MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs.MOPAT:一种基于图形的方法,用于从已知基序预测复发性顺式调控模块。
Nucleic Acids Res. 2008 Aug;36(13):4488-97. doi: 10.1093/nar/gkn407. Epub 2008 Jul 7.

引用本文的文献

1
Loss of centromere function drives karyotype evolution in closely related species.着丝粒功能丧失驱动近缘物种的核型进化。
Elife. 2020 Jan 20;9:e53944. doi: 10.7554/eLife.53944.
2
TFforge utilizes large-scale binding site divergence to identify transcriptional regulators involved in phenotypic differences.TFforge 利用大规模结合位点分歧来识别参与表型差异的转录调控因子。
Nucleic Acids Res. 2019 Feb 28;47(4):e19. doi: 10.1093/nar/gky1200.
3
THiCweed: fast, sensitive detection of sequence features by clustering big datasets.THiCweed:通过对大数据集进行聚类实现快速、灵敏的序列特征检测。

本文引用的文献

1
REDfly 2.0: an integrated database of cis-regulatory modules and transcription factor binding sites in Drosophila.REDfly 2.0:果蝇顺式调控模块和转录因子结合位点的综合数据库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D594-8. doi: 10.1093/nar/gkm876. Epub 2007 Nov 26.
2
Evolution of genes and genomes on the Drosophila phylogeny.果蝇系统发育中基因和基因组的进化。
Nature. 2007 Nov 8;450(7167):203-18. doi: 10.1038/nature06341.
3
Discriminative motif discovery in DNA and protein sequences using the DEME algorithm.使用DEME算法在DNA和蛋白质序列中发现鉴别性基序。
Nucleic Acids Res. 2018 Mar 16;46(5):e29. doi: 10.1093/nar/gkx1251.
4
Recent computational developments on CLIP-seq data analysis and microRNA targeting implications.CLIP-seq 数据分析和 miRNA 靶向作用的最新计算进展。
Brief Bioinform. 2018 Nov 27;19(6):1290-1301. doi: 10.1093/bib/bbx063.
5
Combining phylogenetic footprinting with motif models incorporating intra-motif dependencies.将系统发育足迹法与纳入基序内依赖性的基序模型相结合。
BMC Bioinformatics. 2017 Mar 1;18(1):141. doi: 10.1186/s12859-017-1495-1.
6
Unrealistic phylogenetic trees may improve phylogenetic footprinting.不切实际的系统发育树可能会改善系统发育足迹分析。
Bioinformatics. 2017 Jun 1;33(11):1639-1646. doi: 10.1093/bioinformatics/btx033.
7
STEME: a robust, accurate motif finder for large data sets.STEME:一种用于大型数据集的强大、精确的基序查找工具。
PLoS One. 2014 Mar 13;9(3):e90735. doi: 10.1371/journal.pone.0090735. eCollection 2014.
8
On the value of intra-motif dependencies of human insulator protein CTCF.关于人类绝缘子蛋白CTCF基序内依赖性的价值
PLoS One. 2014 Jan 22;9(1):e85629. doi: 10.1371/journal.pone.0085629. eCollection 2014.
9
Discriminative motif optimization based on perceptron training.基于感知机训练的判别模式优化。
Bioinformatics. 2014 Apr 1;30(7):941-8. doi: 10.1093/bioinformatics/btt748. Epub 2013 Dec 24.
10
Towards an evolutionary model of transcription networks.转录网络的进化模型研究。
PLoS Comput Biol. 2011 Jun;7(6):e1002064. doi: 10.1371/journal.pcbi.1002064. Epub 2011 Jun 9.
BMC Bioinformatics. 2007 Oct 15;8:385. doi: 10.1186/1471-2105-8-385.
4
dPattern: transcription factor binding site (TFBS) discovery in human genome using a discriminative pattern analysis.d模式:利用判别模式分析在人类基因组中发现转录因子结合位点(TFBS)
Bioinformatics. 2007 Oct 1;23(19):2619-21. doi: 10.1093/bioinformatics/btm288. Epub 2007 Jun 5.
5
A phylogenetic Gibbs sampler that yields centroid solutions for cis-regulatory site prediction.一种用于顺式调控位点预测并产生质心解的系统发生吉布斯采样器。
Bioinformatics. 2007 Jul 15;23(14):1718-27. doi: 10.1093/bioinformatics/btm241. Epub 2007 May 8.
6
On counting position weight matrix matches in a sequence, with application to discriminative motif finding.关于计算序列中的位置权重矩阵匹配及其在判别性基序发现中的应用。
Bioinformatics. 2006 Jul 15;22(14):e454-63. doi: 10.1093/bioinformatics/btl227.
7
Finding motifs from all sequences with and without binding sites.从所有具有和不具有结合位点的序列中寻找基序。
Bioinformatics. 2006 Sep 15;22(18):2217-23. doi: 10.1093/bioinformatics/btl371. Epub 2006 Jul 26.
8
Stubb: a program for discovery and analysis of cis-regulatory modules.Stubb:一个用于发现和分析顺式调控模块的程序。
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W555-9. doi: 10.1093/nar/gkl224.
9
Sigma: multiple alignment of weakly-conserved non-coding DNA sequence.西格玛:弱保守非编码DNA序列的多重比对
BMC Bioinformatics. 2006 Mar 16;7:143. doi: 10.1186/1471-2105-7-143.
10
PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.PhyloGibbs:一种整合了系统发育的吉布斯采样基序查找器。
PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9.