• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

同源性和共调控对检测调控基序的影响。

The effect of orthology and coregulation on detecting regulatory motifs.

机构信息

CMPG, Department of Microbial and Molecular Systems, Katholieke Universiteit Leuven, Leuven, Belgium.

出版信息

PLoS One. 2010 Feb 3;5(2):e8938. doi: 10.1371/journal.pone.0008938.

DOI:10.1371/journal.pone.0008938
PMID:20140085
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2815771/
Abstract

BACKGROUND

Computational de novo discovery of transcription factor binding sites is still a challenging problem. The growing number of sequenced genomes allows integrating orthology evidence with coregulation information when searching for motifs. Moreover, the more advanced motif detection algorithms explicitly model the phylogenetic relatedness between the orthologous input sequences and thus should be well adapted towards using orthologous information. In this study, we evaluated the conditions under which complementing coregulation with orthologous information improves motif detection for the class of probabilistic motif detection algorithms with an explicit evolutionary model.

METHODOLOGY

We designed datasets (real and synthetic) covering different degrees of coregulation and orthologous information to test how well Phylogibbs and Phylogenetic sampler, as representatives of the motif detection algorithms with evolutionary model performed as compared to MEME, a more classical motif detection algorithm that treats orthologs independently.

RESULTS AND CONCLUSIONS

Under certain conditions detecting motifs in the combined coregulation-orthology space is indeed more efficient than using each space separately, but this is not always the case. Moreover, the difference in success rate between the advanced algorithms and MEME is still marginal. The success rate of motif detection depends on the complex interplay between the added information and the specificities of the applied algorithms. Insights in this relation provide information useful to both developers and users. All benchmark datasets are available at http://homes.esat.kuleuven.be/~kmarchal/Supplementary_Storms_Valerie_PlosONE.

摘要

背景

计算从头发现转录因子结合位点仍然是一个具有挑战性的问题。随着测序基因组数量的增加,在搜索基序时,可以将同源证据与共调控信息集成在一起。此外,更先进的基序检测算法明确地对同源输入序列之间的系统发育相关性进行建模,因此应该很好地适应使用同源信息。在这项研究中,我们评估了在哪些条件下,通过补充共调控信息来改善具有显式进化模型的概率基序检测算法的基序检测。

方法

我们设计了数据集(真实和合成),涵盖了不同程度的共调控和同源信息,以测试 Phylogibbs 和 Phylogenetic sampler 作为具有进化模型的基序检测算法的代表,与 MEME 相比,在检测基序方面的表现如何,MEME 是一种更经典的基序检测算法,它独立地处理同源物。

结果与结论

在某些条件下,在共调控-同源空间中检测基序确实比分别使用每个空间更有效,但情况并非总是如此。此外,先进算法与 MEME 之间的成功率差异仍然很小。基序检测的成功率取决于添加信息与应用算法的特异性之间的复杂相互作用。对这种关系的深入了解为开发人员和用户提供了有用的信息。所有基准数据集均可在 http://homes.esat.kuleuven.be/~kmarchal/Supplementary_Storms_Valerie_PlosONE 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/e96db1458999/pone.0008938.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/eb01a93b23e1/pone.0008938.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/c5a9dc14f7c5/pone.0008938.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/34384ea8447c/pone.0008938.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/6673445b234e/pone.0008938.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/53f426292c97/pone.0008938.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/e96db1458999/pone.0008938.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/eb01a93b23e1/pone.0008938.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/c5a9dc14f7c5/pone.0008938.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/34384ea8447c/pone.0008938.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/6673445b234e/pone.0008938.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/53f426292c97/pone.0008938.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3faf/2815771/e96db1458999/pone.0008938.g006.jpg

相似文献

1
The effect of orthology and coregulation on detecting regulatory motifs.同源性和共调控对检测调控基序的影响。
PLoS One. 2010 Feb 3;5(2):e8938. doi: 10.1371/journal.pone.0008938.
2
PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.PhyloGibbs:一种整合了系统发育的吉布斯采样基序查找器。
PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9.
3
The value of position-specific priors in motif discovery using MEME.MEME 中位置特异性先验在基序发现中的价值。
BMC Bioinformatics. 2010 Apr 9;11:179. doi: 10.1186/1471-2105-11-179.
4
Phylogenetic motif detection by expectation-maximization on evolutionary mixtures.基于进化混合模型的期望最大化算法进行系统发育基序检测。
Pac Symp Biocomput. 2004:324-35. doi: 10.1142/9789812704856_0031.
5
More robust detection of motifs in coexpressed genes by using phylogenetic information.通过利用系统发育信息更可靠地检测共表达基因中的基序。
BMC Bioinformatics. 2006 Mar 20;7:160. doi: 10.1186/1471-2105-7-160.
6
An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.一种用于原核生物基因组中顺式调控基序识别的综合且适用的系统发育足迹分析框架。
BMC Genomics. 2016 Aug 9;17:578. doi: 10.1186/s12864-016-2982-x.
7
GADEM: a genetic algorithm guided formation of spaced dyads coupled with an EM algorithm for motif discovery.GADEM:一种遗传算法引导的间隔二元组形成,结合期望最大化算法用于基序发现。
J Comput Biol. 2009 Feb;16(2):317-29. doi: 10.1089/cmb.2008.16TT.
8
Discovering multiple realistic TFBS motifs based on a generalized model.基于广义模型发现多个真实的 TFBS 基序。
BMC Bioinformatics. 2009 Oct 7;10:321. doi: 10.1186/1471-2105-10-321.
9
PhyME: a probabilistic algorithm for finding motifs in sets of orthologous sequences.PhyME:一种用于在直系同源序列集中寻找基序的概率算法。
BMC Bioinformatics. 2004 Oct 28;5:170. doi: 10.1186/1471-2105-5-170.
10
Assessing phylogenetic motif models for predicting transcription factor binding sites.评估用于预测转录因子结合位点的系统发育基序模型。
Bioinformatics. 2009 Jun 15;25(12):i339-47. doi: 10.1093/bioinformatics/btp201.

引用本文的文献

1
Known and novel post-transcriptional regulatory sequences are conserved across plant families.已知和新的转录后调控序列在植物科之间是保守的。
RNA. 2012 Mar;18(3):368-84. doi: 10.1261/rna.031179.111. Epub 2012 Jan 11.
2
A mutation degree model for the identification of transcriptional regulatory elements.转录调控元件识别的突变程度模型。
BMC Bioinformatics. 2011 Jun 27;12:262. doi: 10.1186/1471-2105-12-262.

本文引用的文献

1
Evolution of transcriptional regulatory circuits in bacteria.细菌中转录调控回路的进化
Cell. 2009 Jul 23;138(2):233-44. doi: 10.1016/j.cell.2009.07.002.
2
CSMET: comparative genomic motif detection via multi-resolution phylogenetic shadowing.CSMET:通过多分辨率系统发育影子进行比较基因组基序检测
PLoS Comput Biol. 2008 Jun 6;4(6):e1000090. doi: 10.1371/journal.pcbi.1000090.
3
A survey of DNA motif finding algorithms.DNA基序查找算法综述。
BMC Bioinformatics. 2007 Nov 1;8 Suppl 7(Suppl 7):S21. doi: 10.1186/1471-2105-8-S7-S21.
4
Tree Gibbs Sampler: identifying conserved motifs without aligning orthologous sequences.树状吉布斯采样器:无需比对直系同源序列即可识别保守基序。
Bioinformatics. 2007 Aug 1;23(15):2013-4. doi: 10.1093/bioinformatics/btm282. Epub 2007 May 31.
5
Binding site graphs: a new graph theoretical framework for prediction of transcription factor binding sites.结合位点图:一种预测转录因子结合位点的新图论框架。
PLoS Comput Biol. 2007 May;3(5):e90. doi: 10.1371/journal.pcbi.0030090. Epub 2007 Apr 10.
6
A phylogenetic Gibbs sampler that yields centroid solutions for cis-regulatory site prediction.一种用于顺式调控位点预测并产生质心解的系统发生吉布斯采样器。
Bioinformatics. 2007 Jul 15;23(14):1718-27. doi: 10.1093/bioinformatics/btm241. Epub 2007 May 8.
7
The Gibbs Centroid Sampler.吉布斯质心采样器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W232-7. doi: 10.1093/nar/gkm265. Epub 2007 May 5.
8
Using pyrosequencing to shed light on deep mine microbial ecology.利用焦磷酸测序技术揭示深部矿井微生物生态。
BMC Genomics. 2006 Mar 20;7:57. doi: 10.1186/1471-2164-7-57.
9
More robust detection of motifs in coexpressed genes by using phylogenetic information.通过利用系统发育信息更可靠地检测共表达基因中的基序。
BMC Bioinformatics. 2006 Mar 20;7:160. doi: 10.1186/1471-2105-7-160.
10
PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.PhyloGibbs:一种整合了系统发育的吉布斯采样基序查找器。
PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9.