• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用一组通用基序寻找进化上保守的顺式调控模块。

Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs.

作者信息

Wilczynski Bartek, Dojer Norbert, Patelak Mateusz, Tiuryn Jerzy

机构信息

Institute of Informatics, University of Warsaw, Warsaw, Poland.

出版信息

BMC Bioinformatics. 2009 Mar 10;10:82. doi: 10.1186/1471-2105-10-82.

DOI:10.1186/1471-2105-10-82
PMID:19284541
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2669485/
Abstract

BACKGROUND

Finding functional regulatory elements in DNA sequences is a very important problem in computational biology and providing a reliable algorithm for this task would be a major step towards understanding regulatory mechanisms on genome-wide scale. Major obstacles in this respect are that the fact that the amount of non-coding DNA is vast, and that the methods for predicting functional transcription factor binding sites tend to produce results with a high percentage of false positives. This makes the problem of finding regions significantly enriched in binding sites difficult.

RESULTS

We develop a novel method for predicting regulatory regions in DNA sequences, which is designed to exploit the evolutionary conservation of regulatory elements between species without assuming that the order of motifs is preserved across species. We have implemented our method and tested its predictive abilities on various datasets from different organisms.

CONCLUSION

We show that our approach enables us to find a majority of the known CRMs using only sequence information from different species together with currently publicly available motif data. Also, our method is robust enough to perform well in predicting CRMs, despite differences in tissue specificity and even across species, provided that the evolutionary distances between compared species do not change substantially. The complexity of the proposed algorithm is polynomial, and the observed running times show that it may be readily applied.

摘要

背景

在DNA序列中寻找功能调控元件是计算生物学中的一个非常重要的问题,为该任务提供一种可靠的算法将是朝着在全基因组范围内理解调控机制迈出的重要一步。这方面的主要障碍是,非编码DNA的数量巨大,而且预测功能性转录因子结合位点的方法往往会产生高比例的假阳性结果。这使得寻找结合位点显著富集区域的问题变得困难。

结果

我们开发了一种预测DNA序列中调控区域的新方法,该方法旨在利用物种间调控元件的进化保守性,而不假设基序顺序在物种间保持不变。我们已经实现了我们的方法,并在来自不同生物体的各种数据集上测试了其预测能力。

结论

我们表明,我们的方法使我们能够仅使用来自不同物种的序列信息以及当前公开可用的基序数据来找到大多数已知的顺式调控模块。此外,我们的方法足够稳健,即使在组织特异性不同甚至跨物种的情况下,只要所比较物种之间的进化距离没有实质性变化,也能在预测顺式调控模块方面表现良好。所提出算法的复杂度是多项式的,观察到的运行时间表明它可以很容易地应用。

相似文献

1
Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs.利用一组通用基序寻找进化上保守的顺式调控模块。
BMC Bioinformatics. 2009 Mar 10;10:82. doi: 10.1186/1471-2105-10-82.
2
Parsing regulatory DNA: general tasks, techniques, and the PhyloGibbs approach.解析调控DNA:一般任务、技术及系统发育吉布斯方法
J Biosci. 2007 Aug;32(5):863-70. doi: 10.1007/s12038-007-0086-0.
3
Evolutionary analysis of regulatory sequences (EARS) in plants.植物调控序列的进化分析(EARS)。
Plant J. 2010 Oct;64(1):165-76. doi: 10.1111/j.1365-313X.2010.04314.x. Epub 2010 Sep 16.
4
Context specific transcription factor prediction.上下文特异性转录因子预测
Ann Biomed Eng. 2007 Jun;35(6):1053-67. doi: 10.1007/s10439-007-9268-z. Epub 2007 Mar 22.
5
De novo prediction of cis-regulatory elements and modules through integrative analysis of a large number of ChIP datasets.通过对大量染色质免疫沉淀数据集进行综合分析,从头预测顺式调控元件和模块。
BMC Genomics. 2014 Dec 2;15:1047. doi: 10.1186/1471-2164-15-1047.
6
WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences.WeederH:一种用于在同源序列中寻找保守调控基序和区域的算法。
BMC Bioinformatics. 2007 Feb 7;8:46. doi: 10.1186/1471-2105-8-46.
7
Identification of tissue-specific cis-regulatory modules based on interactions between transcription factors.基于转录因子之间的相互作用鉴定组织特异性顺式调控模块
BMC Bioinformatics. 2007 Nov 9;8:437. doi: 10.1186/1471-2105-8-437.
8
Using PhyloCon to identify conserved regulatory motifs.使用PhyloCon来识别保守的调控基序。
Curr Protoc Bioinformatics. 2007 Sep;Chapter 2:Unit 2.12. doi: 10.1002/0471250953.bi0212s19.
9
Systematic identification of mammalian regulatory motifs' target genes and functions.哺乳动物调控基序的靶基因及功能的系统鉴定。
Nat Methods. 2008 Apr;5(4):347-53. doi: 10.1038/nmeth.1188. Epub 2008 Mar 2.
10
Identifying cis-regulatory modules by combining comparative and compositional analysis of DNA.通过结合DNA的比较分析和组成分析来识别顺式调控模块。
Bioinformatics. 2006 Dec 1;22(23):2858-64. doi: 10.1093/bioinformatics/btl499. Epub 2006 Oct 10.

引用本文的文献

1
A lncRNA identifies enhancer element in negative feedback control of dendritic cell differentiation.一个长链非编码 RNA 鉴定出树突状细胞分化负反馈调控中的增强子元件。
Elife. 2023 Mar 14;12:e83342. doi: 10.7554/eLife.83342.
2
RGT: a toolbox for the integrative analysis of high throughput regulatory genomics data.RGT:一个用于高通量调控基因组学数据综合分析的工具包。
BMC Bioinformatics. 2023 Mar 6;24(1):79. doi: 10.1186/s12859-023-05184-5.
3
Identification of transcription factor binding sites using ATAC-seq.利用 ATAC-seq 鉴定转录因子结合位点。

本文引用的文献

1
MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs.MOPAT:一种基于图形的方法,用于从已知基序预测复发性顺式调控模块。
Nucleic Acids Res. 2008 Aug;36(13):4488-97. doi: 10.1093/nar/gkn407. Epub 2008 Jul 7.
2
FlyBase: integration and improvements to query tools.果蝇数据库:查询工具的整合与改进
Nucleic Acids Res. 2008 Jan;36(Database issue):D588-93. doi: 10.1093/nar/gkm930. Epub 2007 Dec 26.
3
RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation.
Genome Biol. 2019 Feb 26;20(1):45. doi: 10.1186/s13059-019-1642-2.
4
Taking promoters out of enhancers in sequence based predictions of tissue-specific mammalian enhancers.在基于序列的组织特异性哺乳动物增强子预测中,将启动子从增强子中去除。
BMC Med Genomics. 2017 May 24;10(Suppl 1):34. doi: 10.1186/s12920-017-0264-3.
5
Sox5 regulates beta-cell phenotype and is reduced in type 2 diabetes.Sox5 调节β细胞表型,并在 2 型糖尿病中减少。
Nat Commun. 2017 Jun 6;8:15652. doi: 10.1038/ncomms15652.
6
Analysis of computational footprinting methods for DNase sequencing experiments.计算足迹法在 DNA 测序实验中的分析。
Nat Methods. 2016 Apr;13(4):303-9. doi: 10.1038/nmeth.3772. Epub 2016 Feb 22.
7
Enhanceosome transcription factors preferentially dimerize with high mobility group proteins.增强体转录因子优先与高迁移率族蛋白形成二聚体。
BMC Syst Biol. 2016 Feb 4;10:14. doi: 10.1186/s12918-016-0258-3.
8
Mtss1 is a critical epigenetically regulated tumor suppressor in CML.Mtss1 是 CML 中受关键表观遗传调控的肿瘤抑制因子。
Leukemia. 2016 Apr;30(4):823-32. doi: 10.1038/leu.2015.329. Epub 2015 Dec 1.
9
Optimally choosing PWM motif databases and sequence scanning approaches based on ChIP-seq data.基于染色质免疫沉淀测序(ChIP-seq)数据优化选择PWM基序数据库和序列扫描方法。
BMC Bioinformatics. 2015 May 1;16:140. doi: 10.1186/s12859-015-0573-5.
10
Replicative senescence is associated with nuclear reorganization and with DNA methylation at specific transcription factor binding sites.复制性衰老与核重排以及特定转录因子结合位点的 DNA 甲基化有关。
Clin Epigenetics. 2015 Mar 4;7(1):19. doi: 10.1186/s13148-015-0057-5. eCollection 2015.
RegulonDB(版本6.0):大肠杆菌K-12超越转录的基因调控模型、活跃(实验性)注释启动子及Textpresso导航
Nucleic Acids Res. 2008 Jan;36(Database issue):D120-4. doi: 10.1093/nar/gkm994. Epub 2007 Dec 23.
4
REDfly 2.0: an integrated database of cis-regulatory modules and transcription factor binding sites in Drosophila.REDfly 2.0:果蝇顺式调控模块和转录因子结合位点的综合数据库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D594-8. doi: 10.1093/nar/gkm876. Epub 2007 Nov 26.
5
Ensembl 2008.Ensembl 2008。
Nucleic Acids Res. 2008 Jan;36(Database issue):D707-14. doi: 10.1093/nar/gkm988. Epub 2007 Nov 13.
6
MORPH: probabilistic alignment combined with hidden Markov models of cis-regulatory modules.MORPH:概率比对与顺式调控模块的隐马尔可夫模型相结合。
PLoS Comput Biol. 2007 Nov;3(11):e216. doi: 10.1371/journal.pcbi.0030216. Epub 2007 Sep 24.
7
Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures.利用进化特征在12个果蝇基因组中发现功能元件。
Nature. 2007 Nov 8;450(7167):219-32. doi: 10.1038/nature06340.
8
BLISS 2.0: a web-based tool for predicting conserved regulatory modules in distantly-related orthologous sequences.BLISS 2.0:一种用于预测远缘直系同源序列中保守调控模块的基于网络的工具。
Bioinformatics. 2007 Dec 1;23(23):3249-50. doi: 10.1093/bioinformatics/btm368. Epub 2007 Jul 27.
9
An atomic model of the interferon-beta enhanceosome.干扰素-β增强体的原子模型。
Cell. 2007 Jun 15;129(6):1111-23. doi: 10.1016/j.cell.2007.05.019.
10
Large-scale analysis of transcriptional cis-regulatory modules reveals both common features and distinct subclasses.转录顺式调控模块的大规模分析揭示了共同特征和不同的亚类。
Genome Biol. 2007;8(6):R101. doi: 10.1186/gb-2007-8-6-r101.