• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

转录调控元件识别的突变程度模型。

A mutation degree model for the identification of transcriptional regulatory elements.

机构信息

State Key Laboratory of Pharmaceutical Biotechnology, School of Life Science, Nanjing University, Nanjing 210093, China.

出版信息

BMC Bioinformatics. 2011 Jun 27;12:262. doi: 10.1186/1471-2105-12-262.

DOI:10.1186/1471-2105-12-262
PMID:21708002
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3228546/
Abstract

BACKGROUND

Current approaches for identifying transcriptional regulatory elements are mainly via the combination of two properties, the evolutionary conservation and the overrepresentation of functional elements in the promoters of co-regulated genes. Despite the development of many motif detection algorithms, the discovery of conserved motifs in a wide range of phylogenetically related promoters is still a challenge, especially for the short motifs embedded in distantly related gene promoters or very closely related promoters, or in the situation that there are not enough orthologous genes available.

RESULTS

A mutation degree model is proposed and a new word counting method is developed for the identification of transcriptional regulatory elements from a set of co-expressed genes. The new method comprises two parts: 1) identifying overrepresented oligo-nucleotides in promoters of co-expressed genes, 2) estimating the conservation of the oligo-nucleotides in promoters of phylogenetically related genes by the mutation degree model. Compared with the performance of other algorithms, our method shows the advantages of low false positive rate and higher specificity, especially the robustness to noisy data. Applying the method to co-expressed gene sets from Arabidopsis, most of known cis-elements were successfully detected. The tool and example are available at http://mcube.nju.edu.cn/jwang/lab/soft/ocw/OCW.html.

CONCLUSIONS

The mutation degree model proposed in this paper is adapted to phylogenetic data of different qualities, and to a wide range of evolutionary distances. The new word-counting method based on this model has the advantage of better performance in detecting short sequence of cis-elements from co-expressed genes of eukaryotes and is robust to less complete phylogenetic data.

摘要

背景

目前识别转录调控元件的方法主要是结合两个特性,即进化保守性和功能元件在共调控基因启动子中的过度表达。尽管已经开发了许多基序检测算法,但在广泛的系统发育相关启动子中发现保守基序仍然是一个挑战,特别是对于嵌入在远缘基因启动子或非常近缘基因启动子中的短基序,或者在没有足够的直系同源基因可用的情况下。

结果

提出了一种突变程度模型,并开发了一种新的单词计数方法,用于从一组共表达基因中识别转录调控元件。该新方法包括两部分:1)识别共表达基因启动子中过度表达的寡核苷酸,2)通过突变程度模型估计系统发育相关基因启动子中寡核苷酸的保守性。与其他算法的性能相比,我们的方法具有低假阳性率和更高特异性的优势,特别是对噪声数据的稳健性。将该方法应用于拟南芥的共表达基因集,成功检测到了大多数已知的顺式元件。该工具和示例可在 http://mcube.nju.edu.cn/jwang/lab/soft/ocw/OCW.html 上获得。

结论

本文提出的突变程度模型适用于不同质量和广泛进化距离的系统发育数据。基于该模型的新单词计数方法在检测真核生物共表达基因中短序列的顺式元件方面具有更好的性能,并且对不完整的系统发育数据具有稳健性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfd6/3228546/fc136d5cf176/1471-2105-12-262-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfd6/3228546/495d598bbe91/1471-2105-12-262-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfd6/3228546/4ec00c41c43b/1471-2105-12-262-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfd6/3228546/fc136d5cf176/1471-2105-12-262-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfd6/3228546/495d598bbe91/1471-2105-12-262-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfd6/3228546/4ec00c41c43b/1471-2105-12-262-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cfd6/3228546/fc136d5cf176/1471-2105-12-262-3.jpg

相似文献

1
A mutation degree model for the identification of transcriptional regulatory elements.转录调控元件识别的突变程度模型。
BMC Bioinformatics. 2011 Jun 27;12:262. doi: 10.1186/1471-2105-12-262.
2
[Computational identification of transcriptional regulatory elements in Arabidopsis TCH4 promoter].[拟南芥TCH4启动子中转录调控元件的计算鉴定]
Yi Chuan. 2008 May;30(5):620-6. doi: 10.3724/sp.j.1005.2008.00620.
3
Quantitative statistical analysis of cis-regulatory sequences in ABA/VP1- and CBF/DREB1-regulated genes of Arabidopsis.拟南芥ABA/VP1和CBF/DREB1调控基因中顺式调控序列的定量统计分析
Plant Physiol. 2005 Sep;139(1):437-47. doi: 10.1104/pp.104.058412. Epub 2005 Aug 19.
4
Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae.种子储存蛋白基因启动子在十字花科、豆科和禾本科植物中含有保守的DNA基序。
BMC Plant Biol. 2009 Oct 20;9:126. doi: 10.1186/1471-2229-9-126.
5
Genome wide analysis of Arabidopsis core promoters.拟南芥核心启动子的全基因组分析。
BMC Genomics. 2005 Feb 25;6:25. doi: 10.1186/1471-2164-6-25.
6
Computational approaches to identify promoters and cis-regulatory elements in plant genomes.用于识别植物基因组中启动子和顺式调控元件的计算方法。
Plant Physiol. 2003 Jul;132(3):1162-76. doi: 10.1104/pp.102.017715.
7
Genome-wide prediction of transcriptional regulatory elements of human promoters using gene expression and promoter analysis data.利用基因表达和启动子分析数据对人类启动子的转录调控元件进行全基因组预测。
BMC Bioinformatics. 2006 Jul 4;7:330. doi: 10.1186/1471-2105-7-330.
8
More robust detection of motifs in coexpressed genes by using phylogenetic information.通过利用系统发育信息更可靠地检测共表达基因中的基序。
BMC Bioinformatics. 2006 Mar 20;7:160. doi: 10.1186/1471-2105-7-160.
9
Development of a novel data mining tool to find cis-elements in rice gene promoter regions.开发一种新型数据挖掘工具以在水稻基因启动子区域中寻找顺式元件。
BMC Plant Biol. 2008 Feb 27;8:20. doi: 10.1186/1471-2229-8-20.
10
A universal algorithm for genome-wide in silicio identification of biologically significant gene promoter putative cis-regulatory-elements; identification of new elements for reactive oxygen species and sucrose signaling in Arabidopsis.一种用于全基因组范围内在计算机上鉴定具有生物学意义的基因启动子假定顺式调控元件的通用算法;鉴定拟南芥中活性氧和蔗糖信号传导的新元件。
Plant J. 2006 Feb;45(3):384-98. doi: 10.1111/j.1365-313X.2005.02634.x.

引用本文的文献

1
Cascading cis-cleavage on transcript from trans-acting siRNA-producing locus 3.反式作用 siRNA 产生基因座 3 的转录物的级联 cis 切割
Int J Mol Sci. 2013 Jul 12;14(7):14689-99. doi: 10.3390/ijms140714689.

本文引用的文献

1
Insulators and promoters: closer than we think.绝缘子和启动子:比我们想象的更接近。
Nat Rev Genet. 2010 Jun;11(6):439-46. doi: 10.1038/nrg2765. Epub 2010 May 5.
2
The effect of orthology and coregulation on detecting regulatory motifs.同源性和共调控对检测调控基序的影响。
PLoS One. 2010 Feb 3;5(2):e8938. doi: 10.1371/journal.pone.0008938.
3
Cis-regulatory elements in plant cell signaling.植物细胞信号传导中的顺式调控元件。
Curr Opin Plant Biol. 2009 Oct;12(5):643-9. doi: 10.1016/j.pbi.2009.07.016. Epub 2009 Aug 28.
4
Pscan: finding over-represented transcription factor binding site motifs in sequences from co-regulated or co-expressed genes.Pscan:在共调控或共表达基因的序列中寻找过度富集的转录因子结合位点基序。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W247-52. doi: 10.1093/nar/gkp464. Epub 2009 May 31.
5
Tissue-specific regulatory network extractor (TS-REX): a database and software resource for the tissue and cell type-specific investigation of transcription factor-gene networks.组织特异性调控网络提取器(TS-REX):用于转录因子-基因网络的组织和细胞类型特异性研究的数据库及软件资源。
Nucleic Acids Res. 2009 Jun;37(11):e82. doi: 10.1093/nar/gkp311. Epub 2009 May 13.
6
COTRASIF: conservation-aided transcription-factor-binding site finder.COTRASIF:保护辅助转录因子结合位点查找器。
Nucleic Acids Res. 2009 Apr;37(7):e49. doi: 10.1093/nar/gkp084. Epub 2009 Mar 5.
7
Discovering sequence motifs with arbitrary insertions and deletions.发现带有任意插入和缺失的序列基序。
PLoS Comput Biol. 2008 May 9;4(4):e1000071. doi: 10.1371/journal.pcbi.1000071.
8
Gene regulation by transcription factors and microRNAs.转录因子和微小RNA对基因的调控。
Science. 2008 Mar 28;319(5871):1785-6. doi: 10.1126/science.1151651.
9
PhyME: a software tool for finding motifs in sets of orthologous sequences.PhyME:一种用于在直系同源序列集中查找基序的软件工具。
Methods Mol Biol. 2007;395:309-18.
10
WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences.WeederH:一种用于在同源序列中寻找保守调控基序和区域的算法。
BMC Bioinformatics. 2007 Feb 7;8:46. doi: 10.1186/1471-2105-8-46.