• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

启动子区域DNA基序的位置分布分析及其生物学相关性。

An analysis of the positional distribution of DNA motifs in promoter regions and its biological relevance.

作者信息

Casimiro Ana C, Vinga Susana, Freitas Ana T, Oliveira Arlindo L

机构信息

INESC-ID/IST, Rua Alves Redol, 9 1000-029 Lisboa, Portugal.

出版信息

BMC Bioinformatics. 2008 Feb 7;9:89. doi: 10.1186/1471-2105-9-89.

DOI:10.1186/1471-2105-9-89
PMID:18257925
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2375121/
Abstract

BACKGROUND

Motif finding algorithms have developed in their ability to use computationally efficient methods to detect patterns in biological sequences. However the posterior classification of the output still suffers from some limitations, which makes it difficult to assess the biological significance of the motifs found. Previous work has highlighted the existence of positional bias of motifs in the DNA sequences, which might indicate not only that the pattern is important, but also provide hints of the positions where these patterns occur preferentially.

RESULTS

We propose to integrate position uniformity tests and over-representation tests to improve the accuracy of the classification of motifs. Using artificial data, we have compared three different statistical tests (Chi-Square, Kolmogorov-Smirnov and a Chi-Square bootstrap) to assess whether a given motif occurs uniformly in the promoter region of a gene. Using the test that performed better in this dataset, we proceeded to study the positional distribution of several well known cis-regulatory elements, in the promoter sequences of different organisms (S. cerevisiae, H. sapiens, D. melanogaster, E. coli and several Dicotyledons plants). The results show that position conservation is relevant for the transcriptional machinery.

CONCLUSION

We conclude that many biologically relevant motifs appear heterogeneously distributed in the promoter region of genes, and therefore, that non-uniformity is a good indicator of biological relevance and can be used to complement over-representation tests commonly used. In this article we present the results obtained for the S. cerevisiae data sets.

摘要

背景

基序发现算法在利用计算高效的方法检测生物序列中的模式方面已有发展。然而,输出结果的后续分类仍存在一些局限性,这使得评估所发现基序的生物学意义变得困难。先前的工作已强调了DNA序列中基序存在位置偏差,这不仅可能表明该模式很重要,还能为这些模式优先出现的位置提供线索。

结果

我们提议整合位置均匀性测试和过表达测试,以提高基序分类的准确性。使用人工数据,我们比较了三种不同的统计测试(卡方检验、柯尔莫哥洛夫-斯米尔诺夫检验和卡方自展法),以评估给定基序在基因启动子区域是否均匀出现。使用在该数据集中表现更佳的测试,我们进而研究了几种知名顺式调控元件在不同生物体(酿酒酵母、智人、黑腹果蝇、大肠杆菌和几种双子叶植物)启动子序列中的位置分布。结果表明位置保守性与转录机制相关。

结论

我们得出结论,许多具有生物学相关性的基序在基因启动子区域呈异质分布,因此,不均匀性是生物学相关性的良好指标,可用于补充常用的过表达测试。在本文中,我们展示了酿酒酵母数据集所获得的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e54/2375121/8bc753116c7a/1471-2105-9-89-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e54/2375121/604830742b4e/1471-2105-9-89-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e54/2375121/63b76a6a9251/1471-2105-9-89-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e54/2375121/52d2173d9ab1/1471-2105-9-89-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e54/2375121/6d365b8808ad/1471-2105-9-89-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e54/2375121/8bc753116c7a/1471-2105-9-89-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e54/2375121/604830742b4e/1471-2105-9-89-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e54/2375121/63b76a6a9251/1471-2105-9-89-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e54/2375121/52d2173d9ab1/1471-2105-9-89-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e54/2375121/6d365b8808ad/1471-2105-9-89-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e54/2375121/8bc753116c7a/1471-2105-9-89-5.jpg

相似文献

1
An analysis of the positional distribution of DNA motifs in promoter regions and its biological relevance.启动子区域DNA基序的位置分布分析及其生物学相关性。
BMC Bioinformatics. 2008 Feb 7;9:89. doi: 10.1186/1471-2105-9-89.
2
A survey of DNA motif finding algorithms.DNA基序查找算法综述。
BMC Bioinformatics. 2007 Nov 1;8 Suppl 7(Suppl 7):S21. doi: 10.1186/1471-2105-8-S7-S21.
3
Finding motifs in the twilight zone.在模糊地带寻找基序。
Bioinformatics. 2002 Oct;18(10):1374-81. doi: 10.1093/bioinformatics/18.10.1374.
4
Cis-motifs upstream of the transcription and translation initiation sites are effectively revealed by their positional disequilibrium in eukaryote genomes using frequency distribution curves.利用频率分布曲线,通过真核生物基因组中的位置不平衡,有效揭示了转录和翻译起始位点上游的顺式基序。
BMC Bioinformatics. 2006 Nov 30;7:522. doi: 10.1186/1471-2105-7-522.
5
Subtle motifs: defining the limits of motif finding algorithms.微妙基序:界定基序查找算法的局限性
Bioinformatics. 2002 Oct;18(10):1382-90. doi: 10.1093/bioinformatics/18.10.1382.
6
MotifCut: regulatory motifs finding with maximum density subgraphs.MotifCut:通过最大密度子图寻找调控基序
Bioinformatics. 2006 Jul 15;22(14):e150-7. doi: 10.1093/bioinformatics/btl243.
7
Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences.核苷酸序列中由位置特异性评分矩阵表示的基序簇的统计学意义。
Nucleic Acids Res. 2002 Jul 15;30(14):3214-24. doi: 10.1093/nar/gkf438.
8
Prediction of cis-regulatory elements: from high-information content analysis to motif identification.顺式调控元件的预测:从高信息含量分析到基序识别
J Bioinform Comput Biol. 2007 Aug;5(4):817-38. doi: 10.1142/s021972000700293x.
9
A novel Bayesian DNA motif comparison method for clustering and retrieval.一种用于聚类和检索的新型贝叶斯DNA基序比较方法。
PLoS Comput Biol. 2008 Feb 29;4(2):e1000010. doi: 10.1371/journal.pcbi.1000010.
10
Finding motifs from all sequences with and without binding sites.从所有具有和不具有结合位点的序列中寻找基序。
Bioinformatics. 2006 Sep 15;22(18):2217-23. doi: 10.1093/bioinformatics/btl371. Epub 2006 Jul 26.

引用本文的文献

1
Transcription factors organize into functional groups on the linear genome and in 3D chromatin.转录因子在线性基因组和三维染色质上组织形成功能组。
Heliyon. 2023 Jul 17;9(8):e18211. doi: 10.1016/j.heliyon.2023.e18211. eCollection 2023 Aug.
2
The orientation of transcription factor binding site motifs in gene promoter regions: does it matter?基因启动子区域中转录因子结合位点基序的方向:这重要吗?
BMC Genomics. 2016 Mar 3;17:185. doi: 10.1186/s12864-016-2549-x.
3
Genome-wide computational prediction and analysis of core promoter elements across plant monocots and dicots.

本文引用的文献

1
An efficient algorithm for the identification of structured motifs in DNA promoter sequences.一种识别DNA启动子序列中结构化基序的高效算法。
IEEE/ACM Trans Comput Biol Bioinform. 2006 Apr-Jun;3(2):126-40. doi: 10.1109/TCBB.2006.16.
2
Informative priors based on transcription factor structural class improve de novo motif discovery.基于转录因子结构类别的信息先验改进了从头基序发现。
Bioinformatics. 2006 Jul 15;22(14):e384-92. doi: 10.1093/bioinformatics/btl251.
3
A genomic code for nucleosome positioning.一种核小体定位的基因组编码。
跨单子叶植物和双子叶植物核心启动子元件的全基因组计算预测与分析
PLoS One. 2013 Oct 29;8(10):e79011. doi: 10.1371/journal.pone.0079011. eCollection 2013.
4
A Parzen window-based approach for the detection of locally enriched transcription factor binding sites.基于 Parzen 窗的方法用于检测局部富集的转录因子结合位点。
BMC Bioinformatics. 2013 Jan 21;14:26. doi: 10.1186/1471-2105-14-26.
5
MochiView: versatile software for genome browsing and DNA motif analysis.MochiView:功能多样的基因组浏览和 DNA 基序分析软件。
BMC Biol. 2010 Apr 21;8:49. doi: 10.1186/1741-7007-8-49.
6
Mechanisms and evolution of control logic in prokaryotic transcriptional regulation.原核生物转录调控中控制逻辑的机制与进化
Microbiol Mol Biol Rev. 2009 Sep;73(3):481-509, Table of Contents. doi: 10.1128/MMBR.00037-08.
7
Entropic Profiler - detection of conservation in genomes using information theory.熵分析器——利用信息论检测基因组中的保守性。
BMC Res Notes. 2009 May 5;2:72. doi: 10.1186/1756-0500-2-72.
8
Motif discovery in promoters of genes co-localized and co-expressed during myeloid cells differentiation.在髓样细胞分化过程中共定位和共表达的基因启动子中的基序发现。
Nucleic Acids Res. 2009 Feb;37(2):533-49. doi: 10.1093/nar/gkn948. Epub 2008 Dec 5.
9
Bioinformatics resources for the study of gene regulation in bacteria.用于研究细菌基因调控的生物信息学资源。
J Bacteriol. 2009 Jan;191(1):23-31. doi: 10.1128/JB.01017-08. Epub 2008 Oct 31.
Nature. 2006 Aug 17;442(7104):772-8. doi: 10.1038/nature04979. Epub 2006 Jul 19.
4
Analysis of computational approaches for motif discovery.基序发现的计算方法分析
Algorithms Mol Biol. 2006 May 19;1:8. doi: 10.1186/1748-7188-1-8.
5
EPD in its twentieth year: towards complete promoter coverage of selected model organisms.EPD二十年:迈向选定模式生物启动子的完全覆盖
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D82-5. doi: 10.1093/nar/gkj146.
6
The YEASTRACT database: a tool for the analysis of transcription regulatory associations in Saccharomyces cerevisiae.YEASTRACT数据库:一种用于分析酿酒酵母中转录调控关联的工具。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D446-51. doi: 10.1093/nar/gkj013.
7
Coordinate enhancers share common organizational features in the Drosophila genome.协同增强子在果蝇基因组中具有共同的组织特征。
Proc Natl Acad Sci U S A. 2004 Mar 16;101(11):3851-6. doi: 10.1073/pnas.0400611101.
8
Identification and distinct regulation of yeast TATA box-containing genes.酵母中含TATA框基因的鉴定与差异调控。
Cell. 2004 Mar 5;116(5):699-709. doi: 10.1016/s0092-8674(04)00205-3.
9
Distance preferences in the arrangement of binding motifs and hierarchical levels in organization of transcription regulatory information.转录调控信息组织中结合基序排列的距离偏好和层次水平
Nucleic Acids Res. 2003 Oct 15;31(20):6016-26. doi: 10.1093/nar/gkg799.
10
Occurrence probability of structured motifs in random sequences.
J Comput Biol. 2002;9(6):761-73. doi: 10.1089/10665270260518254.