• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

寻找具有统计学意义的调控模块。

Searching for statistically significant regulatory modules.

作者信息

Bailey Timothy L, Noble William Stafford

机构信息

Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia.

出版信息

Bioinformatics. 2003 Oct;19 Suppl 2:ii16-25. doi: 10.1093/bioinformatics/btg1054.

DOI:10.1093/bioinformatics/btg1054
PMID:14534166
Abstract

MOTIVATION

The regulatory machinery controlling gene expression is complex, frequently requiring multiple, simultaneous DNA-protein interactions. The rate at which a gene is transcribed may depend upon the presence or absence of a collection of transcription factors bound to the DNA near the gene. Locating transcription factor binding sites in genomic DNA is difficult because the individual sites are small and tend to occur frequently by chance. True binding sites may be identified by their tendency to occur in clusters, sometimes known as regulatory modules.

RESULTS

We describe an algorithm for detecting occurrences of regulatory modules in genomic DNA. The algorithm, called mcast, takes as input a DNA database and a collection of binding site motifs that are known to operate in concert. mcast uses a motif-based hidden Markov model with several novel features. The model incorporates motif-specific p-values, thereby allowing scores from motifs of different widths and specificities to be compared directly. The p-value scoring also allows mcast to only accept motif occurrences with significance below a user-specified threshold, while still assigning better scores to motif occurrences with lower p-values. mcast can search long DNA sequences, modeling length distributions between motifs within a regulatory module, but ignoring length distributions between modules. The algorithm produces a list of predicted regulatory modules, ranked by E-value. We validate the algorithm using simulated data as well as real data sets from fruitfly and human.

AVAILABILITY

http://meme.sdsc.edu/MCAST/paper

摘要

动机

控制基因表达的调控机制很复杂,常常需要多个DNA - 蛋白质同时相互作用。一个基因的转录速率可能取决于与该基因附近DNA结合的一系列转录因子的存在与否。在基因组DNA中定位转录因子结合位点很困难,因为单个位点很小且往往会偶然频繁出现。真正的结合位点可通过它们在簇中出现的倾向来识别,这些簇有时被称为调控模块。

结果

我们描述了一种用于检测基因组DNA中调控模块出现情况的算法。该算法名为mcast,它将一个DNA数据库和一组已知协同作用的结合位点基序作为输入。mcast使用了一种具有几个新特性的基于基序的隐马尔可夫模型。该模型纳入了基序特异性p值,从而允许直接比较不同宽度和特异性的基序得分。p值评分还允许mcast仅接受显著性低于用户指定阈值的基序出现情况,同时仍为p值较低的基序出现情况赋予更好的分数。mcast可以搜索长DNA序列,对调控模块内基序之间的长度分布进行建模,但忽略模块之间的长度分布。该算法生成一份按E值排序的预测调控模块列表。我们使用模拟数据以及果蝇和人类的真实数据集对该算法进行了验证。

可用性

http://meme.sdsc.edu/MCAST/paper

相似文献

1
Searching for statistically significant regulatory modules.寻找具有统计学意义的调控模块。
Bioinformatics. 2003 Oct;19 Suppl 2:ii16-25. doi: 10.1093/bioinformatics/btg1054.
2
Statistical significance of cis-regulatory modules.顺式调控模块的统计学显著性
BMC Bioinformatics. 2007 Jan 22;8:19. doi: 10.1186/1471-2105-8-19.
3
Computational detection of cis -regulatory modules.顺式调控模块的计算检测
Bioinformatics. 2003 Oct;19 Suppl 2:ii5-14. doi: 10.1093/bioinformatics/btg1052.
4
A graph-based approach to systematically reconstruct human transcriptional regulatory modules.一种基于图形的方法来系统地重建人类转录调控模块。
Bioinformatics. 2007 Jul 1;23(13):i577-86. doi: 10.1093/bioinformatics/btm227.
5
On counting position weight matrix matches in a sequence, with application to discriminative motif finding.关于计算序列中的位置权重矩阵匹配及其在判别性基序发现中的应用。
Bioinformatics. 2006 Jul 15;22(14):e454-63. doi: 10.1093/bioinformatics/btl227.
6
MCAST: scanning for cis-regulatory motif clusters.MCAST:扫描顺式调控基序簇
Bioinformatics. 2016 Apr 15;32(8):1217-9. doi: 10.1093/bioinformatics/btv750. Epub 2015 Dec 24.
7
MotifCut: regulatory motifs finding with maximum density subgraphs.MotifCut:通过最大密度子图寻找调控基序
Bioinformatics. 2006 Jul 15;22(14):e150-7. doi: 10.1093/bioinformatics/btl243.
8
Finding optimal degenerate patterns in DNA sequences.寻找DNA序列中的最优简并模式。
Bioinformatics. 2003 Oct;19 Suppl 2:ii206-14. doi: 10.1093/bioinformatics/btg1079.
9
Finding regulatory elements and regulatory motifs: a general probabilistic framework.寻找调控元件和调控基序:一个通用的概率框架。
BMC Bioinformatics. 2007 Sep 27;8 Suppl 6(Suppl 6):S4. doi: 10.1186/1471-2105-8-S6-S4.
10
Learning probabilistic models of cis-regulatory modules that represent logical and spatial aspects.学习代表逻辑和空间方面的顺式调控模块的概率模型。
Bioinformatics. 2007 Jan 15;23(2):e156-62. doi: 10.1093/bioinformatics/btl319.

引用本文的文献

1
BestCRM: An Exhaustive Search for Optimal Cis-Regulatory Modules in Promoters Accelerated by the Multidimensional Hash Function.BestCRM:多维哈希函数加速启动子中最优顺式调控模块的全面搜索。
Int J Mol Sci. 2024 Feb 5;25(3):1903. doi: 10.3390/ijms25031903.
2
In-silico identification and comparison of transcription factor binding sites cluster in anterior-posterior patterning genes in Drosophila melanogaster and Tribolium castaneum.在果蝇和赤拟谷盗的前后模式基因中,转录因子结合位点簇的计算机识别和比较。
PLoS One. 2023 Aug 17;18(8):e0290035. doi: 10.1371/journal.pone.0290035. eCollection 2023.
3
A survey on algorithms to characterize transcription factor binding sites.
一种用于刻画转录因子结合位点的算法研究综述。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad156.
4
Dimeric p53 Mutant Elicits Unique Tumor-Suppressive Activities through an Altered Metabolic Program.二聚体 p53 突变通过改变代谢程序发挥独特的肿瘤抑制活性。
Cancer Discov. 2023 May 4;13(5):1230-1249. doi: 10.1158/2159-8290.CD-22-0872.
5
Replication stress generates distinctive landscapes of DNA copy number alterations and chromosome scale losses.复制压力会产生独特的 DNA 拷贝数改变和染色体尺度丢失景观。
Genome Biol. 2022 Oct 20;23(1):223. doi: 10.1186/s13059-022-02781-0.
6
Motif and conserved module analysis in DNA (promoters, enhancers) and RNA (lncRNA, mRNA) using AlModules.使用 AlModules 对 DNA(启动子、增强子)和 RNA(lncRNA、mRNA)进行模体和保守模块分析。
Sci Rep. 2022 Oct 20;12(1):17588. doi: 10.1038/s41598-022-21732-0.
7
Identification of upstream transcription factor binding sites in orthologous genes using mixed Student's t-test statistics.使用混合 Student's t 检验统计量鉴定直系同源基因中的上游转录因子结合位点。
PLoS Comput Biol. 2022 Jun 7;18(6):e1009773. doi: 10.1371/journal.pcbi.1009773. eCollection 2022 Jun.
8
Genome-Wide Identification of Auxin Response Factors in Peanut ( L.) and Functional Analysis in Root Morphology.花生中生长素响应因子的全基因组鉴定及其在根系形态建成中的功能分析。
Int J Mol Sci. 2022 May 10;23(10):5309. doi: 10.3390/ijms23105309.
9
The transcription factor TaMYB31 regulates the benzoxazinoid biosynthetic pathway in wheat.转录因子 TaMYB31 调控小麦苯并恶嗪类生物合成途径。
J Exp Bot. 2022 Sep 12;73(16):5634-5649. doi: 10.1093/jxb/erac204.
10
regCNN: identifying genome-wide -regulatory modules via integrating the local patterns in epigenetic marks and transcription factor binding motifs.regCNN:通过整合表观遗传标记中的局部模式和转录因子结合基序来识别全基因组调控模块。
Comput Struct Biotechnol J. 2021 Dec 18;20:296-308. doi: 10.1016/j.csbj.2021.12.015. eCollection 2022.