• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Occurrence probability of structured motifs in random sequences.

作者信息

Robin S, Daudin J-J, Richard H, Sagot M-F, Schbath S

机构信息

INA-PG / INRA, UMR Biométrie et Intelligence Artificielle, 16, rue Claude Bernard, F-75005 Paris, France.

出版信息

J Comput Biol. 2002;9(6):761-73. doi: 10.1089/10665270260518254.

DOI:10.1089/10665270260518254
PMID:12614545
Abstract

The problem of extracting from a set of nucleic acid sequences motifs which may have biological function is more and more important. In this paper, we are interested in particular motifs that may be implicated in the transcription process. These motifs, called structured motifs, are composed of two ordered parts separated by a variable distance and allowing for substitutions. In order to assess their statistical significance, we propose approximations of the probability of occurrences of such a structured motif in a given sequence. An application of our method to evaluate candidate promoters in E. coli and B. subtilis is presented. Simulations show the goodness of the approximations.

摘要

相似文献

1
Occurrence probability of structured motifs in random sequences.
J Comput Biol. 2002;9(6):761-73. doi: 10.1089/10665270260518254.
2
Inferring regulatory elements from a whole genome. An analysis of Helicobacter pylori sigma(80) family of promoter signals.从全基因组推断调控元件。幽门螺杆菌σ80启动子信号家族分析。
J Mol Biol. 2000 Mar 24;297(2):335-53. doi: 10.1006/jmbi.2000.3576.
3
Checking homogeneity of motifs' distribution in heterogenous sequences.
J Comput Biol. 2005 Jul-Aug;12(6):672-85. doi: 10.1089/cmb.2005.12.672.
4
A greedy strategy for finding motifs from yes-no examples.
Pac Symp Biocomput. 1996:599-613.
5
Algorithms for extracting structured motifs using a suffix tree with an application to promoter and regulatory site consensus identification.使用后缀树提取结构化基序的算法及其在启动子和调控位点共有序列识别中的应用。
J Comput Biol. 2000;7(3-4):345-62. doi: 10.1089/106652700750050826.
6
A novel pairwise comparison method for in silico discovery of statistically significant cis-regulatory elements in eukaryotic promoter regions: application to Arabidopsis.一种用于在真核生物启动子区域进行计算机模拟发现具有统计学意义的顺式调控元件的新型成对比较方法:应用于拟南芥。
J Theor Biol. 2015 Jan 7;364:364-76. doi: 10.1016/j.jtbi.2014.09.038. Epub 2014 Oct 7.
7
A novel method for prokaryotic promoter prediction based on DNA stability.一种基于DNA稳定性的原核生物启动子预测新方法。
BMC Bioinformatics. 2005 Jan 5;6:1. doi: 10.1186/1471-2105-6-1.
8
Effect of DNA structural flexibility on promoter strength--molecular dynamics studies of E. coli promoter sequences.DNA结构灵活性对启动子强度的影响——大肠杆菌启动子序列的分子动力学研究
Biochem Biophys Res Commun. 2006 Mar 10;341(2):557-66. doi: 10.1016/j.bbrc.2005.12.215. Epub 2006 Jan 13.
9
Toward Algorithms for Automation of Postgenomic Data Analyses: Promoter Prediction with Artificial Neural Network.迈向基因组后数据分析自动化算法的研究:基于人工神经网络的启动子预测。
OMICS. 2020 May;24(5):300-309. doi: 10.1089/omi.2019.0041. Epub 2019 Oct 1.
10
Hybrid Gibbs-sampling algorithm for challenging motif discovery: GibbsDST.用于具有挑战性的基序发现的混合吉布斯采样算法:GibbsDST
Genome Inform. 2006;17(2):3-13.

引用本文的文献

1
An average-case efficient two-stage algorithm for enumerating all longest common substrings of minimum length between genome pairs.一种用于枚举基因组对之间所有最短长度最长公共子串的平均情况高效两阶段算法。
Proc (IEEE Int Conf Healthc Inform). 2024 Jun;2024:93-102. doi: 10.1109/ichi61247.2024.00020. Epub 2024 Aug 22.
2
Fast and exact quantification of motif occurrences in biological sequences.快速准确地定量生物序列中的基序出现次数。
BMC Bioinformatics. 2021 Sep 18;22(1):445. doi: 10.1186/s12859-021-04355-6.
3
Unsupervised statistical discovery of spaced motifs in prokaryotic genomes.
原核生物基因组中间隔基序的无监督统计发现。
BMC Genomics. 2017 Jan 5;18(1):27. doi: 10.1186/s12864-016-3400-0.
4
Importance sampling of word patterns in DNA and protein sequences.DNA和蛋白质序列中词模式的重要性抽样
J Comput Biol. 2010 Dec;17(12):1697-709. doi: 10.1089/cmb.2008.0233.
5
An analysis of the positional distribution of DNA motifs in promoter regions and its biological relevance.启动子区域DNA基序的位置分布分析及其生物学相关性。
BMC Bioinformatics. 2008 Feb 7;9:89. doi: 10.1186/1471-2105-9-89.
6
False occurrences of functional motifs in protein sequences highlight evolutionary constraints.蛋白质序列中功能基序的错误出现突出了进化限制。
BMC Bioinformatics. 2007 Mar 1;8:68. doi: 10.1186/1471-2105-8-68.
7
Stem-loop structures in prokaryotic genomes.原核生物基因组中的茎环结构。
BMC Genomics. 2006 Jul 4;7:170. doi: 10.1186/1471-2164-7-170.
8
Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics.使用有限马尔可夫链嵌入(FMCI)进行有效的p值计算:应用于局部得分和模式统计。
Algorithms Mol Biol. 2006 Apr 7;1(1):5. doi: 10.1186/1748-7188-1-5.
9
Flexible promoter architecture requirements for coactivator recruitment.共激活因子招募所需的灵活启动子结构要求。
BMC Mol Biol. 2006 Apr 28;7:16. doi: 10.1186/1471-2199-7-16.
10
BIPAD: a web server for modeling bipartite sequence elements.BIPAD:用于对二分序列元件进行建模的网络服务器。
BMC Bioinformatics. 2006 Feb 17;7:76. doi: 10.1186/1471-2105-7-76.