• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种在序列中寻找短基序的精确方法及其在核糖体结合位点问题中的应用。

An exact method for finding short motifs in sequences, with application to the ribosome binding site problem.

作者信息

Tompa M

机构信息

Department of Computer Science and Engineering, University of Washington, Seattle 98195-2350, USA.

出版信息

Proc Int Conf Intell Syst Mol Biol. 1999:262-71.

PMID:10786309
Abstract

This is an investigation of methods for finding short motifs that only occur in a fraction of the input sequences. Unlike local search techniques that may not reach a global optimum, the method proposed here is guaranteed to produce the motifs with greatest z-scores. This method is illustrated for the Ribosome Binding Site Problem, which is to identify the short mRNA 5' untranslated sequence that is recognized by the ribosome during initiation of protein synthesis. Experiments were performed to solve this problem for each of fourteen sequenced prokaryotes, by applying the method to the full complement of genes from each. One of the interesting results of this experimentation is evidence that the recognized sequence of the thermophilic archaea A. fulgidus, M. jannaschii, M. thermoautotrophicum, and P. horikoshii may be somewhat different than the well known Shine-Dalgarno sequence.

摘要

这是一项关于寻找仅在部分输入序列中出现的短基序的方法的研究。与可能无法达到全局最优的局部搜索技术不同,这里提出的方法可保证产生具有最大z分数的基序。该方法通过核糖体结合位点问题进行说明,该问题是识别蛋白质合成起始过程中被核糖体识别的短mRNA 5'非翻译序列。通过将该方法应用于14种已测序原核生物中每种生物的全部基因互补序列,进行了实验以解决这个问题。该实验的一个有趣结果是,有证据表明嗜热古菌嗜热栖热菌、詹氏甲烷球菌、嗜热自养甲烷杆菌和掘越栖热菌的识别序列可能与众所周知的夏因-达尔加诺序列有所不同。

相似文献

1
An exact method for finding short motifs in sequences, with application to the ribosome binding site problem.一种在序列中寻找短基序的精确方法及其在核糖体结合位点问题中的应用。
Proc Int Conf Intell Syst Mol Biol. 1999:262-71.
2
A statistical method for finding transcription factor binding sites.一种寻找转录因子结合位点的统计方法。
Proc Int Conf Intell Syst Mol Biol. 2000;8:344-54.
3
Improved Exact Enumerative Algorithms for the Planted (l, d)-Motif Search Problem.用于植入式(l, d)基序搜索问题的改进精确枚举算法。
IEEE/ACM Trans Comput Biol Bioinform. 2014 Mar-Apr;11(2):361-74. doi: 10.1109/TCBB.2014.2306842.
4
Anatomy of Escherichia coli ribosome binding sites.大肠杆菌核糖体结合位点的剖析。
J Mol Biol. 2001 Oct 12;313(1):215-28. doi: 10.1006/jmbi.2001.5040.
5
Finding motifs with insufficient number of strong binding sites.发现具有数量不足的强结合位点的基序。
J Comput Biol. 2005 Jul-Aug;12(6):686-701. doi: 10.1089/cmb.2005.12.686.
6
Finding motifs from all sequences with and without binding sites.从所有具有和不具有结合位点的序列中寻找基序。
Bioinformatics. 2006 Sep 15;22(18):2217-23. doi: 10.1093/bioinformatics/btl371. Epub 2006 Jul 26.
7
Naturally occurring adenines within mRNA coding sequences affect ribosome binding and expression in Escherichia coli.信使核糖核酸(mRNA)编码序列中的天然腺嘌呤会影响大肠杆菌中的核糖体结合及表达。
J Bacteriol. 2007 Jan;189(2):501-10. doi: 10.1128/JB.01356-06. Epub 2006 Nov 3.
8
On counting position weight matrix matches in a sequence, with application to discriminative motif finding.关于计算序列中的位置权重矩阵匹配及其在判别性基序发现中的应用。
Bioinformatics. 2006 Jul 15;22(14):e454-63. doi: 10.1093/bioinformatics/btl227.
9
Deriving ribosomal binding site (RBS) statistical models from unannotated DNA sequences and the use of the RBS model for N-terminal prediction.从无注释的DNA序列推导核糖体结合位点(RBS)统计模型以及将RBS模型用于N端预测。
Pac Symp Biocomput. 1998:279-90.
10
An extension and novel solution to the (l,d)-motif challenge problem.针对(l,d)-基序挑战问题的一种扩展及新颖解决方案。
Genome Inform. 2004;15(2):63-71.

引用本文的文献

1
Transcription factor-binding k-mer analysis clarifies the cell type dependency of binding specificities and cis-regulatory SNPs in humans.转录因子结合 k- -mer 分析阐明了人类结合特异性和顺式调控 SNP 的细胞类型依赖性。
BMC Genomics. 2023 Oct 7;24(1):597. doi: 10.1186/s12864-023-09692-9.
2
Reducing metabolic burden in the PACEmid evolver system by remastering high-copy phagemid vectors.通过重新构建高拷贝噬菌粒载体减轻PACEmid进化系统中的代谢负担。
Eng Biol. 2022 May 20;6(2-3):50-61. doi: 10.1049/enb2.12021. eCollection 2022 Jun-Sep.
3
Prokaryotic coding regions have little if any specific depletion of Shine-Dalgarno motifs.
原核生物编码区几乎没有特有的 Shine-Dalgarno 基序缺失。
PLoS One. 2018 Aug 23;13(8):e0202768. doi: 10.1371/journal.pone.0202768. eCollection 2018.
4
Re-annotation of 12,495 prokaryotic 16S rRNA 3' ends and analysis of Shine-Dalgarno and anti-Shine-Dalgarno sequences.重新注释 12495 个原核生物 16S rRNA 3' 端并分析 Shine-Dalgarno 和反 Shine-Dalgarno 序列。
PLoS One. 2018 Aug 23;13(8):e0202767. doi: 10.1371/journal.pone.0202767. eCollection 2018.
5
A novel -mer set memory (KSM) motif representation improves regulatory variant prediction.一种新型 -mer 集记忆 (KSM) 基序表示法可提高调控变异预测的准确性。
Genome Res. 2018 Jun;28(6):891-900. doi: 10.1101/gr.226852.117. Epub 2018 Apr 13.
6
Unsupervised statistical discovery of spaced motifs in prokaryotic genomes.原核生物基因组中间隔基序的无监督统计发现。
BMC Genomics. 2017 Jan 5;18(1):27. doi: 10.1186/s12864-016-3400-0.
7
SeqGL Identifies Context-Dependent Binding Signals in Genome-Wide Regulatory Element Maps.SeqGL在全基因组调控元件图谱中识别上下文相关的结合信号。
PLoS Comput Biol. 2015 May 27;11(5):e1004271. doi: 10.1371/journal.pcbi.1004271. eCollection 2015 May.
8
A hybrid method for the exact planted (l, d) motif finding problem and its parallelization.用于精确种植 (l, d) 模式问题的混合方法及其并行化。
BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S10. doi: 10.1186/1471-2105-13-S17-S10. Epub 2012 Dec 13.
9
A Monte Carlo-based framework enhances the discovery and interpretation of regulatory sequence motifs.基于蒙特卡罗的框架增强了调控序列基序的发现和解释。
BMC Bioinformatics. 2012 Nov 27;13:317. doi: 10.1186/1471-2105-13-317.
10
Direct vs 2-stage approaches to structured motif finding.用于结构化基序发现的直接方法与两阶段方法
Algorithms Mol Biol. 2012 Aug 21;7(1):20. doi: 10.1186/1748-7188-7-20.