Suppr超能文献

利用RNA二级结构引导在单链区域寻找序列基序。

Using RNA secondary structures to guide sequence motif finding towards single-stranded regions.

作者信息

Hiller Michael, Pudimat Rainer, Busch Anke, Backofen Rolf

机构信息

Institute of Computer Science, Chair for Bioinformatics, Albert-Ludwigs-University Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany.

出版信息

Nucleic Acids Res. 2006;34(17):e117. doi: 10.1093/nar/gkl544. Epub 2006 Sep 20.

Abstract

RNA binding proteins recognize RNA targets in a sequence specific manner. Apart from the sequence, the secondary structure context of the binding site also affects the binding affinity. Binding sites are often located in single-stranded RNA regions and it was shown that the sequestration of a binding motif in a double-strand abolishes protein binding. Thus, it is desirable to include knowledge about RNA secondary structures when searching for the binding motif of a protein. We present the approach MEMERIS for searching sequence motifs in a set of RNA sequences and simultaneously integrating information about secondary structures. To abstract from specific structural elements, we precompute position-specific values measuring the single-strandedness of all substrings of an RNA sequence. These values are used as prior knowledge about the motif starts to guide the motif search. Extensive tests with artificial and biological data demonstrate that MEMERIS is able to identify motifs in single-stranded regions even if a stronger motif located in double-strand parts exists. The discovered motif occurrences in biological datasets mostly coincide with known protein-binding sites. This algorithm can be used for finding the binding motif of single-stranded RNA-binding proteins in SELEX or other biological sequence data.

摘要

RNA结合蛋白以序列特异性方式识别RNA靶标。除了序列之外,结合位点的二级结构背景也会影响结合亲和力。结合位点通常位于单链RNA区域,并且研究表明双链中结合基序的隔离会消除蛋白质结合。因此,在寻找蛋白质的结合基序时,纳入有关RNA二级结构的知识是很有必要的。我们提出了MEMERIS方法,用于在一组RNA序列中搜索序列基序,并同时整合有关二级结构的信息。为了从特定结构元件中抽象出来,我们预先计算测量RNA序列所有子串单链性的位置特异性值。这些值用作有关基序起始的先验知识,以指导基序搜索。对人工和生物学数据的广泛测试表明,即使存在位于双链部分的更强基序,MEMERIS也能够识别单链区域中的基序。在生物学数据集中发现的基序出现情况大多与已知的蛋白质结合位点一致。该算法可用于在SELEX或其他生物学序列数据中寻找单链RNA结合蛋白的结合基序。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d1e/1903381/bd3d39bb3784/gkl544f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验