新型RNA拓扑结构的候选者。

Candidates for novel RNA topologies.

作者信息

Kim Namhee, Shiffeldrim Nahum, Gan Hin Hark, Schlick Tamar

机构信息

Department of Chemistry, New York University, 100 Washington Square East, Room 1001, New York, NY 10003, USA.

出版信息

J Mol Biol. 2004 Aug 27;341(5):1129-44. doi: 10.1016/j.jmb.2004.06.054.

DOI:10.1016/j.jmb.2004.06.054

PMID:15321711

Abstract

Because the functional repertiore of RNA molecules, like proteins, is closely linked to the diversity of their shapes, uncovering RNA's structural repertoire is vital for identifying novel RNAs, especially in genomic sequences. To help expand the limited number of known RNA families, we use graphical representation and clustering analysis of RNA secondary structures to predict novel RNA topologies and their abundance as a function of size. Representing the essential topological properties of RNA secondary structures as graphs enables enumeration, generation, and prediction of novel RNA motifs. We apply a probabilistic graph-growing method to construct the RNA structure space encompassing the topologies of existing and hypothetical RNAs and cluster all RNA topologies into two groups using topological descriptors and a standard clustering algorithm. Significantly, we find that nearly all existing RNAs fall into one group, which we refer to as "RNA-like"; we consider the other group "non-RNA-like". Our method predicts many candidates for novel RNA secondary topologies, some of which are remarkably similar to existing structures; interestingly, the centroid of the RNA-like group is the tmRNA fold, a pseudoknot having both tRNA-like and mRNA-like functions. Additionally, our approach allows estimation of the relative abundance of pseudoknot and other (e.g. tree) motifs using the "edge-cut" property of RNA graphs. This analysis suggests that pseudoknots dominate the RNA structure universe, representing more than 90% when the sequence length exceeds 120 nt; the predicted trend for <100 nt agrees with data for existing RNAs. Together with our predictions for novel "RNA-like" topologies, our analysis can help direct the design of functional RNAs and identification of novel RNA folds in genomes through an efficient topology-directed search, which grows much more slowly in complexity with RNA size compared to the traditional sequence-based search.

摘要

由于RNA分子的功能 repertoire，如同蛋白质一样，与其形状的多样性紧密相连，揭示RNA的结构 repertoire对于识别新型RNA至关重要，尤其是在基因组序列中。为了帮助扩展已知RNA家族数量的限制，我们使用RNA二级结构的图形表示和聚类分析来预测新型RNA拓扑结构及其作为大小函数的丰度。将RNA二级结构的基本拓扑特性表示为图形能够枚举、生成和预测新型RNA基序。我们应用一种概率性图形生长方法来构建包含现有和假设RNA拓扑结构的RNA结构空间，并使用拓扑描述符和标准聚类算法将所有RNA拓扑结构聚类为两组。值得注意的是，我们发现几乎所有现有的RNA都属于一组，我们将其称为“类RNA”；我们将另一组视为“非类RNA”。我们的方法预测了许多新型RNA二级拓扑结构的候选者，其中一些与现有结构非常相似；有趣的是，类RNA组的质心是tmRNA折叠，一种具有tRNA样和mRNA样功能的假结。此外，我们的方法允许使用RNA图形的“边切割”特性估计假结和其他（例如树状）基序的相对丰度。该分析表明，假结在RNA结构宇宙中占主导地位，当序列长度超过120 nt时占比超过90%；对于<100 nt的预测趋势与现有RNA的数据一致。连同我们对新型“类RNA”拓扑结构的预测，我们的分析可以通过高效的拓扑导向搜索帮助指导功能性RNA的设计和基因组中新型RNA折叠的识别，与传统的基于序列的搜索相比，其复杂度随RNA大小的增长要慢得多。

相似文献

Candidates for novel RNA topologies.

J Mol Biol. 2004 Aug 27;341(5):1129-44. doi: 10.1016/j.jmb.2004.06.054.

RAG: RNA-As-Graphs database--concepts, analysis, and features.

Bioinformatics. 2004 May 22;20(8):1285-91. doi: 10.1093/bioinformatics/bth084. Epub 2004 Feb 12.

A graph theoretical approach for predicting common RNA secondary structure motifs including pseudoknots in unaligned sequences.

Bioinformatics. 2004 Jul 10;20(10):1591-602. doi: 10.1093/bioinformatics/bth131. Epub 2004 Feb 12.

Exploring the repertoire of RNA secondary motifs using graph theory; implications for RNA design.

Nucleic Acids Res. 2003 Jun 1;31(11):2926-43. doi: 10.1093/nar/gkg365.

A new algorithm for RNA secondary structure design.

J Mol Biol. 2004 Feb 20;336(3):607-24. doi: 10.1016/j.jmb.2003.12.041.

Predicting candidate genomic sequences that correspond to synthetic functional RNA motifs.

Nucleic Acids Res. 2005 Oct 27;33(18):6057-69. doi: 10.1093/nar/gki911. Print 2005.

An algorithm for searching RNA motifs in genomic sequences.

Biomol Eng. 2007 Sep;24(3):343-50. doi: 10.1016/j.bioeng.2007.02.005. Epub 2007 Mar 3.

Topological classification of RNA structures.

J Mol Biol. 2008 Jun 13;379(4):900-11. doi: 10.1016/j.jmb.2008.04.033. Epub 2008 Apr 18.

Second eigenvalue of the Laplacian matrix for predicting RNA conformational switch by mutation.

Bioinformatics. 2004 Aug 12;20(12):1861-9. doi: 10.1093/bioinformatics/bth157. Epub 2004 Feb 26.

A method for aligning RNA secondary structures and its application to RNA motif detection.

BMC Bioinformatics. 2005 Apr 7;6:89. doi: 10.1186/1471-2105-6-89.

引用本文的文献

How large is the universe of RNA-like motifs? A clustering analysis of RNA graph motifs using topological descriptors.

PLoS Comput Biol. 2025 Jul 15;21(7):e1013230. doi: 10.1371/journal.pcbi.1013230. eCollection 2025 Jul.

How Large is the Universe of RNA-Like Motifs? A Clustering Analysis of RNA Graph Motifs Using Topological Descriptors.

ArXiv. 2025 Jan 8:arXiv:2501.04258v1.

RNA-As-Graphs Motif Atlas-Dual Graph Library of RNA Modules and Viral Frameshifting-Element Applications.

Int J Mol Sci. 2022 Aug 17;23(16):9249. doi: 10.3390/ijms23169249.

Identification of novel RNA design candidates by clustering the extended RNA-As-Graphs library.

Biochim Biophys Acta Gen Subj. 2020 Jun;1864(6):129534. doi: 10.1016/j.bbagen.2020.129534. Epub 2020 Jan 16.

Inverse folding with RNA-As-Graphs produces a large pool of candidate sequences with target topologies.

J Struct Biol. 2020 Mar 1;209(3):107438. doi: 10.1016/j.jsb.2019.107438. Epub 2019 Dec 23.

An extended dual graph library and partitioning algorithm applicable to pseudoknotted RNA structures.

Methods. 2019 Jun 1;162-163:74-84. doi: 10.1016/j.ymeth.2019.03.022. Epub 2019 Mar 27.

A pipeline for computational design of novel RNA-like topologies.

Nucleic Acids Res. 2018 Aug 21;46(14):7040-7051. doi: 10.1093/nar/gky524.

Dual Graph Partitioning Highlights a Small Group of Pseudoknot-Containing RNA Submotifs.

Genes (Basel). 2018 Jul 25;9(8):371. doi: 10.3390/genes9080371.

Adventures with RNA graphs.

Methods. 2018 Jul 1;143:16-33. doi: 10.1016/j.ymeth.2018.03.009. Epub 2018 Apr 3.

Opportunities and Challenges in RNA Structural Modeling and Design.

Biophys J. 2017 Jul 25;113(2):225-234. doi: 10.1016/j.bpj.2016.12.037. Epub 2017 Feb 2.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

新型RNA拓扑结构的候选者。

Candidates for novel RNA topologies.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献