• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于边着色图的最大公共子图快速算法寻找 RNA 结构网络的周期性

Finding recurrent RNA structural networks with fast maximal common subgraphs of edge-colored graphs.

机构信息

School of Computer Science, McGill University, Montréal, Canada.

LiX, École Polytechnique, Paris, France.

出版信息

PLoS Comput Biol. 2021 May 28;17(5):e1008990. doi: 10.1371/journal.pcbi.1008990. eCollection 2021 May.

DOI:10.1371/journal.pcbi.1008990
PMID:34048427
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8191989/
Abstract

RNA tertiary structure is crucial to its many non-coding molecular functions. RNA architecture is shaped by its secondary structure composed of stems, stacked canonical base pairs, enclosing loops. While stems are precisely captured by free-energy models, loops composed of non-canonical base pairs are not. Nor are distant interactions linking together those secondary structure elements (SSEs). Databases of conserved 3D geometries (a.k.a. modules) not captured by energetic models are leveraged for structure prediction and design, but the computational complexity has limited their study to local elements, loops. Representing the RNA structure as a graph has recently allowed to expend this work to pairs of SSEs, uncovering a hierarchical organization of these 3D modules, at great computational cost. Systematically capturing recurrent patterns on a large scale is a main challenge in the study of RNA structures. In this paper, we present an efficient algorithm to compute maximal isomorphisms in edge colored graphs. We extend this algorithm to a framework well suited to identify RNA modules, and fast enough to considerably generalize previous approaches. To exhibit the versatility of our framework, we first reproduce results identifying all common modules spanning more than 2 SSEs, in a few hours instead of weeks. The efficiency of our new algorithm is demonstrated by computing the maximal modules between any pair of entire RNA in the non-redundant corpus of known RNA 3D structures. We observe that the biggest modules our method uncovers compose large shared sub-structure spanning hundreds of nucleotides and base pairs between the ribosomes of Thermus thermophilus, Escherichia Coli, and Pseudomonas aeruginosa.

摘要

RNA 的三级结构对其许多非编码分子功能至关重要。RNA 的结构由其二级结构形成,二级结构由茎、堆积的规范碱基对、封闭的环组成。虽然茎可以被自由能模型精确捕捉,但由非规范碱基对组成的环则不能。连接这些二级结构元件(SSEs)的远距离相互作用也不能。数据库中保守的 3D 几何形状(也称为模块)没有被能量模型捕获,用于结构预测和设计,但计算复杂性限制了它们对局部元素、环的研究。最近,将 RNA 结构表示为图的方法允许将这项工作扩展到 SSE 对,揭示了这些 3D 模块的层次组织,但计算成本很高。在大规模上系统地捕捉重复模式是 RNA 结构研究的主要挑战。在本文中,我们提出了一种有效算法来计算边着色图中的最大同构。我们将此算法扩展到一个非常适合识别 RNA 模块的框架中,并且足够快,可以极大地推广以前的方法。为了展示我们框架的多功能性,我们首先重现了识别跨越超过 2 个 SSE 的所有常见模块的结果,仅需几个小时而不是几周。我们的新算法的效率通过计算已知 RNA 3D 结构非冗余文库中任何两个完整 RNA 之间的最大模块来证明。我们观察到,我们的方法揭示的最大模块由跨越 Thermus thermophilus、Escherichia Coli 和 Pseudomonas aeruginosa 核糖体数百个核苷酸和碱基对的大共享子结构组成。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/dae7d9f11d0d/pcbi.1008990.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/bb64809d2d6a/pcbi.1008990.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/a8fb4902b7a0/pcbi.1008990.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/6ffdca1546ee/pcbi.1008990.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/e49f5b237468/pcbi.1008990.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/4519cea36128/pcbi.1008990.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/c45d1e283bbc/pcbi.1008990.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/d832a6e1cb34/pcbi.1008990.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/68094c8fb408/pcbi.1008990.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/2590c1ebf6ca/pcbi.1008990.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/bcb9a44e839c/pcbi.1008990.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/3b4c39042f97/pcbi.1008990.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/34cd063b8fbf/pcbi.1008990.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/a4f9d87910d4/pcbi.1008990.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/dae7d9f11d0d/pcbi.1008990.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/bb64809d2d6a/pcbi.1008990.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/a8fb4902b7a0/pcbi.1008990.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/6ffdca1546ee/pcbi.1008990.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/e49f5b237468/pcbi.1008990.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/4519cea36128/pcbi.1008990.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/c45d1e283bbc/pcbi.1008990.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/d832a6e1cb34/pcbi.1008990.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/68094c8fb408/pcbi.1008990.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/2590c1ebf6ca/pcbi.1008990.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/bcb9a44e839c/pcbi.1008990.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/3b4c39042f97/pcbi.1008990.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/34cd063b8fbf/pcbi.1008990.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/a4f9d87910d4/pcbi.1008990.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9937/8191989/dae7d9f11d0d/pcbi.1008990.g014.jpg

相似文献

1
Finding recurrent RNA structural networks with fast maximal common subgraphs of edge-colored graphs.基于边着色图的最大公共子图快速算法寻找 RNA 结构网络的周期性
PLoS Comput Biol. 2021 May 28;17(5):e1008990. doi: 10.1371/journal.pcbi.1008990. eCollection 2021 May.
2
Towards 3D structure prediction of large RNA molecules: an integer programming framework to insert local 3D motifs in RNA secondary structure.针对大型 RNA 分子的 3D 结构预测:在 RNA 二级结构中插入局部 3D 模体的整数规划框架。
Bioinformatics. 2012 Jun 15;28(12):i207-14. doi: 10.1093/bioinformatics/bts226.
3
FR3D: finding local and composite recurrent structural motifs in RNA 3D structures.FR3D:在RNA三维结构中寻找局部和复合重复结构基序
J Math Biol. 2008 Jan;56(1-2):215-52. doi: 10.1007/s00285-007-0110-x. Epub 2007 Aug 11.
4
Automated, customizable and efficient identification of 3D base pair modules with BayesPairing.使用 BayesPairing 实现 3D 碱基对模块的自动化、可定制和高效识别。
Nucleic Acids Res. 2019 Apr 23;47(7):3321-3332. doi: 10.1093/nar/gkz102.
5
RNA 3D Modules in Genome-Wide Predictions of RNA 2D Structure.全基因组RNA二维结构预测中的RNA三维模块
PLoS One. 2015 Oct 28;10(10):e0139900. doi: 10.1371/journal.pone.0139900. eCollection 2015.
6
Mining for recurrent long-range interactions in RNA structures reveals embedded hierarchies in network families.挖掘 RNA 结构中的反复长程相互作用揭示了网络家族中的嵌入层次结构。
Nucleic Acids Res. 2018 May 4;46(8):3841-3851. doi: 10.1093/nar/gky197.
7
Classification and Identification of Non-canonical Base Pairs and Structural Motifs.非规范碱基对和结构模体的分类与鉴定。
Methods Mol Biol. 2024;2726:143-168. doi: 10.1007/978-1-0716-3519-3_7.
8
Going beyond base-pairs: topology-based characterization of base-multiplets in RNA.超越碱基对:RNA 中碱基多联体的拓扑特征分析。
RNA. 2019 May;25(5):573-589. doi: 10.1261/rna.068551.118. Epub 2019 Feb 21.
9
An extended dual graph library and partitioning algorithm applicable to pseudoknotted RNA structures.适用于假结 RNA 结构的扩展对偶图库和分区算法。
Methods. 2019 Jun 1;162-163:74-84. doi: 10.1016/j.ymeth.2019.03.022. Epub 2019 Mar 27.
10
RNA structure and dynamics: a base pairing perspective.RNA 结构与动态:碱基配对视角。
Prog Biophys Mol Biol. 2013 Nov;113(2):264-83. doi: 10.1016/j.pbiomolbio.2013.07.003. Epub 2013 Jul 23.

引用本文的文献

1
RNAMotifProfile: a graph-based approach to build RNA structural motif profiles.RNA基序图谱:一种基于图形的构建RNA结构基序图谱的方法。
NAR Genom Bioinform. 2024 Sep 26;6(3):lqae128. doi: 10.1093/nargab/lqae128. eCollection 2024 Sep.
2
Sampling globally and locally correct RNA 3D structures using Ernwin, SPQR and experimental SAXS data.使用 Ernwin、SPQR 和实验 SAXS 数据全局和局部采样 RNA 三维结构。
Nucleic Acids Res. 2024 Sep 9;52(16):e73. doi: 10.1093/nar/gkae602.
3
Concurrent prediction of RNA secondary structures with pseudoknots and local 3D motifs in an integer programming framework.

本文引用的文献

1
Inverse folding with RNA-As-Graphs produces a large pool of candidate sequences with target topologies.通过 RNA-As-Graphs 进行反向折叠会产生大量具有目标拓扑结构的候选序列。
J Struct Biol. 2020 Mar 1;209(3):107438. doi: 10.1016/j.jsb.2019.107438. Epub 2019 Dec 23.
2
Mining for recurrent long-range interactions in RNA structures reveals embedded hierarchies in network families.挖掘 RNA 结构中的反复长程相互作用揭示了网络家族中的嵌入层次结构。
Nucleic Acids Res. 2018 May 4;46(8):3841-3851. doi: 10.1093/nar/gky197.
3
RNA 3-dimensional structural motifs as a critical constraint of viroid RNA evolution.
在整数规划框架中同时预测具有假结和局部 3D 模体的 RNA 二级结构。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae022.
4
A comprehensive survey of long-range tertiary interactions and motifs in non-coding RNA structures.非编码 RNA 结构中长程三级相互作用和基序的综合调查。
Nucleic Acids Res. 2023 Sep 8;51(16):8367-8382. doi: 10.1093/nar/gkad605.
5
RNAMotifComp: a comprehensive method to analyze and identify structurally similar RNA motif families.RNAMotifComp:一种全面的分析和识别结构相似 RNA 基序家族的方法。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i337-i346. doi: 10.1093/bioinformatics/btad223.
6
On the predictibility of A-minor motifs from their local contexts.从局部环境预测 A 小调动机的可预测性。
RNA Biol. 2022 Jan;19(1):1208-1227. doi: 10.1080/15476286.2022.2144611.
RNA三维结构基序作为类病毒RNA进化的关键限制因素。
PLoS Pathog. 2018 Feb 22;14(2):e1006801. doi: 10.1371/journal.ppat.1006801. eCollection 2018 Feb.
4
Combining structure probing data on RNA mutants with evolutionary information reveals RNA-binding interfaces.将RNA突变体的结构探测数据与进化信息相结合,可揭示RNA结合界面。
Nucleic Acids Res. 2016 Jun 20;44(11):e104. doi: 10.1093/nar/gkw217. Epub 2016 Apr 19.
5
InterRNA: a database of base interactions in RNA structures.InterRNA:RNA结构中碱基相互作用的数据库。
Nucleic Acids Res. 2016 Jan 4;44(D1):D266-71. doi: 10.1093/nar/gkv1186. Epub 2015 Nov 8.
6
Identifying novel sequence variants of RNA 3D motifs.识别RNA三维基序的新型序列变体。
Nucleic Acids Res. 2015 Sep 3;43(15):7504-20. doi: 10.1093/nar/gkv651. Epub 2015 Jun 29.
7
Evolution of the ribosome at atomic resolution.原子分辨率下核糖体的进化。
Proc Natl Acad Sci U S A. 2014 Jul 15;111(28):10251-6. doi: 10.1073/pnas.1407205111. Epub 2014 Jun 30.
8
RNA Bricks--a database of RNA 3D motifs and their interactions.RNA 积木——RNA 三维模体及其相互作用数据库。
Nucleic Acids Res. 2014 Jan;42(Database issue):D123-31. doi: 10.1093/nar/gkt1084. Epub 2013 Nov 12.
9
Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas.RNA 三维结构基序的自动分类和 RNA 三维结构基序图集。
RNA. 2013 Oct;19(10):1327-40. doi: 10.1261/rna.039438.113. Epub 2013 Aug 22.
10
Towards 3D structure prediction of large RNA molecules: an integer programming framework to insert local 3D motifs in RNA secondary structure.针对大型 RNA 分子的 3D 结构预测:在 RNA 二级结构中插入局部 3D 模体的整数规划框架。
Bioinformatics. 2012 Jun 15;28(12):i207-14. doi: 10.1093/bioinformatics/bts226.