• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

类RNA基序的范围有多大?使用拓扑描述符对RNA图形基序进行聚类分析。

How large is the universe of RNA-like motifs? A clustering analysis of RNA graph motifs using topological descriptors.

作者信息

Wang Rui, Schlick Tamar

机构信息

Simons Center for Computational Physical Chemistry, New York University, New York, New York, United States of America.

Department of Chemistry, New York University, New York, New York, United States of America.

出版信息

PLoS Comput Biol. 2025 Jul 15;21(7):e1013230. doi: 10.1371/journal.pcbi.1013230. eCollection 2025 Jul.

DOI:10.1371/journal.pcbi.1013230
PMID:40663589
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12262848/
Abstract

Identifying novel and functional RNA structures remains a significant challenge in RNA motif design and is crucial for developing RNA-based therapeutics. Here we introduce a computational topology-based approach with unsupervised machine-learning algorithms to estimate the database size and content of RNA-like graph topologies. Specifically, we apply graph theory enumeration to generate all 110,667 possible 2D dual graphs for vertex numbers ranging from 2 to 9. Among them, only 0.11% (121 dual graphs) correspond to approximately 200,000 known RNA atomic fragments/substructures (collected in 2021) using the RNA-as-Graphs (RAG) framework. The remaining 99.89% of the dual graphs may be RNA-like or non-RNA-like. To determine which dual graphs in the 99.89% hypothetical set are more likely to be associated with RNA structures, we apply computational topology descriptors using the Persistent Spectral Graphs (PSG) method to characterize each graph using 19 PSG-based features and use clustering algorithms that partition all possible dual graphs into two clusters. The cluster with the higher percentage of known dual graphs for RNA is defined as the "RNA-like" cluster, while the other is considered as "non-RNA-like". The distance between each dual graph and the center of the RNA-like cluster represents the likelihood of it belonging to RNA structures. From validation, our PSG-based RNA-like cluster includes 97.3% of the 121 known RNA dual graphs, suggesting good performance. Furthermore, 46.017% of the hypothetical RNAs are predicted to be RNA-like. Among the top 15 graphs identified as high-likelihood candidates for novel RNA motifs, 4 were confirmed from the RNA dataset collected in 2022. Significantly, we observe that all the top 15 RNA-like dual graphs can be separated into multiple subgraphs, whereas the top 15 non-RNA-like dual graphs tend not to have any subgraphs (subgraphs preserve pseudoknots and junctions). Moreover, a significant topological difference between top RNA-like and non-RNA-like graphs is evident when comparing their topological features (e.g., Betti-0 and Betti-1 numbers). These findings provide valuable insights into the size of the RNA motif universe and RNA design strategies, offering a novel framework for predicting RNA graph topologies and guiding the discovery of novel RNA motifs, perhaps anti-viral therapeutics by subgraph assembly.

摘要

识别新型且具有功能的RNA结构仍然是RNA基序设计中的一项重大挑战,对于开发基于RNA的疗法至关重要。在此,我们引入一种基于计算拓扑学的方法,并结合无监督机器学习算法,来估计类RNA图形拓扑的数据库大小和内容。具体而言,我们应用图论枚举法生成顶点数从2到9的所有110,667种可能的二维对偶图。其中,使用“RNA作为图形”(RAG)框架,只有0.11%(121个对偶图)对应于约200,000个已知的RNA原子片段/子结构(于2021年收集)。其余99.89%的对偶图可能是类RNA的,也可能不是类RNA的。为了确定99.89%的假设集合中的哪些对偶图更有可能与RNA结构相关联,我们应用基于持久谱图(PSG)方法的计算拓扑描述符,使用19个基于PSG的特征来表征每个图形,并使用聚类算法将所有可能的对偶图划分为两个聚类。已知RNA对偶图比例较高的聚类被定义为“类RNA”聚类,而另一个则被视为“非类RNA”聚类。每个对偶图与类RNA聚类中心之间的距离代表其属于RNA结构的可能性。通过验证,我们基于PSG的类RNA聚类包含121个已知RNA对偶图中的97.3%,表明性能良好。此外,预计46.017%的假设RNA是类RNA的。在被确定为新型RNA基序高可能性候选的前15个图形中,有4个从2022年收集的RNA数据集中得到了证实。值得注意的是,我们观察到所有前15个类RNA对偶图都可以分离为多个子图,而前15个非类RNA对偶图往往没有任何子图(子图保留假结和连接)。此外,在比较它们的拓扑特征(例如,贝蒂数0和贝蒂数1)时,顶级类RNA和非类RNA图形之间存在明显的拓扑差异。这些发现为RNA基序宇宙的大小和RNA设计策略提供了有价值的见解,为预测RNA图形拓扑和指导新型RNA基序的发现(可能通过子图组装开发抗病毒疗法)提供了一个新的框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/c17e115689b3/pcbi.1013230.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/dcbb72a473cd/pcbi.1013230.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/6cc0a16b2720/pcbi.1013230.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/3c73d86af820/pcbi.1013230.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/f26749005a5b/pcbi.1013230.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/95056221f4fc/pcbi.1013230.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/7ae589e35f00/pcbi.1013230.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/c17e115689b3/pcbi.1013230.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/dcbb72a473cd/pcbi.1013230.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/6cc0a16b2720/pcbi.1013230.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/3c73d86af820/pcbi.1013230.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/f26749005a5b/pcbi.1013230.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/95056221f4fc/pcbi.1013230.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/7ae589e35f00/pcbi.1013230.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ed6/12262848/c17e115689b3/pcbi.1013230.g007.jpg

相似文献

1
How large is the universe of RNA-like motifs? A clustering analysis of RNA graph motifs using topological descriptors.类RNA基序的范围有多大?使用拓扑描述符对RNA图形基序进行聚类分析。
PLoS Comput Biol. 2025 Jul 15;21(7):e1013230. doi: 10.1371/journal.pcbi.1013230. eCollection 2025 Jul.
2
How Large is the Universe of RNA-Like Motifs? A Clustering Analysis of RNA Graph Motifs Using Topological Descriptors.类RNA基序的范围有多大?使用拓扑描述符对RNA图基序进行聚类分析。
ArXiv. 2025 Jan 8:arXiv:2501.04258v1.
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Sexual Harassment and Prevention Training性骚扰与预防培训
5
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
6
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
7
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
8
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
9
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

引用本文的文献

1
The emerging roles of long non-coding RNAs in the nervous system.长链非编码RNA在神经系统中的新作用。
Nat Rev Neurosci. 2025 Sep 5. doi: 10.1038/s41583-025-00960-z.

本文引用的文献

1
Heterogeneous and multiple conformational transition pathways between pseudoknots of the SARS-CoV-2 frameshift element.严重急性呼吸综合征冠状病毒2型移码元件假结之间的异质性和多种构象转变途径。
Proc Natl Acad Sci U S A. 2025 Jan 28;122(4):e2417479122. doi: 10.1073/pnas.2417479122. Epub 2025 Jan 24.
2
Abolished frameshifting for predicted structure-stabilizing SARS-CoV-2 mutants: implications to alternative conformations and their statistical structural analyses.消除预测结构稳定的 SARS-CoV-2 突变体的框架移码:对替代构象及其统计结构分析的影响。
RNA. 2024 Oct 16;30(11):1437-1450. doi: 10.1261/rna.080035.124.
3
Analyzing Single Cell RNA Sequencing with Topological Nonnegative Matrix Factorization.
使用拓扑非负矩阵分解分析单细胞RNA测序
J Comput Appl Math. 2024 Aug 1;445. doi: 10.1016/j.cam.2024.115842. Epub 2024 Feb 19.
4
PLPCA: Persistent Laplacian-Enhanced PCA for Microarray Data Analysis.PLPCA:用于微阵列数据分析的持久拉普拉斯增强主成分分析。
J Chem Inf Model. 2024 Apr 8;64(7):2405-2420. doi: 10.1021/acs.jcim.3c01023. Epub 2023 Sep 22.
5
Evolution of coronavirus frameshifting elements: Competing stem networks explain conservation and variability.冠状病毒框架移位元件的进化:竞争的茎环网络解释了保守性和变异性。
Proc Natl Acad Sci U S A. 2023 May 16;120(20):e2221324120. doi: 10.1073/pnas.2221324120. Epub 2023 May 8.
6
Persistent Laplacian projected Omicron BA.4 and BA.5 to become new dominating variants.持续的拉普拉斯投影奥密克戎 BA.4 和 BA.5 成为新的优势变体。
Comput Biol Med. 2022 Dec;151(Pt A):106262. doi: 10.1016/j.compbiomed.2022.106262. Epub 2022 Nov 2.
7
Advances and opportunities in RNA structure experimental determination and computational modeling.RNA 结构实验测定和计算建模的进展和机遇。
Nat Methods. 2022 Oct;19(10):1193-1207. doi: 10.1038/s41592-022-01623-y. Epub 2022 Oct 6.
8
RNA-As-Graphs Motif Atlas-Dual Graph Library of RNA Modules and Viral Frameshifting-Element Applications.RNA 图谱基序双图库——RNA 模块和病毒移码元件应用
Int J Mol Sci. 2022 Aug 17;23(16):9249. doi: 10.3390/ijms23169249.
9
Length-dependent motions of SARS-CoV-2 frameshifting RNA pseudoknot and alternative conformations suggest avenues for frameshifting suppression.SARS-CoV-2 框架移位 RNA 假结和变构的长度依赖性运动提示了框架移位抑制的途径。
Nat Commun. 2022 Jul 25;13(1):4284. doi: 10.1038/s41467-022-31353-w.
10
Deep Learning in RNA Structure Studies.RNA结构研究中的深度学习
Front Mol Biosci. 2022 May 23;9:869601. doi: 10.3389/fmolb.2022.869601. eCollection 2022.