• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用拓扑指纹对RNA结构进行准确分类

Accurate Classification of RNA Structures Using Topological Fingerprints.

作者信息

Huang Jiajie, Li Kejie, Gribskov Michael

机构信息

Department of Biological Sciences, Purdue University, West Lafayette, Indiana, United States of America.

Life Sciences Solutions Group, Thermo Fisher Scientific, South San Francisco, California, United States of America.

出版信息

PLoS One. 2016 Oct 18;11(10):e0164726. doi: 10.1371/journal.pone.0164726. eCollection 2016.

DOI:10.1371/journal.pone.0164726
PMID:27755571
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5068708/
Abstract

While RNAs are well known to possess complex structures, functionally similar RNAs often have little sequence similarity. While the exact size and spacing of base-paired regions vary, functionally similar RNAs have pronounced similarity in the arrangement, or topology, of base-paired stems. Furthermore, predicted RNA structures often lack pseudoknots (a crucial aspect of biological activity), and are only partially correct, or incomplete. A topological approach addresses all of these difficulties. In this work we describe each RNA structure as a graph that can be converted to a topological spectrum (RNA fingerprint). The set of subgraphs in an RNA structure, its RNA fingerprint, can be compared with the fingerprints of other RNA structures to identify and correctly classify functionally related RNAs. Topologically similar RNAs can be identified even when a large fraction, up to 30%, of the stems are omitted, indicating that highly accurate structures are not necessary. We investigate the performance of the RNA fingerprint approach on a set of eight highly curated RNA families, with diverse sizes and functions, containing pseudoknots, and with little sequence similarity-an especially difficult test set. In spite of the difficult test set, the RNA fingerprint approach is very successful (ROC AUC > 0.95). Due to the inclusion of pseudoknots, the RNA fingerprint approach both covers a wider range of possible structures than methods based only on secondary structure, and its tolerance for incomplete structures suggests that it can be applied even to predicted structures. Source code is freely available at https://github.rcac.purdue.edu/mgribsko/XIOS_RNA_fingerprint.

摘要

虽然众所周知RNA具有复杂的结构,但功能相似的RNA往往在序列上相似度很低。尽管碱基配对区域的确切大小和间距各不相同,但功能相似的RNA在碱基配对茎的排列或拓扑结构上具有明显的相似性。此外,预测的RNA结构通常缺乏假结(生物活性的关键方面),并且只是部分正确或不完整。一种拓扑方法解决了所有这些难题。在这项工作中,我们将每个RNA结构描述为一个可以转换为拓扑谱(RNA指纹)的图。RNA结构中的子图集合,即其RNA指纹,可以与其他RNA结构的指纹进行比较,以识别功能相关的RNA并对其进行正确分类。即使省略高达30%的茎,也能识别出拓扑相似的RNA,这表明不需要非常精确的结构。我们在一组八个精心挑选的RNA家族上研究了RNA指纹方法的性能,这些家族大小和功能各异,包含假结,且序列相似度很低——这是一个特别具有挑战性的测试集。尽管测试集难度很大,但RNA指纹方法非常成功(ROC曲线下面积>0.95)。由于包含了假结,RNA指纹方法比仅基于二级结构的方法涵盖了更广泛的可能结构,并且其对不完整结构的耐受性表明它甚至可以应用于预测结构。源代码可在https://github.rcac.purdue.edu/mgribsko/XIOS_RNA_fingerprint上免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/ee817d7983a0/pone.0164726.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/45721910c93b/pone.0164726.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/5bc6ede74f30/pone.0164726.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/f9e13a18f6fc/pone.0164726.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/60d60f6645ba/pone.0164726.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/27259e96a0f4/pone.0164726.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/ee817d7983a0/pone.0164726.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/45721910c93b/pone.0164726.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/5bc6ede74f30/pone.0164726.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/f9e13a18f6fc/pone.0164726.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/60d60f6645ba/pone.0164726.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/27259e96a0f4/pone.0164726.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29ed/5068708/ee817d7983a0/pone.0164726.g006.jpg

相似文献

1
Accurate Classification of RNA Structures Using Topological Fingerprints.使用拓扑指纹对RNA结构进行准确分类
PLoS One. 2016 Oct 18;11(10):e0164726. doi: 10.1371/journal.pone.0164726. eCollection 2016.
2
Topology and prediction of RNA pseudoknots.RNA 假结的拓扑结构和预测。
Bioinformatics. 2011 Apr 15;27(8):1076-85. doi: 10.1093/bioinformatics/btr090. Epub 2011 Feb 17.
3
A graph theoretical approach for predicting common RNA secondary structure motifs including pseudoknots in unaligned sequences.一种用于预测未比对序列中包括假结在内的常见RNA二级结构基序的图论方法。
Bioinformatics. 2004 Jul 10;20(10):1591-602. doi: 10.1093/bioinformatics/bth131. Epub 2004 Feb 12.
4
Topological classification of RNA structures.RNA结构的拓扑分类
J Mol Biol. 2008 Jun 13;379(4):900-11. doi: 10.1016/j.jmb.2008.04.033. Epub 2008 Apr 18.
5
ILM: a web server for predicting RNA secondary structures with pseudoknots.ILM:一个用于预测带假结的RNA二级结构的网络服务器。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W146-9. doi: 10.1093/nar/gkh444.
6
Exploring the repertoire of RNA secondary motifs using graph theory; implications for RNA design.利用图论探索RNA二级基序库;对RNA设计的启示。
Nucleic Acids Res. 2003 Jun 1;31(11):2926-43. doi: 10.1093/nar/gkg365.
7
Design, implementation and evaluation of a practical pseudoknot folding algorithm based on thermodynamics.基于热力学的实用假结折叠算法的设计、实现与评估。
BMC Bioinformatics. 2004 Aug 4;5:104. doi: 10.1186/1471-2105-5-104.
8
Predicting common foldings of homologous RNAs.预测同源RNA的常见折叠方式。
J Biomol Struct Dyn. 1991 Apr;8(5):1027-44. doi: 10.1080/07391102.1991.10507863.
9
Conformational features of topologically classified RNA secondary structures.拓扑分类 RNA 二级结构的构象特征。
PLoS One. 2012;7(7):e39907. doi: 10.1371/journal.pone.0039907. Epub 2012 Jul 5.
10
Candidates for novel RNA topologies.新型RNA拓扑结构的候选者。
J Mol Biol. 2004 Aug 27;341(5):1129-44. doi: 10.1016/j.jmb.2004.06.054.

引用本文的文献

1
Analysis of lncRNAs in (Tarwi) and Their Potential Role in Drought Response.(塔维)中lncRNAs的分析及其在干旱响应中的潜在作用。
Noncoding RNA. 2023 Aug 23;9(5):48. doi: 10.3390/ncrna9050048.
2
Arm-less mitochondrial tRNAs conserved for over 30 millions of years in spiders.蜘蛛中超过 3000 万年保守的无臂线粒体 tRNA。
BMC Genomics. 2019 Aug 23;20(1):665. doi: 10.1186/s12864-019-6026-1.
3
IRESpy: an XGBoost model for prediction of internal ribosome entry sites.IRESpy:一种用于预测内部核糖体进入位点的 XGBoost 模型。

本文引用的文献

1
Rfam 12.0: updates to the RNA families database.Rfam 12.0:RNA家族数据库的更新
Nucleic Acids Res. 2015 Jan;43(Database issue):D130-7. doi: 10.1093/nar/gku1063. Epub 2014 Nov 11.
2
The RNA shapes studio.RNA形状工作室
Bioinformatics. 2015 Feb 1;31(3):423-5. doi: 10.1093/bioinformatics/btu649. Epub 2014 Oct 1.
3
Enhancement of accuracy and efficiency for RNA secondary structure prediction by sequence segmentation and MapReduce.通过序列分割和MapReduce提高RNA二级结构预测的准确性和效率。
BMC Bioinformatics. 2019 Jul 30;20(1):409. doi: 10.1186/s12859-019-2999-7.
4
Dual Graph Partitioning Highlights a Small Group of Pseudoknot-Containing RNA Submotifs.双图划分突出了一小部分含有假结的RNA亚基序。
Genes (Basel). 2018 Jul 25;9(8):371. doi: 10.3390/genes9080371.
BMC Struct Biol. 2013;13 Suppl 1(Suppl 1):S3. doi: 10.1186/1472-6807-13-S1-S3. Epub 2013 Nov 8.
4
Evolutionary conservation of long non-coding RNAs; sequence, structure, function.长链非编码RNA的进化保守性;序列、结构与功能。
Biochim Biophys Acta. 2014 Mar;1840(3):1063-71. doi: 10.1016/j.bbagen.2013.10.035. Epub 2013 Oct 27.
5
Secondary structure and domain architecture of the 23S and 5S rRNAs.23S 和 5S rRNA 的二级结构和结构域结构。
Nucleic Acids Res. 2013 Aug;41(15):7522-35. doi: 10.1093/nar/gkt513. Epub 2013 Jun 14.
6
Nuclear group I introns in self-splicing and beyond.核 I 组内含子的自我剪接及其应用。
Mob DNA. 2013 Jun 5;4(1):17. doi: 10.1186/1759-8753-4-17.
7
On topological indices for small RNA graphs.关于小 RNA 图的拓扑指标。
Comput Biol Chem. 2012 Dec;41:35-40. doi: 10.1016/j.compbiolchem.2012.10.004. Epub 2012 Oct 23.
8
RNA structure prediction: an overview of methods.RNA结构预测:方法概述
Methods Mol Biol. 2012;905:99-122. doi: 10.1007/978-1-61779-949-5_8.
9
GraphClust: alignment-free structural clustering of local RNA secondary structures.GraphClust:无比对的局部 RNA 二级结构的结构聚类。
Bioinformatics. 2012 Jun 15;28(12):i224-32. doi: 10.1093/bioinformatics/bts224.
10
Identifying complete RNA structural ensembles including pseudoknots.鉴定包括假结在内的完整 RNA 结构整体。
RNA Biol. 2012 Feb;9(2):187-99. doi: 10.4161/rna.18386. Epub 2012 Feb 1.