• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GraphClust:无比对的局部 RNA 二级结构的结构聚类。

GraphClust: alignment-free structural clustering of local RNA secondary structures.

机构信息

Bioinformatics Group, Department of Computer Science, University of Freiburg,Georges-Köhler-Allee 106, D-79110 Freiburg, Germany.

出版信息

Bioinformatics. 2012 Jun 15;28(12):i224-32. doi: 10.1093/bioinformatics/bts224.

DOI:10.1093/bioinformatics/bts224
PMID:22689765
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3371856/
Abstract

MOTIVATION

Clustering according to sequence-structure similarity has now become a generally accepted scheme for ncRNA annotation. Its application to complete genomic sequences as well as whole transcriptomes is therefore desirable but hindered by extremely high computational costs.

RESULTS

We present a novel linear-time, alignment-free method for comparing and clustering RNAs according to sequence and structure. The approach scales to datasets of hundreds of thousands of sequences. The quality of the retrieved clusters has been benchmarked against known ncRNA datasets and is comparable to state-of-the-art sequence-structure methods although achieving speedups of several orders of magnitude. A selection of applications aiming at the detection of novel structural ncRNAs are presented. Exemplarily, we predicted local structural elements specific to lincRNAs likely functionally associating involved transcripts to vital processes of the human nervous system. In total, we predicted 349 local structural RNA elements.

AVAILABILITY

The GraphClust pipeline is available on request.

摘要

动机

根据序列-结构相似性进行聚类现在已经成为 ncRNA 注释的一种普遍接受的方案。因此,将其应用于完整的基因组序列和整个转录组是可取的,但受到极高的计算成本的阻碍。

结果

我们提出了一种新颖的线性时间、无比对的方法,用于根据序列和结构比较和聚类 RNA。该方法可扩展到数十万条序列的数据集。所检索的聚类的质量已经针对已知的 ncRNA 数据集进行了基准测试,与最先进的序列-结构方法相当,尽管实现了几个数量级的加速。还提出了一系列旨在检测新型结构 ncRNA 的应用。例如,我们预测了 lincRNA 特有的局部结构元素,这些元素可能将涉及的转录本与人类神经系统的重要过程联系起来。总共预测了 349 个局部结构 RNA 元件。

可用性

GraphClust 管道可根据要求提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ce9/3371856/d269c5542389/bts224f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ce9/3371856/31fb2e29dab4/bts224f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ce9/3371856/d269c5542389/bts224f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ce9/3371856/31fb2e29dab4/bts224f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ce9/3371856/d269c5542389/bts224f2.jpg

相似文献

1
GraphClust: alignment-free structural clustering of local RNA secondary structures.GraphClust:无比对的局部 RNA 二级结构的结构聚类。
Bioinformatics. 2012 Jun 15;28(12):i224-32. doi: 10.1093/bioinformatics/bts224.
2
Fast and accurate clustering of noncoding RNAs using ensembles of sequence alignments and secondary structures.利用序列比对和二级结构的集合进行非编码 RNA 的快速准确聚类。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S48. doi: 10.1186/1471-2105-12-S1-S48.
3
A local multiple alignment method for detection of non-coding RNA sequences.一种用于检测非编码RNA序列的局部多重比对方法。
Bioinformatics. 2009 Jun 15;25(12):1498-505. doi: 10.1093/bioinformatics/btp261. Epub 2009 Apr 17.
4
Search for 5'-leader regulatory RNA structures based on gene annotation aided by the RiboGap database.借助RiboGap数据库,基于基因注释搜索5'-前导调控RNA结构。
Methods. 2017 Mar 15;117:3-13. doi: 10.1016/j.ymeth.2017.02.009. Epub 2017 Mar 6.
5
Structure-based whole-genome realignment reveals many novel noncoding RNAs.基于结构的全基因组重排揭示了许多新的非编码 RNA。
Genome Res. 2013 Jun;23(6):1018-27. doi: 10.1101/gr.137091.111. Epub 2013 Jan 7.
6
NoFold: RNA structure clustering without folding or alignment.NoFold:无需折叠或比对的RNA结构聚类
RNA. 2014 Nov;20(11):1671-83. doi: 10.1261/rna.041913.113. Epub 2014 Sep 18.
7
Finding consensus stable local optimal structures for aligned RNA sequences and its application to discovering riboswitch elements.寻找比对RNA序列的共识稳定局部最优结构及其在发现核糖开关元件中的应用。
Int J Bioinform Res Appl. 2014;10(4-5):498-518. doi: 10.1504/IJBRA.2014.062997.
8
Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change.基于预测的二级结构形成自由能变化检测非编码RNA。
BMC Bioinformatics. 2006 Mar 27;7:173. doi: 10.1186/1471-2105-7-173.
9
Multiple structural alignment and clustering of RNA sequences.RNA序列的多重结构比对与聚类
Bioinformatics. 2007 Apr 15;23(8):926-32. doi: 10.1093/bioinformatics/btm049. Epub 2007 Feb 25.
10
Chain-RNA: a comparative ncRNA search tool based on the two-dimensional chain algorithm.链 RNA:一种基于二维链算法的比较 ncRNA 搜索工具。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):274-85. doi: 10.1109/TCBB.2012.137.

引用本文的文献

1
Robust RNA secondary structure prediction with a mixture of deep learning and physics-based experts.结合深度学习和基于物理的专家方法进行稳健的RNA二级结构预测。
Biol Methods Protoc. 2025 Jan 6;10(1):bpae097. doi: 10.1093/biomethods/bpae097. eCollection 2025.
2
Clusters of mammalian conserved RNA structures in UTRs associate with RBP binding sites.非翻译区中哺乳动物保守RNA结构簇与RNA结合蛋白结合位点相关联。
NAR Genom Bioinform. 2024 Aug 9;6(3):lqae089. doi: 10.1093/nargab/lqae089. eCollection 2024 Sep.
3
Comparative RNA Genomics.比较 RNA 基因组学。

本文引用的文献

1
LocARNA-P: accurate boundary prediction and improved detection of structural RNAs.LocARNA-P:准确的边界预测和结构 RNA 的改进检测。
RNA. 2012 May;18(5):900-14. doi: 10.1261/rna.029041.111. Epub 2012 Mar 26.
2
Systematic identification of long noncoding RNAs expressed during zebrafish embryogenesis.系统鉴定斑马鱼胚胎发生过程中表达的长非编码 RNA。
Genome Res. 2012 Mar;22(3):577-91. doi: 10.1101/gr.133009.111. Epub 2011 Nov 22.
3
New families of human regulatory RNA structures identified by comparative analysis of vertebrate genomes.
Methods Mol Biol. 2024;2802:347-393. doi: 10.1007/978-1-0716-3838-5_12.
4
Recent trends in RNA informatics: a review of machine learning and deep learning for RNA secondary structure prediction and RNA drug discovery.RNA 信息学的最新趋势:机器学习和深度学习在 RNA 二级结构预测和 RNA 药物发现中的应用综述。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad186.
5
RNAsmc: A integrated tool for comparing RNA secondary structure and evaluating allosteric effects.RNAsmc:一种用于比较RNA二级结构和评估变构效应的综合工具。
Comput Struct Biotechnol J. 2023 Jan 9;21:965-973. doi: 10.1016/j.csbj.2023.01.007. eCollection 2023.
6
Structure-based screening for functional non-coding RNAs in fission yeast identifies a factor repressing untimely initiation of sexual differentiation.基于结构的裂殖酵母功能非编码 RNA 筛选鉴定出一种抑制性分化过早起始的因子。
Nucleic Acids Res. 2022 Oct 28;50(19):11229-11242. doi: 10.1093/nar/gkac825.
7
Informative RNA base embedding for RNA structural alignment and clustering by deep representation learning.通过深度表示学习进行RNA结构比对和聚类的信息性RNA碱基嵌入
NAR Genom Bioinform. 2022 Feb 22;4(1):lqac012. doi: 10.1093/nargab/lqac012. eCollection 2022 Mar.
8
Prediction and analysis of functional RNA structures within the integrative genomics viewer.整合基因组浏览器内功能性RNA结构的预测与分析。
NAR Genom Bioinform. 2022 Jan 14;4(1):lqab127. doi: 10.1093/nargab/lqab127. eCollection 2022 Mar.
9
Deep forest ensemble learning for classification of alignments of non-coding RNA sequences based on multi-view structure representations.基于多视图结构表示的非编码 RNA 序列比对分类的深度森林集成学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa354.
10
A comprehensive survey of integron-associated genes present in metagenomes.对宏基因组中存在的整合子相关基因进行全面调查。
BMC Genomics. 2020 Jul 20;21(1):495. doi: 10.1186/s12864-020-06830-5.
通过比较分析脊椎动物基因组鉴定出的人类调控 RNA 结构的新家族。
Genome Res. 2011 Nov;21(11):1929-43. doi: 10.1101/gr.112516.110. Epub 2011 Oct 12.
4
Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses.整合注释人类大型长非编码 RNA 揭示了其全局特征和特定亚类。
Genes Dev. 2011 Sep 15;25(18):1915-27. doi: 10.1101/gad.17446611. Epub 2011 Sep 2.
5
The reality of pervasive transcription.普遍转录的现实。
PLoS Biol. 2011 Jul;9(7):e1000625; discussion e1001102. doi: 10.1371/journal.pbio.1000625. Epub 2011 Jul 12.
6
Fast and accurate clustering of noncoding RNAs using ensembles of sequence alignments and secondary structures.利用序列比对和二级结构的集合进行非编码 RNA 的快速准确聚类。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S48. doi: 10.1186/1471-2105-12-S1-S48.
7
Rfam: Wikipedia, clans and the "decimal" release.Rfam:维基百科、家族及“十进制”版本。
Nucleic Acids Res. 2011 Jan;39(Database issue):D141-5. doi: 10.1093/nar/gkq1129. Epub 2010 Nov 9.
8
Long noncoding RNA genes: conservation of sequence and brain expression among diverse amniotes.长非编码 RNA 基因:不同羊膜动物间序列和大脑表达的保守性。
Genome Biol. 2010;11(7):R72. doi: 10.1186/gb-2010-11-7-r72. Epub 2010 Jul 12.
9
Long non-coding RNAs in nervous system function and disease.长非编码 RNA 在神经系统功能和疾病中的作用。
Brain Res. 2010 Jun 18;1338:20-35. doi: 10.1016/j.brainres.2010.03.110. Epub 2010 Apr 7.
10
Comparative genomics reveals 104 candidate structured RNAs from bacteria, archaea, and their metagenomes.比较基因组学揭示了来自细菌、古菌及其宏基因组的 104 个候选结构 RNA。
Genome Biol. 2010;11(3):R31. doi: 10.1186/gb-2010-11-3-r31. Epub 2010 Mar 15.