• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DotAligner:RNA 结构基序的识别和聚类。

DotAligner: identification and clustering of RNA structure motifs.

机构信息

RNA Biology and Plasticity Group, Garvan Institute of Medical Research, 384 Victoria Street, Sydney, NSW 2010, Australia.

St Vincent's Clinical School, Faculty of Medicine, UNSW Australia, Sydney, NSW 2010, Australia.

出版信息

Genome Biol. 2017 Dec 28;18(1):244. doi: 10.1186/s13059-017-1371-3.

DOI:10.1186/s13059-017-1371-3
PMID:29284541
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5747123/
Abstract

The diversity of processed transcripts in eukaryotic genomes poses a challenge for the classification of their biological functions. Sparse sequence conservation in non-coding sequences and the unreliable nature of RNA structure predictions further exacerbate this conundrum. Here, we describe a computational method, DotAligner, for the unsupervised discovery and classification of homologous RNA structure motifs from a set of sequences of interest. Our approach outperforms comparable algorithms at clustering known RNA structure families, both in speed and accuracy. It identifies clusters of known and novel structure motifs from ENCODE immunoprecipitation data for 44 RNA-binding proteins.

摘要

真核基因组中加工转录本的多样性给它们的生物功能分类带来了挑战。非编码序列中稀疏的序列保守性和 RNA 结构预测的不可靠性进一步加剧了这一难题。在这里,我们描述了一种计算方法 DotAligner,用于从一组感兴趣的序列中无监督地发现和分类同源 RNA 结构基序。我们的方法在聚类已知的 RNA 结构家族方面优于可比的算法,无论是在速度还是准确性方面。它从 44 种 RNA 结合蛋白的 ENCODE 免疫沉淀数据中识别出已知和新型结构基序的聚类。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55c2/5747123/cbe35245e220/13059_2017_1371_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55c2/5747123/c49bc97c1a32/13059_2017_1371_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55c2/5747123/bf8b3175fa2d/13059_2017_1371_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55c2/5747123/9d6fbe564057/13059_2017_1371_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55c2/5747123/4892e2f27608/13059_2017_1371_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55c2/5747123/cbe35245e220/13059_2017_1371_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55c2/5747123/c49bc97c1a32/13059_2017_1371_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55c2/5747123/bf8b3175fa2d/13059_2017_1371_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55c2/5747123/9d6fbe564057/13059_2017_1371_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55c2/5747123/4892e2f27608/13059_2017_1371_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55c2/5747123/cbe35245e220/13059_2017_1371_Fig5_HTML.jpg

相似文献

1
DotAligner: identification and clustering of RNA structure motifs.DotAligner:RNA 结构基序的识别和聚类。
Genome Biol. 2017 Dec 28;18(1):244. doi: 10.1186/s13059-017-1371-3.
2
ssHMM: extracting intuitive sequence-structure motifs from high-throughput RNA-binding protein data.单链隐马尔可夫模型:从高通量RNA结合蛋白数据中提取直观的序列结构基序
Nucleic Acids Res. 2017 Nov 2;45(19):11004-11018. doi: 10.1093/nar/gkx756.
3
De novo discovery of structured ncRNA motifs in genomic sequences.基因组序列中结构化非编码RNA基序的从头发现。
Methods Mol Biol. 2014;1097:303-18. doi: 10.1007/978-1-62703-709-9_15.
4
Promoter-based identification of novel non-coding RNAs reveals the presence of dicistronic snoRNA-miRNA genes in Arabidopsis thaliana.基于启动子的新型非编码RNA鉴定揭示了拟南芥中双顺反子snoRNA-miRNA基因的存在。
BMC Genomics. 2015 Nov 25;16:1009. doi: 10.1186/s12864-015-2221-x.
5
Predicting candidate genomic sequences that correspond to synthetic functional RNA motifs.预测与合成功能性RNA基序相对应的候选基因组序列。
Nucleic Acids Res. 2005 Oct 27;33(18):6057-69. doi: 10.1093/nar/gki911. Print 2005.
6
GraphClust2: Annotation and discovery of structured RNAs with scalable and accessible integrative clustering.GraphClust2:具有可扩展和可访问的集成聚类功能的结构化 RNA 的注释和发现。
Gigascience. 2019 Dec 1;8(12). doi: 10.1093/gigascience/giz150.
7
Dynamic programming algorithms for RNA structure prediction with binding sites.用于带有结合位点的RNA结构预测的动态规划算法。
Pac Symp Biocomput. 2010:98-107. doi: 10.1142/9789814295291_0012.
8
RNA secondary structure prediction from multi-aligned sequences.基于多序列比对的RNA二级结构预测。
Methods Mol Biol. 2015;1269:17-38. doi: 10.1007/978-1-4939-2291-8_2.
9
An Ariadne's thread to the identification and annotation of noncoding RNAs in eukaryotes.一条用于鉴定和注释真核生物中非编码RNA的线索。
Brief Bioinform. 2009 Sep;10(5):475-89. doi: 10.1093/bib/bbp022. Epub 2009 Apr 21.
10
PSSMTS: position specific scoring matrices on tree structures.PSSMTS:树形结构上的位置特异性评分矩阵。
J Math Biol. 2008 Jan;56(1-2):201-14. doi: 10.1007/s00285-007-0108-4. Epub 2007 Jul 7.

引用本文的文献

1
ECSFinder: optimized prediction of evolutionarily conserved RNA secondary structures from genome sequences.ECSFinder:从基因组序列中对进化保守RNA二级结构进行优化预测。
Nucleic Acids Res. 2025 Aug 11;53(15). doi: 10.1093/nar/gkaf780.
2
Clusters of mammalian conserved RNA structures in UTRs associate with RBP binding sites.非翻译区中哺乳动物保守RNA结构簇与RNA结合蛋白结合位点相关联。
NAR Genom Bioinform. 2024 Aug 9;6(3):lqae089. doi: 10.1093/nargab/lqae089. eCollection 2024 Sep.
3
Comparative RNA Genomics.比较 RNA 基因组学。

本文引用的文献

1
The identification and functional annotation of RNA structures conserved in vertebrates.脊椎动物保守 RNA 结构的鉴定和功能注释。
Genome Res. 2017 Aug;27(8):1371-1383. doi: 10.1101/gr.208652.116. Epub 2017 May 9.
2
RNAscClust: clustering RNA sequences using structure conservation and graph based motifs.RNAscClust:使用结构保守性和基于图的基元对 RNA 序列进行聚类。
Bioinformatics. 2017 Jul 15;33(14):2089-2096. doi: 10.1093/bioinformatics/btx114.
3
A statistical test for conserved RNA structure shows lack of evidence for structure in lncRNAs.
Methods Mol Biol. 2024;2802:347-393. doi: 10.1007/978-1-0716-3838-5_12.
4
Long non-coding RNAs: definitions, functions, challenges and recommendations.长非编码 RNA:定义、功能、挑战与建议。
Nat Rev Mol Cell Biol. 2023 Jun;24(6):430-447. doi: 10.1038/s41580-022-00566-8. Epub 2023 Jan 3.
5
Structure-based screening for functional non-coding RNAs in fission yeast identifies a factor repressing untimely initiation of sexual differentiation.基于结构的裂殖酵母功能非编码 RNA 筛选鉴定出一种抑制性分化过早起始的因子。
Nucleic Acids Res. 2022 Oct 28;50(19):11229-11242. doi: 10.1093/nar/gkac825.
6
Deep forest ensemble learning for classification of alignments of non-coding RNA sequences based on multi-view structure representations.基于多视图结构表示的非编码 RNA 序列比对分类的深度森林集成学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa354.
7
A proposed reverse transcription mechanism for (CAG)n and similar expandable repeats that cause neurological and other diseases.一种针对导致神经和其他疾病的(CAG)n及类似可扩展重复序列的逆转录机制假说。
Heliyon. 2020 Feb 26;6(2):e03258. doi: 10.1016/j.heliyon.2020.e03258. eCollection 2020 Feb.
8
RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment.RNAmountAlign:用于局部、全局、半全局两两和多 RNA 序列/结构比对的高效软件。
PLoS One. 2020 Jan 24;15(1):e0227177. doi: 10.1371/journal.pone.0227177. eCollection 2020.
9
A systematic review of the application of machine learning in the detection and classification of transposable elements.机器学习在转座元件检测与分类中的应用的系统综述。
PeerJ. 2019 Dec 18;7:e8311. doi: 10.7717/peerj.8311. eCollection 2019.
10
Multiple Sequence Alignments Enhance Boundary Definition of RNA Structures.多序列比对增强RNA结构的边界定义。
Genes (Basel). 2018 Dec 4;9(12):604. doi: 10.3390/genes9120604.
一项针对保守RNA结构的统计测试表明,缺乏lncRNA中存在结构的证据。
Nat Methods. 2017 Jan;14(1):45-48. doi: 10.1038/nmeth.4066. Epub 2016 Nov 7.
4
Long non-coding RNAs: spatial amplifiers that control nuclear structure and gene expression.长非编码 RNA:控制核结构和基因表达的空间放大器。
Nat Rev Mol Cell Biol. 2016 Dec;17(12):756-770. doi: 10.1038/nrm.2016.126. Epub 2016 Oct 26.
5
RNA Duplex Map in Living Cells Reveals Higher-Order Transcriptome Structure.活细胞中的RNA双链体图谱揭示了高阶转录组结构。
Cell. 2016 May 19;165(5):1267-1279. doi: 10.1016/j.cell.2016.04.028. Epub 2016 May 12.
6
Robust transcriptome-wide discovery of RNA-binding protein binding sites with enhanced CLIP (eCLIP).通过增强型交联免疫沉淀(eCLIP)在全转录组范围内稳健地发现RNA结合蛋白结合位点。
Nat Methods. 2016 Jun;13(6):508-14. doi: 10.1038/nmeth.3810. Epub 2016 Mar 28.
7
Foldalign 2.5: multithreaded implementation for pairwise structural RNA alignment.Foldalign 2.5:用于成对结构RNA比对的多线程实现。
Bioinformatics. 2016 Apr 15;32(8):1238-40. doi: 10.1093/bioinformatics/btv748. Epub 2015 Dec 24.
8
The ins and outs of lncRNA structure: How, why and what comes next?长链非编码RNA结构的来龙去脉:方式、原因及后续发展?
Biochim Biophys Acta. 2016 Jan;1859(1):46-58. doi: 10.1016/j.bbagrm.2015.08.009. Epub 2015 Aug 29.
9
Architectural RNAs (arcRNAs): A class of long noncoding RNAs that function as the scaffold of nuclear bodies.结构RNA(arcRNA):一类作为核体支架发挥作用的长链非编码RNA。
Biochim Biophys Acta. 2016 Jan;1859(1):139-46. doi: 10.1016/j.bbagrm.2015.05.007. Epub 2015 Jun 3.
10
Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species.通过对17个物种转录组的直接比较得出的长链非编码RNA进化原理。
Cell Rep. 2015 May 19;11(7):1110-22. doi: 10.1016/j.celrep.2015.04.023. Epub 2015 May 7.