Computational prediction of Caenorhabditis box H/ACA snoRNAs using genomic properties of their host genes.

Affiliation

Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA.

Publication information

RNA. 2010 Feb;16(2):290-8. doi: 10.1261/rna.1876210. Epub 2009 Dec 28.

DOI: 10.1261/rna.1876210

PMID: 20038629

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2811658/

Abstract

Identification of small nucleolar RNAs (snoRNAs) in genomic sequences has been challenging due to the relative paucity of sequence features. Many current prediction algorithms rely on detection of snoRNA motifs complementary to target sites in snRNAs and rRNAs. However, recent discovery of snoRNAs without apparent targets requires development of alternative prediction methods. We present an approach that combines rule-based filters and a Bayesian Classifier to identify a class of snoRNAs (H/ACA) without requiring target sequence information. It takes advantage of unique attributes of their genomic organization and improved species-specific motif characterization to predict snoRNAs that may otherwise be difficult to discover. Searches in the genomes of Caenorhabditis elegans and the closely related Caenorhabditis briggsae suggest that our method performs well compared to recent benchmark algorithms. Our results illustrate the benefits of training gene discovery engines on features restricted to particular phylogenetic groups and the utility of incorporating diverse data types in gene prediction.
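The two-stage idea described in the abstract (rule-based filters followed by a Bayesian classifier, with no target-site information) can be illustrated with a minimal sketch. This is not the authors' implementation. The filter uses only the generic H/ACA box rules (an H box matching ANANNA and an ACA box 3 nt from the 3' end), and the feature names, prior, and likelihoods in `TOY_MODEL` are hypothetical values invented for illustration; the abstract does not specify which genomic features or parameters were used.

```python
import math
import re

# Generic H box consensus (ANANNA). The ACA box is expected 3 nt from the 3' end.
H_BOX = re.compile(r"A[ACGT]A[ACGT]{2}A")


def passes_rules(seq: str) -> bool:
    """Rule-based filter: ACA box 3 nt from the 3' end plus an upstream H box."""
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA alphabets
    if len(seq) < 12 or seq[-6:-3] != "ACA":
        return False
    return H_BOX.search(seq[:-6]) is not None


def log_odds(features: dict, model: dict) -> float:
    """Naive Bayes log-odds (snoRNA vs. background) over binary features."""
    score = math.log(model["prior"] / (1.0 - model["prior"]))
    for name, (p_pos, p_neg) in model["likelihoods"].items():
        if features[name]:
            score += math.log(p_pos / p_neg)
        else:
            score += math.log((1.0 - p_pos) / (1.0 - p_neg))
    return score


# Hypothetical parameters: feature names and probabilities are invented
# placeholders, not values taken from the paper.
TOY_MODEL = {
    "prior": 0.2,
    "likelihoods": {
        # feature name: (P(feature | snoRNA), P(feature | background))
        "located_in_intron": (0.9, 0.3),
        "host_gene_highly_expressed": (0.4, 0.05),
    },
}


def classify(seq: str, features: dict, model: dict = TOY_MODEL,
             threshold: float = 0.0) -> bool:
    """Accept a candidate only if it passes the rules and scores above threshold."""
    return passes_rules(seq) and log_odds(features, model) > threshold
```

In this sketch the cheap sequence rules run first, so the probabilistic score is only computed for candidates that already carry both boxes. A real pipeline would also need a secondary-structure check for the two hairpins, which is omitted here.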
