Genome-wide evolutionary analysis of the noncoding RNA genes and noncoding DNA of Paramecium tetraurelia.

Authors

Chen Chun-Long, Zhou Hui, Liao Jian-You, Qu Liang-Hu, Amar Laurence

Affiliation

Institut de Biologie Animale Intégrative et Cellulaire, Université Paris Sud, Orsay, France

Publication details

RNA. 2009 Apr;15(4):503-14. doi: 10.1261/rna.1306009. Epub 2009 Feb 13.

DOI:10.1261/rna.1306009

PMID:19218550

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2661823/

Abstract

The compact genome of the unicellular eukaryote Paramecium tetraurelia contains noncoding DNA (ncDNA) distributed into >39,000 intergenic sequences and >90,000 introns, which average 390 base pairs (bp) and 25 bp, respectively. Here we analyzed the molecular features of the ncRNA genes, introns, and intergenic sequences of this genome. We mainly used computational programs and comparative genomics, the latter made possible because the P. tetraurelia genome formed through whole-genome duplications (WGDs). We characterized 417 putative 5S rRNA, snRNA, snoRNA, SRP RNA, and tRNA genes, 415 of which map within intergenic sequences and two within introns. The evolution of these ncRNA genes appears to have mainly involved purifying selection and gene deletion. We then compared the introns that interrupt the protein-coding gene duplicates that arose from the recent WGD and identified a population of a few thousand introns that evolved under the most stringent constraints (>95% identity). We also showed that low nucleotide substitution levels characterize the 50 and 80-115 base pairs flanking, respectively, the stop and start codons of the protein-coding genes. Lower substitution levels mark the base pairs flanking the highly transcribed genes, or the start codons of genes belonging to sets with a high number of WGD-related sequences. Finally, adjacent to protein-coding genes, we characterized 32 DNA motifs able to encode stable and evolutionarily conserved RNA secondary structures, which define putative expression-controlling elements. Fourteen DNA motifs with similar properties map far from protein-coding genes and may encode regulatory ncRNAs.
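The ">95% identity" criterion for introns of WGD-derived gene duplicates can be illustrated with a minimal sketch. The abstract does not say how identity was computed, so the definition below (matches divided by aligned columns where neither sequence has a gap) is an assumption, as are the function names `percent_identity` and `highly_conserved` and the toy sequences, which are invented for illustration and are not data from the paper.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent identity between two pre-aligned sequences.

    Assumption: columns where either sequence has a gap ('-') are skipped,
    and identity = matches / compared columns * 100.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue  # ignore gapped columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def highly_conserved(pairs: dict, threshold: float = 95.0) -> list:
    """Return names of intron pairs whose identity exceeds the threshold."""
    return [
        name
        for name, (a, b) in pairs.items()
        if percent_identity(a, b) > threshold
    ]


# Hypothetical 25-nt intron pairs from duplicated genes (invented sequences).
toy_pairs = {
    "pair_identical": ("GTAAATTTTATTAATAAATTTATAG",
                       "GTAAATTTTATTAATAAATTTATAG"),
    "pair_diverged":  ("GTAAATTTTATTAATAAATTTATAG",
                       "GTAAGTTTTATCAATAAATTTATAG"),
}

if __name__ == "__main__":
    for name, (a, b) in toy_pairs.items():
        print(name, round(percent_identity(a, b), 1))
    print(highly_conserved(toy_pairs))
```

With introns averaging 25 bp, a single substitution already lowers identity to 96%, and two substitutions drop a pair below the 95% cutoff, so under this definition the threshold effectively selects pairs with at most one difference.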

Similar articles

Whole-genome duplications contributed to the expansion of cytochrome b5 genes in Paramecium tetraurelia.

Genet Mol Res. 2013 Jun 13;12(2):1882-96. doi: 10.4238/2013.January.9.1.

The actin multigene family of Paramecium tetraurelia.

BMC Genomics. 2007 Mar 28;8:82. doi: 10.1186/1471-2164-8-82.

Massive colonization of protein-coding exons by selfish genetic elements in Paramecium germline genomes.

PLoS Biol. 2021 Jul 29;19(7):e3001309. doi: 10.1371/journal.pbio.3001309. eCollection 2021 Jul.

Gene expression in a paleopolyploid: a transcriptome resource for the ciliate Paramecium tetraurelia.

BMC Genomics. 2010 Oct 8;11:547. doi: 10.1186/1471-2164-11-547.

Global trends of whole-genome duplications revealed by the ciliate Paramecium tetraurelia.

Nature. 2006 Nov 9;444(7116):171-8. doi: 10.1038/nature05230. Epub 2006 Nov 1.

Extremely short 20-33 nucleotide introns are the standard length in Paramecium tetraurelia.

Nucleic Acids Res. 1994 Apr 11;22(7):1221-5. doi: 10.1093/nar/22.7.1221.

Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva.

BMC Genomics. 2008 Dec 3;9:582. doi: 10.1186/1471-2164-9-582.

High coding density on the largest Paramecium tetraurelia somatic chromosome.

Curr Biol. 2004 Aug 10;14(15):1397-404. doi: 10.1016/j.cub.2004.07.029.

The splicing of tiny introns of Paramecium is controlled by MAGO.

Gene. 2018 Jul 15;663:101-109. doi: 10.1016/j.gene.2018.04.007. Epub 2018 Apr 10.

Cited by

Contrasting outcomes of genome reduction in mikrocytids and microsporidians.

BMC Biol. 2023 Jun 6;21(1):137. doi: 10.1186/s12915-023-01635-w.

Improved methods and resources for paramecium genomics: transcription units, gene annotation and gene expression.

BMC Genomics. 2017 Jun 26;18(1):483. doi: 10.1186/s12864-017-3887-z.

"Hypothesis for the modern RNA world": a pervasive non-coding RNA-based genetic regulation is a prerequisite for the emergence of multicellular complexity.

Orig Life Evol Biosph. 2011 Dec;41(6):587-607. doi: 10.1007/s11084-011-9262-1. Epub 2012 Feb 10.

Experimental identification and analysis of macronuclear non-coding RNAs from the ciliate Tetrahymena thermophila.

Nucleic Acids Res. 2012 Feb;40(3):1267-81. doi: 10.1093/nar/gkr792. Epub 2011 Oct 3.

Exploiting Oxytricha trifallax nanochromosomes to screen for non-coding RNA genes.

Nucleic Acids Res. 2011 Sep 1;39(17):7529-47. doi: 10.1093/nar/gkr501. Epub 2011 Jun 28.

Plant noncoding RNA gene discovery by "single-genome comparative genomics".

RNA. 2011 Mar;17(3):390-400. doi: 10.1261/rna.2426511. Epub 2011 Jan 10.

A comparative genome-wide study of ncRNAs in trypanosomatids.

BMC Genomics. 2010 Nov 4;11:615. doi: 10.1186/1471-2164-11-615.

ParameciumDB in 2011: new tools and new data for functional and comparative genomics of the model ciliate Paramecium tetraurelia.

Nucleic Acids Res. 2011 Jan;39(Database issue):D632-6. doi: 10.1093/nar/gkq918. Epub 2010 Oct 14.
