ncRNA orthologies in the vertebrate lineage.

Authors

Pignatelli Miguel, Vilella Albert J, Muffato Matthieu, Gordon Leo, White Simon, Flicek Paul, Herrero Javier

Affiliations

European Molecular Biology Laboratory, European Bioinformatics Institute

Publication information

Database (Oxford). 2016 Mar 15;2016. doi: 10.1093/database/bav127. Print 2016.

DOI:10.1093/database/bav127

PMID:26980512

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4792531/

Abstract

Annotation of orthologous and paralogous genes is necessary for many aspects of evolutionary analysis. Methods to infer these homology relationships have traditionally focused on protein-coding genes, and the evolutionary models used by these methods normally assume that the positions in the protein evolve independently. However, as our appreciation for the roles of non-coding RNA genes has increased, consistently annotated sets of orthologous and paralogous ncRNA genes are increasingly needed. At the same time, methods such as PHASE or RAxML have implemented substitution models that consider pairs of sites to enable proper modelling of the loops and other features of RNA secondary structure. Here, we present a comprehensive analysis pipeline for the automatic detection of orthologues and paralogues for ncRNA genes. We focus on gene families that are represented in Rfam and for which a specific covariance model is provided. For each family, the ncRNA genes found in all Ensembl species are aligned using Infernal, and several trees are built using different substitution models. In parallel, a genomic alignment that includes the ncRNA genes and their flanking sequence regions is built with PRANK. This alignment is used to create two additional phylogenetic trees using the neighbour-joining (NJ) and maximum-likelihood (ML) methods. The trees arising from both the ncRNA and genomic alignments are merged using TreeBeST, which reconciles them with the species tree in order to identify speciation and duplication events. The final tree is used to infer the orthologues and paralogues following Fitch's definition. We also determine gene gain and loss events for each family using CAFE. All data are accessible through the Ensembl Comparative Genomics ('Compara') API and on our FTP site, and are fully integrated in the Ensembl genome browser, where they can be accessed in a user-friendly manner. Database URL: http://www.ensembl.org.
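The last inference step in the abstract (label each internal node of the final gene tree as a speciation or a duplication, then call gene pairs orthologues or paralogues following Fitch's definition) can be illustrated with a small sketch. This is not the Ensembl Compara or TreeBeST implementation: it assumes a simplified species-overlap rule in place of a full reconciliation with the species tree, and the tree encoding, the `classify_pairs` function, and the gene names are hypothetical.

```python
from itertools import product

def classify_pairs(tree):
    """Classify every gene pair in a rooted binary gene tree.

    Leaves are "species:gene" strings; internal nodes are 2-tuples.
    Assumption (simplified species-overlap rule, not TreeBeST's
    reconciliation): a node is a duplication if the species sets of
    its two subtrees overlap, otherwise a speciation.
    Following Fitch's definition, two genes are orthologues if their
    last common ancestor is a speciation, paralogues if it is a
    duplication.
    """
    pairs = {}

    def visit(node):
        if isinstance(node, str):
            return [node]
        left, right = visit(node[0]), visit(node[1])
        left_species = {gene.split(":")[0] for gene in left}
        right_species = {gene.split(":")[0] for gene in right}
        is_duplication = bool(left_species & right_species)
        relation = "paralogue" if is_duplication else "orthologue"
        # This node is the last common ancestor of every cross pair.
        for a, b in product(left, right):
            pairs[frozenset((a, b))] = relation
        return left + right

    visit(tree)
    return pairs

# Hypothetical toy family: one duplication before the human/mouse split.
toy_tree = (
    (("human:mir1a", "mouse:mir1a"), ("human:mir1b", "mouse:mir1b")),
    "zebrafish:mir1",
)
pairs = classify_pairs(toy_tree)
for pair, relation in sorted(pairs.items(), key=lambda kv: sorted(kv[0])):
    print(" / ".join(sorted(pair)), "->", relation)
```

In this toy tree the two human/mouse pairs inside each subtree come out as orthologues, the pairs that cross the duplication node come out as paralogues, and the zebrafish gene is an orthologue of all four. A real reconciliation also uses the species tree topology, which lets it detect duplications followed by losses that the overlap rule alone can miss.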

Similar articles

Ensembl comparative genomics resources.

Database (Oxford). 2016 Feb 20;2016. doi: 10.1093/database/bav096. Print 2016.

OGS2: genome re-annotation of the jewel wasp Nasonia vitripennis.

BMC Genomics. 2016 Aug 25;17(1):678. doi: 10.1186/s12864-016-2886-9.

Evolution of vertebrate genes related to prion and Shadoo proteins--clues from comparative genomic analysis.

Mol Biol Evol. 2004 Dec;21(12):2210-31. doi: 10.1093/molbev/msh245. Epub 2004 Sep 1.

Ensembl 2021.

Nucleic Acids Res. 2021 Jan 8;49(D1):D884-D891. doi: 10.1093/nar/gkaa942.

Ensembl Genomes 2020-enabling non-vertebrate genomic research.

Nucleic Acids Res. 2020 Jan 8;48(D1):D689-D695. doi: 10.1093/nar/gkz890.

Non-Coding RNA Analysis Using the Rfam Database.

Curr Protoc Bioinformatics. 2018 Jun;62(1):e51. doi: 10.1002/cpbi.51. Epub 2018 Jun 5.

Chain-RNA: a comparative ncRNA search tool based on the two-dimensional chain algorithm.

IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):274-85. doi: 10.1109/TCBB.2012.137.

Two rounds of whole genome duplication in the ancestral vertebrate.

PLoS Biol. 2005 Oct;3(10):e314. doi: 10.1371/journal.pbio.0030314. Epub 2005 Sep 6.

Plant noncoding RNA gene discovery by "single-genome comparative genomics".

RNA. 2011 Mar;17(3):390-400. doi: 10.1261/rna.2426511. Epub 2011 Jan 10.

Cited by

Critical analysis of descriptive microRNA data in the translational research on cardioprotection and cardiac repair: lost in the complexity of bioinformatics.

Basic Res Cardiol. 2025 Apr 9. doi: 10.1007/s00395-025-01104-1.

miRNAs in Heart Development and Disease.

Int J Mol Sci. 2024 Jan 30;25(3):1673. doi: 10.3390/ijms25031673.

G-quadruplex occurrence and conservation: more than just a question of guanine-cytosine content.

NAR Genom Bioinform. 2022 Mar 4;4(1):lqac010. doi: 10.1093/nargab/lqac010. eCollection 2022 Mar.

catRAPID omics v2.0: going deeper and wider in the prediction of protein-RNA interactions.

Nucleic Acids Res. 2021 Jul 2;49(W1):W72-W79. doi: 10.1093/nar/gkab393.

AASRA: an anchor alignment-based small RNA annotation pipeline.

Biol Reprod. 2021 Jul 2;105(1):267-277. doi: 10.1093/biolre/ioab062.

RNAcentral 2021: secondary structure integration, improved sequence search and new member databases.

Nucleic Acids Res. 2021 Jan 8;49(D1):D212-D220. doi: 10.1093/nar/gkaa921.

H/ACA snoRNA levels are regulated during stem cell differentiation.

Nucleic Acids Res. 2020 Sep 4;48(15):8686-8703. doi: 10.1093/nar/gkaa612.

GraphClust2: Annotation and discovery of structured RNAs with scalable and accessible integrative clustering.

Gigascience. 2019 Dec 1;8(12). doi: 10.1093/gigascience/giz150.

Integrated analysis of lncRNA, miRNA and mRNA expression profiling in patients with systemic lupus erythematosus.

Arch Med Sci. 2019 Jul;15(4):872-879. doi: 10.5114/aoms.2018.79145. Epub 2018 Oct 19.

Controlling metastatic cancer: the role of phytochemicals in cell signaling.

J Cancer Res Clin Oncol. 2019 May;145(5):1087-1109. doi: 10.1007/s00432-019-02892-5. Epub 2019 Mar 22.

References

Ensembl comparative genomics resources.

Database (Oxford). 2016 Feb 20;2016. doi: 10.1093/database/bav096. Print 2016.

Rfam 12.0: updates to the RNA families database.

Nucleic Acids Res. 2015 Jan;43(Database issue):D130-7. doi: 10.1093/nar/gku1063. Epub 2014 Nov 11.

miRBase: annotating high confidence microRNAs using deep sequencing data.

Nucleic Acids Res. 2014 Jan;42(Database issue):D68-73. doi: 10.1093/nar/gkt1181. Epub 2013 Nov 25.

PhylomeDB v4: zooming into the plurality of evolutionary histories of a genome.

Nucleic Acids Res. 2014 Jan;42(Database issue):D897-902. doi: 10.1093/nar/gkt1177. Epub 2013 Nov 25.

TreeFam v9: a new website, more species and orthology-on-the-fly.

Nucleic Acids Res. 2014 Jan;42(Database issue):D922-5. doi: 10.1093/nar/gkt1055. Epub 2013 Nov 4.

Infernal 1.1: 100-fold faster RNA homology searches.

Bioinformatics. 2013 Nov 15;29(22):2933-5. doi: 10.1093/bioinformatics/btt509. Epub 2013 Sep 4.

PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees.

Nucleic Acids Res. 2013 Jan;41(Database issue):D377-86. doi: 10.1093/nar/gks1118. Epub 2012 Nov 27.

Human microRNAs originated from two periods at accelerated rates in mammalian evolution.

Mol Biol Evol. 2013 Mar;30(3):613-26. doi: 10.1093/molbev/mss262. Epub 2012 Nov 20.

Rfam 11.0: 10 years of RNA families.

Nucleic Acids Res. 2013 Jan;41(Database issue):D226-32. doi: 10.1093/nar/gks1005. Epub 2012 Nov 3.

Birth and expression evolution of mammalian microRNA genes.

Genome Res. 2013 Jan;23(1):34-45. doi: 10.1101/gr.140269.112. Epub 2012 Oct 3.
