Suppr超能文献

全基因组范围内对具有生物学功能的假基因的调查。

Genome-wide survey for biologically functional pseudogenes.

作者信息

Svensson Orjan, Arvestad Lars, Lagergren Jens

机构信息

Stockholm Bioinformatics Centre, Royal Institute of Technology, Albanova University Center, Stockholm, Sweden.

出版信息

PLoS Comput Biol. 2006 May;2(5):e46. doi: 10.1371/journal.pcbi.0020046. Epub 2006 May 5.

Abstract

According to current estimates there exist about 20,000 pseudogenes in a mammalian genome. The vast majority of these are disabled and nonfunctional copies of protein-coding genes which, therefore, evolve neutrally. Recent findings that a Makorin1 pseudogene, residing on mouse Chromosome 5, is, indeed, in vivo vital and also evolutionarily preserved, encouraged us to conduct a genome-wide survey for other functional pseudogenes in human, mouse, and chimpanzee. We identify to our knowledge the first examples of conserved pseudogenes common to human and mouse, originating from one duplication predating the human-mouse species split and having evolved as pseudogenes since the species split. Functionality is one possible way to explain the apparently contradictory properties of such pseudogene pairs, i.e., high conservation and ancient origin. The hypothesis of functionality is tested by comparing expression evidence and synteny of the candidates with proper test sets. The tests suggest potential biological function. Our candidate set includes a small set of long-lived pseudogenes whose unknown potential function is retained since before the human-mouse species split, and also a larger group of primate-specific ones found from human-chimpanzee searches. Two processed sequences are notable, their conservation since the human-mouse split being as high as most protein-coding genes; one is derived from the protein Ataxin 7-like 3 (ATX7NL3), and one from the Spinocerebellar ataxia type 1 protein (ATX1). Our approach is comparative and can be applied to any pair of species. It is implemented by a semi-automated pipeline based on cross-species BLAST comparisons and maximum-likelihood phylogeny estimations. To separate pseudogenes from protein-coding genes, we use standard methods, utilizing in-frame disablements, as well as a probabilistic filter based on Ka/Ks ratios.

摘要

根据目前的估计,哺乳动物基因组中大约存在20000个假基因。其中绝大多数是蛋白质编码基因的失活和无功能副本,因此以中性方式进化。最近有研究发现,位于小鼠5号染色体上的Makorin1假基因在体内确实至关重要,并且在进化上也得以保留,这促使我们对人类、小鼠和黑猩猩的其他功能性假基因进行全基因组调查。据我们所知,我们鉴定出了人类和小鼠共有的保守假基因的首个实例,这些假基因起源于人类与小鼠物种分化之前的一次复制事件,自物种分化后一直作为假基因进化。功能是解释此类假基因对明显矛盾特性(即高度保守和古老起源)的一种可能方式。通过将候选基因的表达证据和共线性与适当的测试集进行比较,对功能假说进行了检验。这些测试表明了潜在的生物学功能。我们的候选基因集包括一小部分寿命较长的假基因,其未知的潜在功能自人类与小鼠物种分化之前就得以保留,还包括从人类与黑猩猩搜索中发现的一大类灵长类特异性假基因。有两个加工序列值得注意,它们自人类与小鼠分化以来的保守程度与大多数蛋白质编码基因一样高;一个源自Ataxin 7样3蛋白(ATX7NL3),另一个源自脊髓小脑共济失调1型蛋白(ATX1)。我们的方法是比较性的,可应用于任何一对物种。它通过基于跨物种BLAST比较和最大似然系统发育估计的半自动流程来实现。为了将假基因与蛋白质编码基因区分开来,我们使用标准方法,利用框内失活以及基于Ka/Ks比率的概率过滤器。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验