基因组SEGE：真核生物基因组中“无内含子”基因的数据库。

Genome SEGE: a database for 'intronless' genes in eukaryotic genomes.

作者信息

Sakharkar Meena Kishore, Kangueane Pandjassarame

机构信息

Nanyang Centre for Supercomputing and Visualization, School of Mechanical and Production Engineering, Nanyang Technological University, Singapore 639798.

出版信息

BMC Bioinformatics. 2004 Jun 2;5:67. doi: 10.1186/1471-2105-5-67.

DOI:10.1186/1471-2105-5-67

PMID:15175116

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC434494/

Abstract

BACKGROUND

A number of completely sequenced eukaryotic genome data are available in the public domain. Eukaryotic genes are either 'intron containing' or 'intronless'. Eukaryotic 'intronless' genes are interesting datasets for comparative genomics and evolutionary studies. The SEGE database containing a collection of eukaryotic single exon genes is available. However, SEGE is derived using GenBank. The redundant, incomplete and heterogeneous qualities of GenBank data are a bottleneck for biological investigation in comparative genomics and evolutionary studies. Such studies often require representative gene sets from each genome and this is possible only by deriving specific datasets from completely sequenced genome data. Thus Genome SEGE, a database for 'intronless' genes in completely sequenced eukaryotic genomes, has been constructed.

AVAILABILITY

http://sege.ntu.edu.sg/wester/intronless

DESCRIPTION

Eukaryotic 'intronless' genes are extracted from nine completely sequenced genomes (four of which are unicellular and five of which are multi-cellular). The complete dataset is available for download. Data subsets are also available for 'intronless' pseudo-genes. The database provides information on the distribution of 'intronless' genes in different genomes together with their length distributions in each genome. Additionally, the search tool provides pre-computed PROSITE motifs for each sequence in the database with appropriate hyperlinks to InterPro. A search facility is also available through the web server.

CONCLUSIONS

The unique features that distinguish Genome SEGE from SEGE is the service providing representative 'intronless' datasets for completely sequenced genomes. 'Intronless' gene sets available in this database will be of use for subsequent bio-computational analysis in comparative genomics and evolutionary studies. Such analysis may help to revisit the original genome data for re-examination and re-annotation.

摘要

背景

公共领域中有许多已完成全序列测定的真核生物基因组数据。真核基因可分为“含内含子”或“无内含子”两类。真核生物的“无内含子”基因是比较基因组学和进化研究中有趣的数据集。目前有一个包含真核生物单外显子基因集合的SEGE数据库。然而，SEGE是基于GenBank构建的。GenBank数据的冗余、不完整和异质性是比较基因组学和进化研究中生物学调查的一个瓶颈。此类研究通常需要每个基因组的代表性基因集，而这只有通过从全序列测定的基因组数据中获取特定数据集才能实现。因此，构建了Genome SEGE，一个用于存储已完成全序列测定的真核生物基因组中“无内含子”基因的数据库。

可用性

http://sege.ntu.edu.sg/wester/intronless

描述

真核生物的“无内含子”基因是从九个已完成全序列测定的基因组中提取的（其中四个是单细胞基因组，五个是多细胞基因组）。完整数据集可供下载。还提供了“无内含子”假基因的数据子集。该数据库提供了不同基因组中“无内含子”基因的分布信息以及它们在每个基因组中的长度分布。此外，搜索工具为数据库中的每个序列提供了预先计算的PROSITE基序，并带有指向InterPro的适当超链接。也可通过网络服务器进行搜索。

结论

Genome SEGE与SEGE的独特区别在于，它为已完成全序列测定的基因组提供代表性的“无内含子”数据集服务。该数据库中可用的“无内含子”基因集将用于比较基因组学和进化研究中的后续生物计算分析。此类分析可能有助于重新审视原始基因组数据以进行重新检查和重新注释。

相似文献

Genome SEGE: a database for 'intronless' genes in eukaryotic genomes.

BMC Bioinformatics. 2004 Jun 2;5:67. doi: 10.1186/1471-2105-5-67.

u-Genome: a database on genome design in unicellular genomes.

In Silico Biol. 2005;5(5-6):611-5.

IGD: a resource for intronless genes in the human genome.

Gene. 2011 Nov 15;488(1-2):35-40. doi: 10.1016/j.gene.2011.08.013. Epub 2011 Sep 2.

SEGE: A database on 'intron less/single exonic' genes from eukaryotes.

Bioinformatics. 2002 Sep;18(9):1266-7. doi: 10.1093/bioinformatics/18.9.1266.

Xpro: database of eukaryotic protein-encoding genes.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D59-63. doi: 10.1093/nar/gkh051.

Comparative analysis of human intronless proteins.

Biochem Biophys Res Commun. 2005 Jun 3;331(2):512-9. doi: 10.1016/j.bbrc.2005.03.209.

MitoRes: a resource of nuclear-encoded mitochondrial genes and their products in Metazoa.

BMC Bioinformatics. 2006 Jan 24;7:36. doi: 10.1186/1471-2105-7-36.

Functional and evolutionary analyses on expressed intronless genes in the mouse genome.

FEBS Lett. 2006 Feb 20;580(5):1472-8. doi: 10.1016/j.febslet.2006.01.070. Epub 2006 Jan 31.

Comparative analysis of intronless genes in teleost fish genomes: insights into their evolution and molecular function.

Mar Genomics. 2011 Jun;4(2):109-19. doi: 10.1016/j.margen.2011.03.004. Epub 2011 Apr 7.

RIGD: A Database for Intronless Genes in the Rosaceae.

Front Genet. 2020 Aug 7;11:868. doi: 10.3389/fgene.2020.00868. eCollection 2020.

引用本文的文献

Mutational Bias and Natural Selection Driving the Synonymous Codon Usage of Single-Exon Genes in Rice (Oryza sativa L.).

Rice (N Y). 2023 Feb 27;16(1):11. doi: 10.1186/s12284-023-00627-2.

From Anna University to America and to Agriculture.

Bioinformation. 2021 Jan 31;17(1):29-36. doi: 10.6026/97320630017029. eCollection 2021.

Genome-wide analysis of European sea bass provides insights into the evolution and functions of single-exon genes.

Ecol Evol. 2021 Apr 2;11(11):6546-6557. doi: 10.1002/ece3.7507. eCollection 2021 Jun.

RIGD: A Database for Intronless Genes in the Rosaceae.

Front Genet. 2020 Aug 7;11:868. doi: 10.3389/fgene.2020.00868. eCollection 2020.

Translocation of promoter-conserved hatching enzyme genes with intron-loss provides a new insight in the role of retrocopy during teleostean evolution.

Sci Rep. 2019 Feb 21;9(1):2448. doi: 10.1038/s41598-019-38693-6.

IGDD: a database of intronless genes in dicots.

BMC Bioinformatics. 2016 Jul 27;17:289. doi: 10.1186/s12859-016-1148-9.

SinEx DB: a database for single exon coding sequences in mammalian genomes.

Database (Oxford). 2016 Jun 7;2016. doi: 10.1093/database/baw095. Print 2016.

The consensus 5' splice site motif inhibits mRNA nuclear export.

PLoS One. 2015 Mar 31;10(3):e0122743. doi: 10.1371/journal.pone.0122743. eCollection 2015.

Mammalian introns: when the junk generates molecular diversity.

Int J Mol Sci. 2015 Feb 20;16(3):4429-52. doi: 10.3390/ijms16034429.

PIGD: a database for intronless genes in the Poaceae.

BMC Genomics. 2014 Oct 1;15(1):832. doi: 10.1186/1471-2164-15-832.

本文引用的文献

The InterPro Database, 2003 brings increased coverage and new features.

Nucleic Acids Res. 2003 Jan 1;31(1):315-8. doi: 10.1093/nar/gkg046.

SEGE: A database on 'intron less/single exonic' genes from eukaryotes.

Bioinformatics. 2002 Sep;18(9):1266-7. doi: 10.1093/bioinformatics/18.9.1266.

Molecular fossils in the human genome: identification and analysis of the pseudogenes in chromosomes 21 and 22.

Genome Res. 2002 Feb;12(2):272-80. doi: 10.1101/gr.207102.

The PROSITE database, its status in 2002.

Nucleic Acids Res. 2002 Jan 1;30(1):235-8. doi: 10.1093/nar/30.1.235.

The sequence of the human genome.

Science. 2001 Feb 16;291(5507):1304-51. doi: 10.1126/science.1058040.

Targeted gene deletion in Zygosaccharomyces bailii.

Yeast. 2001 Jan 30;18(2):173-86. doi: 10.1002/1097-0061(20010130)18:2<173::AID-YEA663>3.0.CO;2-F.

Genomes were forged by massive bombardments with retroelements and retrosequences.

Genetica. 1999;107(1-3):209-38.

Many G-protein-coupled receptors are encoded by retrogenes.

Trends Genet. 1999 Aug;15(8):304-5. doi: 10.1016/s0168-9525(99)01783-7.

Why are human G-protein-coupled receptors predominantly intronless?

Trends Genet. 1999 Feb;15(2):47-9. doi: 10.1016/s0168-9525(98)01648-5.

Histone genes: not so simple after all.

Cell. 1984 Oct;38(3):624-6. doi: 10.1016/0092-8674(84)90256-3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基因组SEGE：真核生物基因组中“无内含子”基因的数据库。

Genome SEGE: a database for 'intronless' genes in eukaryotic genomes.

作者信息

机构信息

出版信息

BACKGROUND

AVAILABILITY

DESCRIPTION

CONCLUSIONS

背景

可用性

描述

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献