Dagan Tal, Sorek Rotem, Sharon Eilon, Ast Gil, Graur Dan
Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D489-92. doi: 10.1093/nar/gkh132.
Alu elements are short interspersed elements (SINEs) approximately 300 nucleotides in length. More than 1 million Alus are found in the human genome. Although they are genetically functionless, recent findings suggest that Alu elements may have a broad evolutionary impact by affecting gene structures, protein sequences, splicing motifs and expression patterns. Because of these effects, compiling a genomic database of Alu sequences that reside within protein-coding genes seemed a useful enterprise. Presently, such data are limited because the structural and positional information on genes and Alu sequences is scattered across incompatible and unconnected databases. AluGene (http://Alugene.tau.ac.il/) provides easy access to a complete Alu map of the human genome, as well as Alu-associated information. The Alu elements are annotated with respect to coding region and exon/intron location. This design facilitates queries on Alu sequences and locations, as well as on motifs and compositional properties, via a one-stop search page.
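To make the annotation scheme concrete, the following is a minimal sketch of how an Alu might be classified relative to one gene's exons and coding region. It is not the AluGene implementation, which the abstract does not describe. The function names, the label strings, and the use of half-open `[start, end)` coordinates on a single chromosome are all assumptions made for illustration.

```python
from typing import List, Tuple

# Assumption: half-open [start, end) coordinates on one chromosome.
Interval = Tuple[int, int]


def overlaps(a: Interval, b: Interval) -> bool:
    """Return True if two half-open intervals share at least one base."""
    return a[0] < b[1] and b[0] < a[1]


def annotate_alu(alu: Interval, gene: Interval,
                 exons: List[Interval], cds: Interval) -> str:
    """Classify an Alu relative to one gene (hypothetical labels).

    gene  -- transcribed span of the gene
    exons -- exon intervals of the gene
    cds   -- span from the first to the last coding base
    """
    if not overlaps(alu, gene):
        return "intergenic"
    hits = [e for e in exons if overlaps(alu, e)]
    if not hits:
        return "intronic"
    for start, end in hits:
        # Clip the exon to the coding span to get its coding portion.
        coding = (max(start, cds[0]), min(end, cds[1]))
        if coding[0] < coding[1] and overlaps(alu, coding):
            return "exonic, coding"
    return "exonic, non-coding"


# Toy gene, not real genomic data.
gene = (100, 1000)
exons = [(100, 200), (400, 500), (800, 1000)]
cds = (150, 900)
print(annotate_alu((250, 350), gene, exons, cds))
```

An Alu that touches both an exon and the neighboring intron is reported as exonic in this sketch; a real annotation might record both locations.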