Suppr超能文献

FREP:小鼠cDNA中功能重复序列的数据库。

FREP: a database of functional repeats in mouse cDNAs.

作者信息

Nagashima Takeshi, Matsuda Hideo, Silva Diego G, Petrovsky Nikolai, Konagaya Akihiko, Schönbach Christian, Kasukawa Takeya, Arakawa Takahiro, Carninci Piero, Kawai Jun, Hayashizaki Yoshihide

机构信息

Biomedical Knowledge Discovery Team, Bioinformatics Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan.

出版信息

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D471-5. doi: 10.1093/nar/gkh123.

Abstract

The FREP database (http://facts.gsc.riken.go.jp/FREP/) contains 31 396 RepeatMasker-identified non-redundant variant repeat sequences derived from 16,527 mouse cDNAs with protein-coding potential. The repeats were computationally associated with potential effects on transcriptional variation, translation, protein function or involvement in disease to identify Functional REPeats (FREPs). FREPs are defined by the (i) occurrence of exon-exon boundaries in repeats, (ii) presence of polyadenylation sites in 3'UTR-located repeats, (iii) effect on translation, (iv) position in the protein- coding region or protein domains or (v) conditional association with disease MeSH terms. Currently the database contains 9261 (29.5%) inferred FREPs derived from 6861 (41.5%) mouse cDNAs. Integrated evidence of the functional assignments and dynamically generated sequence similarity search results support the exploration and annotation of functional, ancestral or taxon-specific repeats. Keyword and pre-selected feature searches (e.g. coding sequence-repeat or splice site-repeat relations) support intuitive database querying as well as the retrieval of repeat sequences. Integrated sequence search and alignment tools allow the analysis of known or identification of new functional repeat candidates. FREP is a unique resource for illuminating the role of transposons and repetitive sequences in shaping the coding part of the mouse transcriptome and for selecting the appropriate experimental model to study diseases with suspected repeat etiology contributions.

相似文献

1
FREP: a database of functional repeats in mouse cDNAs.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D471-5. doi: 10.1093/nar/gkh123.
5
HUGE: a database for human KIAA proteins, a 2004 update integrating HUGEppi and ROUGE.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D502-4. doi: 10.1093/nar/gkh035.
7
transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences.
BMC Bioinformatics. 2005 Jun 22;6:156. doi: 10.1186/1471-2105-6-156.
8
The frequency and position of Alu repeats in cDNAs, as determined by database searching.
Genomics. 1995 Jun 10;27(3):544-8. doi: 10.1006/geno.1995.1090.
9
COPASAAR--a database for proteomic analysis of single amino acid repeats.
BMC Bioinformatics. 2005 Aug 3;6:196. doi: 10.1186/1471-2105-6-196.
10
Database for chicken full-length cDNAs.
Physiol Genomics. 2007 Jan 17;28(2):141-5. doi: 10.1152/physiolgenomics.00097.2006. Epub 2006 Nov 14.

引用本文的文献

1
The Air noncoding RNA: an imprinted cis-silencing transcript.
Cold Spring Harb Symp Quant Biol. 2004;69:55-66. doi: 10.1101/sqb.2004.69.55.
2
Computational method for discovery of estrogen responsive genes.
Nucleic Acids Res. 2004 Dec 1;32(21):6212-7. doi: 10.1093/nar/gkh943. Print 2004.

本文引用的文献

1
Multiple sequence alignment with the Clustal series of programs.
Nucleic Acids Res. 2003 Jul 1;31(13):3497-500. doi: 10.1093/nar/gkg500.
2
Inferring higher functional information for RIKEN mouse full-length cDNA clones with FACTS.
Genome Res. 2003 Jun;13(6B):1520-33. doi: 10.1101/gr.1019903.
4
ATLAS: a system to selectively identify human-specific L1 insertions.
Am J Hum Genet. 2003 Apr;72(4):823-38. doi: 10.1086/373939. Epub 2003 Mar 11.
5
The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.
Nucleic Acids Res. 2003 Jan 1;31(1):365-70. doi: 10.1093/nar/gkg095.
6
The InterPro Database, 2003 brings increased coverage and new features.
Nucleic Acids Res. 2003 Jan 1;31(1):315-8. doi: 10.1093/nar/gkg046.
9
Initial sequencing and comparative analysis of the mouse genome.
Nature. 2002 Dec 5;420(6915):520-62. doi: 10.1038/nature01262.
10
GeneCards 2002: towards a complete, object-oriented, human gene compendium.
Bioinformatics. 2002 Nov;18(11):1542-3. doi: 10.1093/bioinformatics/18.11.1542.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验