Suppr超能文献

Repbase中重复元件的注释、提交和筛选:RepbaseSubmitter和Censor。

Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor.

作者信息

Kohany Oleksiy, Gentles Andrew J, Hankus Lukasz, Jurka Jerzy

机构信息

Genetic Information Research Institute, 1925 Landings Drive, Mountain View, CA 94043, USA.

出版信息

BMC Bioinformatics. 2006 Oct 25;7:474. doi: 10.1186/1471-2105-7-474.

Abstract

BACKGROUND

Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintenance of the database requires specialized tools, which we have created and made available for use with Repbase, and which may be useful as a template for other curated databases.

RESULTS

We describe the software tools RepbaseSubmitter and Censor, which are designed to facilitate updating and screening the content of Repbase. RepbaseSubmitter is a java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors, and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein coding regions in sequences; searching and including Pubmed references in Repbase entries; and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position. Censor is a tool to rapidly identify repetitive elements by comparison to known repeats. It uses WU-BLAST for speed and sensitivity, and can conduct DNA-DNA, DNA-protein, or translated DNA-translated DNA searches of genomic sequence. Defragmented output includes a map of repeats present in the query sequence, with the options to report masked query sequence(s), repeat sequences found in the query, and alignments.

CONCLUSION

Censor and RepbaseSubmitter are available as both web-based services and downloadable versions. They can be found at http://www.girinst.org/repbase/submission.html (RepbaseSubmitter) and http://www.girinst.org/censor/index.php (Censor).

摘要

背景

Repbase是一个真核生物重复DNA的参考数据库,其中包括重复序列的原型序列以及注释中描述的基本信息。该数据库的更新和维护需要专门的工具,我们已经创建了这些工具并使其可与Repbase一起使用,它们也可用作其他经过整理的数据库的模板。

结果

我们描述了软件工具RepbaseSubmitter和Censor,它们旨在促进Repbase内容的更新和筛选。RepbaseSubmitter是一个基于Java的界面,用于格式化和注释Repbase条目。它消除了许多常见的格式错误,并自动执行诸如计算序列长度和组成等操作,从而便于整理Repbase序列。此外,它具有几个用于预测序列中蛋白质编码区域的功能;在Repbase条目中搜索并包含PubMed参考文献;以及在NCBI分类数据库中搜索以正确包含物种信息和分类位置。Censor是一种通过与已知重复序列进行比较来快速识别重复元件的工具。它使用WU-BLAST以提高速度和灵敏度,并且可以对基因组序列进行DNA-DNA、DNA-蛋白质或翻译后的DNA-翻译后的DNA搜索。碎片化输出包括查询序列中存在的重复序列图谱,并可选择报告屏蔽后的查询序列、在查询中找到的重复序列以及比对结果。

结论

Censor和RepbaseSubmitter既可以作为基于网络的服务使用,也可以下载使用。可在http://www.girinst.org/repbase/submission.html(RepbaseSubmitter)和http://www.girinst.org/censor/index.php(Censor)找到它们。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4733/1634758/677ba872830a/1471-2105-7-474-1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验