• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Key2Ann:一种通过用人类可读注释替换数据库标识符来处理序列集的工具。

Key2Ann: a tool to process sequence sets by replacing database identifiers with a human-readable annotation.

作者信息

Pürzer Andreas, Grassmann Felix, Birzer Dietmar, Merkl Rainer

机构信息

University of Applied Sciences, Department of Computer Science and Mathematics, 93025 Regensburg, Germany.

出版信息

J Integr Bioinform. 2011 Mar 4;8(1):539. doi: 10.2390/biecoll-jib-2011-153.

DOI:10.2390/biecoll-jib-2011-153
PMID:21372341
Abstract

Deducing common properties or degrees of phylogenetic relationship by analyzing a grouping or clustering of sequence sets is a frequently used technique in computational biology. If interpreted by means of visual inspection, the conclusions depend for many of these applications on meaningful names for the input data. In accordance with the aim of the analysis, the sequences should be provided with names indicating the function of the genes or gene-products, the phylogenetic position or other properties characterizing the contributing species. However, sequences extracted from databases are most often annotated with identifiers which only implicitly contain the desired information. To solve this problem, we have designed and implemented a tool named Key2Ann, which replaces in multiple fasta files the database keys with short terms indicating the taxonomic position or other features like the gene name or the EC-number. In addition, properties like habitat, growth temperature or the degree of pathogenicity can be coded for microbial species. To allow for highest flexibility, the user can control the composition of the names by means of command line parameters. Key2Ann is written in Java and can be downloaded via http://www-bioinf.uni-regensburg.de/downl/Key2Ann.zip. We demonstrate the usage of Key2Ann by discussing three typical examples of phylogenetic analysis.

摘要

通过分析序列集的分组或聚类来推断系统发育关系的共同属性或程度,是计算生物学中常用的技术。如果通过目视检查来解释,对于许多此类应用而言,结论取决于输入数据是否有有意义的名称。根据分析目的,序列应被赋予能够表明基因或基因产物功能、系统发育位置或表征相关物种的其他属性的名称。然而,从数据库中提取的序列通常用标识符进行注释,这些标识符仅隐含地包含所需信息。为了解决这个问题,我们设计并实现了一个名为Key2Ann的工具,它在多个fasta文件中用表示分类位置或其他特征(如基因名称或酶委员会编号)的简短术语替换数据库键。此外,对于微生物物种,可以编码诸如栖息地、生长温度或致病程度等属性。为了实现最高的灵活性,用户可以通过命令行参数控制名称的组成。Key2Ann用Java编写,可以通过http://www-bioinf.uni-regensburg.de/downl/Key2Ann.zip下载。我们通过讨论系统发育分析的三个典型例子来演示Key2Ann的用法。

相似文献

1
Key2Ann: a tool to process sequence sets by replacing database identifiers with a human-readable annotation.Key2Ann:一种通过用人类可读注释替换数据库标识符来处理序列集的工具。
J Integr Bioinform. 2011 Mar 4;8(1):539. doi: 10.2390/biecoll-jib-2011-153.
2
MILANO--custom annotation of microarray results using automatic literature searches.米兰——使用自动文献检索对微阵列结果进行定制注释。
BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12.
3
PhyloGena--a user-friendly system for automated phylogenetic annotation of unknown sequences.PhyloGena——一个用于对未知序列进行自动系统发育注释的用户友好型系统。
Bioinformatics. 2007 Apr 1;23(7):793-801. doi: 10.1093/bioinformatics/btm016. Epub 2007 Mar 1.
4
PROMPT: a protein mapping and comparison tool.提示:一种蛋白质图谱绘制与比较工具。
BMC Bioinformatics. 2006 Jul 4;7:331. doi: 10.1186/1471-2105-7-331.
5
JUICE: a data management system that facilitates the analysis of large volumes of information in an EST project workflow.JUICE:一个数据管理系统,可在EST项目工作流程中促进对大量信息的分析。
BMC Bioinformatics. 2006 Nov 23;7:513. doi: 10.1186/1471-2105-7-513.
6
A database of phylogenetically atypical genes in archaeal and bacterial genomes, identified using the DarkHorse algorithm.一个使用黑马算法识别出的古菌和细菌基因组中系统发育非典型基因的数据库。
BMC Bioinformatics. 2008 Oct 7;9:419. doi: 10.1186/1471-2105-9-419.
7
Taxonomic colouring of phylogenetic trees of protein sequences.蛋白质序列系统发育树的分类着色。
BMC Bioinformatics. 2006 Feb 17;7:79. doi: 10.1186/1471-2105-7-79.
8
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
9
Omics data management and annotation.组学数据管理与注释
Methods Mol Biol. 2011;719:71-96. doi: 10.1007/978-1-61779-027-0_3.
10
PhyloPat: an updated version of the phylogenetic pattern database contains gene neighborhood.PhyloPat:包含基因邻域的系统发育模式数据库的更新版本。
Nucleic Acids Res. 2009 Jan;37(Database issue):D731-7. doi: 10.1093/nar/gkn645. Epub 2008 Oct 2.

引用本文的文献

1
Long-Term Persistence of Bi-functionality Contributes to the Robustness of Microbial Life through Exaptation.双功能性的长期持续存在通过扩展适应促进了微生物生命的稳健性。
PLoS Genet. 2016 Jan 29;12(1):e1005836. doi: 10.1371/journal.pgen.1005836. eCollection 2016 Jan.