用于DNA/蛋白质序列分析、基因功能注释和蛋白质分类的生物信息学工具。

Bioinformatic tools for DNA/protein sequence analysis, functional assignment of genes and protein classification.

作者信息

Rehm B H

机构信息

Institut für Mikrobiologie der Westfalischen Wilhelms-Universität Münster, Germany.

出版信息

Appl Microbiol Biotechnol. 2001 Dec;57(5-6):579-92. doi: 10.1007/s00253-001-0844-0.

DOI:10.1007/s00253-001-0844-0

PMID:11778865

Abstract

The development of efficient DNA sequencing methods has led to the achievement of the DNA sequence of entire genomes from (to date) 55 prokaryotes, 5 eukaryotic organisms and 10 eukaryotic chromosomes. Thus, an enormous amount of DNA sequence data is available and even more will be forthcoming in the near future. Analysis of this overwhelming amount of data requires bioinformatic tools in order to identify genes that encode functional proteins or RNA. This is an important task, considering that even in the well-studied Escherichia coli more than 30% of the identified open reading frames are hypothetical genes. Future challenges of genome sequence analysis will include the understanding of gene regulation and metabolic pathway reconstruction including DNA chip technology, which holds tremendous potential for biomedicine and the biotechnological production of valuable compounds. The overwhelming volume of information often confuses scientists. This review intends to provide a guide to choosing the most efficient way to analyze a new sequence or to collect information on a gene or protein of interest by applying current publicly available databases and Web services. Recently developed tools that allow functional assignment of genes, mainly based on sequence similarity of the deduced amino acid sequence, using the currently available and increasing biological databases will be discussed.

摘要

高效DNA测序方法的发展已使得（截至目前）从55种原核生物、5种真核生物和10条真核染色体中获取了整个基因组的DNA序列。因此，现在已有大量的DNA序列数据，而且在不久的将来还会有更多数据出现。分析如此海量的数据需要生物信息学工具，以便识别编码功能性蛋白质或RNA的基因。鉴于即使在研究充分的大肠杆菌中，超过30%已识别的开放阅读框都是假设基因，这是一项重要任务。基因组序列分析未来面临的挑战将包括对基因调控的理解以及代谢途径重建，其中包括DNA芯片技术，该技术在生物医学和有价值化合物的生物技术生产方面具有巨大潜力。海量的信息常常让科学家们感到困惑。本综述旨在提供一份指南，介绍如何通过应用当前公开可用的数据库和网络服务，选择最有效的方法来分析新序列或收集有关感兴趣的基因或蛋白质的信息。将讨论最近开发的主要基于推导氨基酸序列的序列相似性、利用当前可用且不断增加的生物数据库对基因进行功能分配的工具。

相似文献

Bioinformatic tools for DNA/protein sequence analysis, functional assignment of genes and protein classification.

Appl Microbiol Biotechnol. 2001 Dec;57(5-6):579-92. doi: 10.1007/s00253-001-0844-0.

VISTA family of computational tools for comparative analysis of DNA sequences and whole genomes.

Methods Mol Biol. 2006;338:69-89. doi: 10.1385/1-59745-097-9:69.

Functional and structural genomics using PEDANT.

Bioinformatics. 2001 Jan;17(1):44-57. doi: 10.1093/bioinformatics/17.1.44.

MitoRes: a resource of nuclear-encoded mitochondrial genes and their products in Metazoa.

BMC Bioinformatics. 2006 Jan 24;7:36. doi: 10.1186/1471-2105-7-36.

Inferring function from homology.

Methods Mol Biol. 2008;453:149-68. doi: 10.1007/978-1-60327-429-6_6.

Statistical significance in biological sequence analysis.

Brief Bioinform. 2006 Mar;7(1):2-24. doi: 10.1093/bib/bbk001.

CBS Genome Atlas Database: a dynamic storage for bioinformatic results and sequence data.

Bioinformatics. 2004 Dec 12;20(18):3682-6. doi: 10.1093/bioinformatics/bth423. Epub 2004 Jul 15.

Exact distribution for the local score of one i.i.d. random sequence.

J Comput Biol. 2001;8(4):373-80. doi: 10.1089/106652701752236197.

Protein families and TRIBES in genome sequence space.

Nucleic Acids Res. 2003 Aug 1;31(15):4632-8. doi: 10.1093/nar/gkg495.

引用本文的文献

Deep Neural Network Framework Based on Word Embedding for Protein Glutarylation Sites Prediction.

Life (Basel). 2022 Aug 10;12(8):1213. doi: 10.3390/life12081213.

Mechanism of drug resistance in bacteria: efflux pump modulation for designing of new antibiotic enhancers.

Folia Microbiol (Praha). 2021 Oct;66(5):727-739. doi: 10.1007/s12223-021-00910-z. Epub 2021 Aug 25.

Rewiring of glycerol metabolism in Escherichia coli for effective production of recombinant proteins.

Biotechnol Biofuels. 2020 Dec 14;13(1):205. doi: 10.1186/s13068-020-01848-z.

In-silico Design of DNA Oligonucleotides: Challenges and Approaches.

Comput Struct Biotechnol J. 2019 Jul 29;17:1056-1065. doi: 10.1016/j.csbj.2019.07.008. eCollection 2019.

Functional Characterization of a Gene in Sedum alfredii Hance Resembling Rubber Elongation Factor Endowed with Functions Associated with Cadmium Tolerance.

Front Plant Sci. 2016 Jun 29;7:965. doi: 10.3389/fpls.2016.00965. eCollection 2016.

Label noise in subtype discrimination of class C G protein-coupled receptors: A systematic approach to the analysis of classification errors.

BMC Bioinformatics. 2015 Sep 29;16:314. doi: 10.1186/s12859-015-0731-9.

HMGB1 protein does not mediate the inflammatory response in spontaneous spinal cord regeneration: a hint for CNS regeneration.

J Biol Chem. 2013 Jun 21;288(25):18204-18. doi: 10.1074/jbc.M113.463810. Epub 2013 May 6.

Gecko CD59 is implicated in proximodistal identity during tail regeneration.

PLoS One. 2011 Mar 28;6(3):e17878. doi: 10.1371/journal.pone.0017878.

The molecular cloning of glial fibrillary acidic protein in Gekko japonicus and its expression changes after spinal cord transection.

Cell Mol Biol Lett. 2010 Dec;15(4):582-99. doi: 10.2478/s11658-010-0029-x. Epub 2010 Aug 14.

Bioinformatic analyses of transmembrane transport: novel software for deducing protein phylogeny, topology, and evolution.

J Mol Microbiol Biotechnol. 2009;17(4):163-76. doi: 10.1159/000239667. Epub 2009 Sep 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于DNA/蛋白质序列分析、基因功能注释和蛋白质分类的生物信息学工具。

Bioinformatic tools for DNA/protein sequence analysis, functional assignment of genes and protein classification.

作者信息

Rehm B H

机构信息

Institut für Mikrobiologie der Westfalischen Wilhelms-Universität Münster, Germany.

出版信息

Appl Microbiol Biotechnol. 2001 Dec;57(5-6):579-92. doi: 10.1007/s00253-001-0844-0.

DOI:10.1007/s00253-001-0844-0

PMID:11778865

Abstract

摘要

用于DNA/蛋白质序列分析、基因功能注释和蛋白质分类的生物信息学工具。

Bioinformatic tools for DNA/protein sequence analysis, functional assignment of genes and protein classification.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于DNA/蛋白质序列分析、基因功能注释和蛋白质分类的生物信息学工具。

Bioinformatic tools for DNA/protein sequence analysis, functional assignment of genes and protein classification.

作者信息

机构信息

出版信息

相似文献

引用本文的文献