Suppr超能文献

后生动物中生物活性肽数据库的构建。

The construction of a bioactive peptide database in Metazoa.

作者信息

Liu Feng, Baggerman Geert, Schoofs Liliane, Wets Geert

机构信息

Data Analysis & Modeling Group, Transportation Research Institute, Hasselt University, Building D, 3590 Diepenbeek, Belgium.

出版信息

J Proteome Res. 2008 Sep;7(9):4119-31. doi: 10.1021/pr800037n. Epub 2008 Aug 16.

Abstract

Bioactive peptides play critical roles in regulating most biological processes in animals, and have considerable biological, medical and industrial importance. A number of peptides have been discovered usually based on their biological activities in vitro or based on their sequence similarities in silico. Through searches in Swiss-Prot and Trembl protein databases using BLAST alignment tools and other in silico methods, all currently known bioactive peptides and their precursor proteins are extracted. In addition, 132 recently discovered putative peptide genes in Drosophila as well as their orthologs in other species are collected. In total, 20 027 bioactive peptides from 19 438 precursor proteins covering 2820 metazoan species are retained, and they, respectively, make up a peptide and a peptide precursor database. The peptides and peptide precursor proteins are further classified into 373 families, 178 of which are represented by Prosite Pfam or Smart motifs, or by typical peptide motifs that have been constructed recently. The remaining 195 families are novel peptide families. The motifs characterizing the 178 peptide families are saved into a peptide motif database. The peptide, peptide precursor and peptide motif databases (version 1.0) are the most complete peptide, precursor and peptide motif collection in Metazoa so far. They are available on the WWW at http://www.peptides.be/.

摘要

生物活性肽在调节动物的大多数生物过程中发挥着关键作用,并且具有相当大的生物学、医学和工业重要性。许多肽通常是基于它们在体外的生物活性或基于它们在计算机上的序列相似性而被发现的。通过使用BLAST比对工具和其他计算机方法在Swiss-Prot和Trembl蛋白质数据库中进行搜索,提取了所有目前已知的生物活性肽及其前体蛋白。此外,还收集了果蝇中最近发现的132个假定的肽基因及其在其他物种中的直系同源基因。总共保留了来自19438个前体蛋白、涵盖2820种后生动物物种的20027个生物活性肽,它们分别构成了一个肽数据库和一个肽前体数据库。这些肽和肽前体蛋白进一步被分类为373个家族,其中178个家族由Prosite Pfam或Smart基序,或由最近构建的典型肽基序表示。其余195个家族是新的肽家族。表征这178个肽家族的基序被保存到一个肽基序数据库中。肽、肽前体和肽基序数据库(版本1.0)是迄今为止后生动物中最完整的肽、前体和肽基序集合。它们可在万维网上通过http://www.peptides.be/获取。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验