Cotter Dawn, Guda Purnima, Fahy Eoin, Subramaniam Shankar
San Diego Supercomputer Center, University of California, 9500 Gilman Drive, San Diego, CA 92037, USA.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D463-7. doi: 10.1093/nar/gkh048.
MitoProteome is an object-relational mitochondrial protein sequence database and annotation system. The initial release contains 847 human mitochondrial protein sequences, derived from public sequence databases and mass spectrometric analysis of highly purified human heart mitochondria. Each sequence is manually annotated with primary function, subfunction and subcellular location, and extensively annotated in an automated process with data extracted from external databases, including gene information from LocusLink and Ensembl; disease information from OMIM; protein-protein interaction data from MINT and DIP; functional domain information from Pfam; protein fingerprints from PRINTS; protein family and family-specific signatures from InterPro; structure data from PDB; mutation data from PMD; BLAST homology data from NCBI NR; and proteins found to be related based on LocusLink and SWISS-PROT references and sequence and taxonomy data. By highly automating the processes of maintaining the MitoProteome Protein List and extracting relevant data from external databases, we are able to present a dynamic database, updated frequently to reflect changes in public resources. The MitoProteome database is publicly available at http://www.mitoproteome.org/. Users may browse and search MitoProteome, and access a complete compilation of data relevant to each protein of interest, cross-linked to external databases.