Agile Genomics LLC, Mount Pleasant, SC 29466, USA. ulrich.luke+
Nucleic Acids Res. 2010 Jan;38(Database issue):D401-7. doi: 10.1093/nar/gkp940. Epub 2009 Nov 9.
The MiST2 database (http://mistdb.com) identifies and catalogs the repertoire of signal transduction proteins in microbial genomes. Signal transduction systems regulate the majority of cellular activities, including the metabolism, development, host recognition, biofilm production, virulence, and antibiotic resistance of human pathogens. Thus, knowledge of the proteins and interactions that comprise these communication networks is essential to furthering biomedical discovery. Signal transduction proteins are identified by searching protein sequences for specific domain profiles that implicate a protein in signal transduction. Compared to the previous version of the database, MiST2 contains a host of new features and improvements, including the following: draft genomes; extracytoplasmic function (ECF) sigma factor protein identification; enhanced classification of signaling proteins; novel, high-quality domain models for identifying histidine kinases and response regulators; neighboring two-component genes; gene cart; better search capabilities; enhanced taxonomy browser; advanced genome browser; and a modern, biologist-friendly web interface. MiST2 currently contains 966 complete and 157 draft bacterial and archaeal genomes, which collectively contain more than 245 000 signal transduction proteins. The majority (66%) of these are one-component systems, followed by two-component proteins (26%), chemotaxis proteins (6%), and finally ECF sigma factors (2%).
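To give a sense of scale, the reported percentages can be turned into rough per-class protein counts. The sketch below is only illustrative arithmetic: it assumes the stated lower bound of 245 000 proteins as the total, so the results are approximate minimums, and the function and variable names are hypothetical rather than part of MiST2.

```python
# Rough per-class counts from the percentages reported in the abstract.
# Assumption: the total is taken as exactly 245 000, which the abstract
# gives only as a lower bound ("more than 245 000").
TOTAL_PROTEINS = 245_000

# Class shares in percent, as stated in the abstract.
SHARES_PERCENT = {
    "one-component": 66,
    "two-component": 26,
    "chemotaxis": 6,
    "ECF sigma factor": 2,
}


def approximate_counts(total, shares_percent):
    """Return an approximate protein count for each class."""
    return {name: total * pct // 100 for name, pct in shares_percent.items()}


counts = approximate_counts(TOTAL_PROTEINS, SHARES_PERCENT)

for name, count in counts.items():
    print(f"{name}: ~{count}")
```

Under that assumption, one-component systems alone account for roughly 160 000 proteins, more than all other classes combined.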