The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam.

Author information

Chiang Zachary, Vastermark Ake, Punta Marco, Coggill Penelope C, Mistry Jaina, Finn Robert D, Saier Milton H

Publication information

Brief Bioinform. 2015 Sep;16(5):865-72. doi: 10.1093/bib/bbu053. Epub 2015 Jan 21.

Abstract

Transport systems comprise roughly 10% of all proteins in a cell, playing critical roles in many processes. Improving and expanding their classification is an important goal that can affect studies ranging from comparative genomics to potential drug target searches. It is not surprising that different classification systems for transport proteins have arisen, be it within a specialized database, focused on this functional class of proteins, or as part of a broader classification system for all proteins. Two such databases are the Transporter Classification Database (TCDB) and the Protein family (Pfam) database. As part of a long-term endeavor to improve consistency between the two classification systems, we have compared transporter annotations in the two databases to understand the rationale for differences and to improve both systems. Differences sometimes reflect the fact that one database has a particular transporter family while the other does not. Differing family definitions and hierarchical organizations were reconciled, resulting in recognition of 69 Pfam 'Domains of Unknown Function', which proved to be transport protein families to be renamed using TCDB annotations. Of over 400 potential new Pfam families identified from TCDB, 10% have already been added to Pfam, and TCDB has created 60 new entries based on Pfam data. This work, for the first time, reveals the benefits of comprehensive database comparisons and explains the differences between Pfam and TCDB.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caeb/4570203/01b15bbed8b9/bbu053f1p.jpg
