The PANTHER database of protein families, subfamilies, functions and pathways.

Author information

Mi Huaiyu, Lazareva-Ulitsky Betty, Loo Rozina, Kejariwal Anish, Vandergriff Jody, Rabkin Steven, Guo Nan, Muruganujan Anushya, Doremieux Olivier, Campbell Michael J, Kitano Hiroaki, Thomas Paul D

Affiliation

Computational Biology, Applied Biosystems, 850 Lincoln Center Drive, Foster City, CA 94404, USA.

Publication information

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D284-8. doi: 10.1093/nar/gki078.

Abstract

PANTHER is a large collection of protein families that have been subdivided into functionally related subfamilies, using human expertise. These subfamilies model the divergence of specific functions within protein families, allowing more accurate association with function (ontology terms and pathways), as well as inference of amino acids important for functional specificity. Hidden Markov models (HMMs) are built for each family and subfamily for classifying additional protein sequences. The latest version, 5.0, contains 6683 protein families, divided into 31,705 subfamilies, covering approximately 90% of mammalian protein-coding genes. PANTHER 5.0 includes a number of significant improvements over previous versions, most notably (i) representation of pathways (primarily signaling pathways) and association with subfamilies and individual protein sequences; (ii) an improved methodology for defining the PANTHER families and subfamilies, and for building the HMMs; (iii) resources for scoring sequences against PANTHER HMMs both over the web and locally; and (iv) a number of new web resources to facilitate analysis of large gene lists, including data generated from high-throughput expression experiments. Efforts are underway to add PANTHER to the InterPro suite of databases, and to make PANTHER consistent with the PIRSF database. PANTHER is now publicly available without restriction at http://panther.appliedbiosystems.com.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7eac/540032/0b7ef2ae8cf4/gki078f1.jpg
