Immunoglobulin superfamily proteins in Caenorhabditis elegans.

Authors

Teichmann S A, Chothia C

Affiliation

MRC Laboratory of Molecular Biology, Hills Road, Cambridge, CB2 2QH, UK.

Publication

J Mol Biol. 2000 Mar 10;296(5):1367-83. doi: 10.1006/jmbi.1999.3497.

Abstract

The predicted proteins of the genome of Caenorhabditis elegans were analysed by various sequence comparison methods to identify the repertoire of proteins that are members of the immunoglobulin superfamily (IgSF). The IgSF is one of the largest families of protein domain in this genome and likely to be one of the major families in other multicellular eukaryotes too. This is because members of the superfamily are involved in a variety of functions including cell-cell recognition, cell-surface receptors, muscle structure and, in higher organisms, the immune system. Sixty-four proteins with 488 I set IgSF domains were identified largely by using Hidden Markov models. The domain architectures of the protein products of these 64 genes are described. Twenty-one of these had been characterised previously. We show that another 25 are related to proteins of known function. The C. elegans IgSF proteins can be classified into five broad categories: muscle proteins, protein kinases and phosphatases, three categories of proteins involved in the development of the nervous system, leucine-rich repeat containing proteins and proteins without homologues of known function, of which there are 18. The 19 proteins involved in nervous system development that are not kinases or phosphatases are homologues of neuroglian, axonin, NCAM, wrapper, klingon, ICCR and nephrin or belong to the recently identified zig gene family. Out of the set of 64 genes, 22 are on the X chromosome. This study should be seen as an initial description of the IgSF repertoire in C. elegans, because the current gene definitions may contain a number of errors, especially in the case of long sequences, and there may be IgSF genes that have not yet been detected. However, the proteins described here do provide an overview of the bulk of the repertoire of immunoglobulin superfamily members in C. elegans, a framework for refinement and extension of the repertoire as gene and protein definitions improve, and the basis for investigations of their function and for comparisons with the repertoires of other organisms.
