• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

原核单跨膜蛋白结构域架构的全蛋白质组分析。

A proteome-wide analysis of domain architectures of prokaryotic single-spanning transmembrane proteins.

作者信息

Arai Masafumi, Fukushi Takafumi, Satake Masanobu, Shimizu Toshio

机构信息

Department of Electronic and Information System Engineering, Faculty of Science and Technology, Hirosaki University, Japan.

出版信息

Comput Biol Chem. 2005 Oct;29(5):379-87. doi: 10.1016/j.compbiolchem.2005.08.004. Epub 2005 Oct 6.

DOI:10.1016/j.compbiolchem.2005.08.004
PMID:16213795
Abstract

We performed a proteome-wide survey of the domain architectures in single-spanning transmembrane (TM) proteins (single-spannings) from 87 sequenced prokaryotic (Bacterial and Archaean) genomes by assigning Pfam domains to their N-tail and C-tail loops. Out of 14,625 single-spannings, 3,516 sequences have at least one domain assigned, and no domains were assigned to 7,850, with the remaining 3,259 with less reliable assignment. In the domain-assigned sequences, 3116 sequences are with at most two domains, and the other 400 sequences with more than two. The assigned domains distribute over 651 Pfam families, which account for 11.4% of the total Pfam-A families. Among the 651 families are mostly soluble-protein-originated ones, but only 21 families are unique to TM proteins. The occurrence frequency of the individual domain families follows a power-law, that is, 264 families occur only once, 106 just twice, and the families appeared more than 30 times are counted by only 39. It is found that the great majority of the sequences having one or two domains are of the type II topology with the C-tail loop containing domains on it. On the contrary, the N-tail loop of the same type topology seldom carries domains. Importantly, the assigned domains are always found on the tail loops longer than 60 residues, even for the small domains with less than 30 residues. There are still as many as 5,800 sequences without assigned domains in spite of having at least one long tail, on which no less than 1,000 novel domain families are expected most likely to lie concealed unknown yet. We also investigated the domain arrangement preference and the domain family combination patterns in 'singlets' (single-spannings with one assigned domain) and 'doublets' (with two domains).

摘要

我们通过将Pfam结构域分配给87个已测序的原核生物(细菌和古生菌)基因组中的单跨膜(TM)蛋白(单跨膜蛋白)的N端和C端环,对其结构域架构进行了全蛋白质组范围的调查。在14625个单跨膜蛋白中,3516个序列至少有一个已分配的结构域,7850个未分配结构域,其余3259个分配不太可靠。在已分配结构域的序列中,3116个序列最多有两个结构域,另外400个序列有两个以上结构域。已分配的结构域分布在651个Pfam家族中,占Pfam-A家族总数的11.4%。在这651个家族中,大多数是起源于可溶性蛋白的家族,但只有21个家族是TM蛋白特有的。各个结构域家族的出现频率遵循幂律,即264个家族只出现一次,106个家族只出现两次,出现超过30次的家族只有39个。结果发现,绝大多数具有一个或两个结构域的序列属于II型拓扑结构,其C端环上含有结构域。相反,相同类型拓扑结构的N端环很少携带结构域。重要的是,即使是长度小于30个残基的小结构域,已分配的结构域也总是出现在长度超过60个残基的尾环上。尽管有至少一个长尾巴,但仍有多达5800个序列未分配结构域,最有可能隐藏着不少于1000个未知的新结构域家族。我们还研究了“单结构域蛋白”(具有一个已分配结构域的单跨膜蛋白)和“双结构域蛋白”(具有两个结构域的单跨膜蛋白)中的结构域排列偏好和结构域家族组合模式。

相似文献

1
A proteome-wide analysis of domain architectures of prokaryotic single-spanning transmembrane proteins.原核单跨膜蛋白结构域架构的全蛋白质组分析。
Comput Biol Chem. 2005 Oct;29(5):379-87. doi: 10.1016/j.compbiolchem.2005.08.004. Epub 2005 Oct 6.
2
Internal gene duplication in the evolution of prokaryotic transmembrane proteins.原核生物跨膜蛋白进化中的内部基因复制。
J Mol Biol. 2004 May 21;339(1):1-15. doi: 10.1016/j.jmb.2004.03.048.
3
Domain combinations in archaeal, eubacterial and eukaryotic proteomes.古菌、真细菌和真核生物蛋白质组中的结构域组合
J Mol Biol. 2001 Jul 6;310(2):311-25. doi: 10.1006/jmbi.2001.4776.
4
Identification and distribution of protein families in 120 completed genomes using Gene3D.利用Gene3D在120个已完成测序的基因组中鉴定蛋白质家族并分析其分布情况。
Proteins. 2005 May 15;59(3):603-15. doi: 10.1002/prot.20409.
5
Genome-wide survey of transcription factors in prokaryotes reveals many bacteria-specific families not found in archaea.对原核生物转录因子的全基因组调查揭示了许多在古细菌中未发现的细菌特异性家族。
DNA Res. 2005;12(5):269-80. doi: 10.1093/dnares/dsi016. Epub 2006 Jan 10.
6
Global phylogeny determined by the combination of protein domains in proteomes.由蛋白质组中蛋白质结构域组合所确定的全球系统发育。
Mol Biol Evol. 2006 Dec;23(12):2444-54. doi: 10.1093/molbev/msl117. Epub 2006 Sep 13.
7
Multi-domain proteins in the three kingdoms of life: orphan domains and other unassigned regions.生命三界中的多结构域蛋白:孤儿结构域及其他未分类区域。
J Mol Biol. 2005 Apr 22;348(1):231-43. doi: 10.1016/j.jmb.2005.02.007.
8
Comprehensive analysis of orthologous protein domains using the HOPS database.使用HOPS数据库对直系同源蛋白结构域进行综合分析。
Genome Res. 2003 Oct;13(10):2353-62. doi: 10.1101/gr1305203.
9
The origins of modern proteomes.现代蛋白质组的起源。
Biochimie. 2007 Dec;89(12):1454-63. doi: 10.1016/j.biochi.2007.09.004. Epub 2007 Sep 15.
10
A consensus algorithm to screen genomes for novel families of transmembrane beta barrel proteins.一种用于筛选基因组中新型跨膜β桶蛋白家族的共识算法。
Proteins. 2007 Oct 1;69(1):8-18. doi: 10.1002/prot.21439.