• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于搜索模块数据库的蛋白质家族分类。

Protein family classification based on searching a database of blocks.

作者信息

Henikoff S, Henikoff J G

机构信息

Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, Washington 98104.

出版信息

Genomics. 1994 Jan 1;19(1):97-107. doi: 10.1006/geno.1994.1018.

DOI:10.1006/geno.1994.1018
PMID:8188249
Abstract

The most highly conserved regions of proteins can be represented as "blocks" of locally aligned sequence segments. Previously, an automated system was introduced to generate a database of blocks that is searched for local similarities using a sequence query. Here, we describe a method for searching this database that can also reveal significant global similarities. Local and global alignments are scored independently, so they can be used in concert to infer homology. A set of 7082 diverse sequences not represented in the database provided queries for testing this approach. The resulting distributions of scores led to guidelines for interpretation of search data and to the classification of 289 uncatalogued sequences into known groups. Thirty-eight of these relationships appear to be new discoveries. We also show how searching a database of blocks can be used to detect repeated domains and to find distinct cross-family relationships that were missed in searches of sequence databases.

摘要

蛋白质中保守性最高的区域可以表示为局部比对序列片段的“模块”。此前,已引入一个自动化系统来生成一个模块数据库,该数据库可通过序列查询来搜索局部相似性。在此,我们描述一种搜索该数据库的方法,该方法还能揭示显著的全局相似性。局部比对和全局比对分别计分,因此它们可以协同使用以推断同源性。一组未包含在数据库中的7082个不同序列为测试该方法提供了查询序列。所得的分数分布为搜索数据的解释提供了指导方针,并将289个未分类序列分类到已知组中。其中38种关系似乎是新发现。我们还展示了如何通过搜索模块数据库来检测重复结构域,并找到在序列数据库搜索中遗漏的不同家族间的关系。

相似文献

1
Protein family classification based on searching a database of blocks.基于搜索模块数据库的蛋白质家族分类。
Genomics. 1994 Jan 1;19(1):97-107. doi: 10.1006/geno.1994.1018.
2
A sequence property approach to searching protein databases.一种用于搜索蛋白质数据库的序列属性方法。
J Mol Biol. 1995 Aug 18;251(3):390-9. doi: 10.1006/jmbi.1995.0442.
3
FASTA-SWAP and FASTA-PAT: pattern database searches using combinations of aligned amino acids, and a novel scoring theory.FASTA-SWAP和FASTA-PAT:使用比对氨基酸组合进行模式数据库搜索以及一种新颖的评分理论。
J Mol Biol. 1996 Jun 21;259(4):840-54. doi: 10.1006/jmbi.1996.0362.
4
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
5
Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searches.通过迭代数据库搜索获取有关蛋白质的重要结构、功能和进化信息。
J Mol Biol. 1999 Apr 16;287(5):1023-40. doi: 10.1006/jmbi.1999.2653.
6
GeneAssist. Smith-Waterman and other database similarity searches and identification of motifs.基因辅助。史密斯-沃特曼算法及其他数据库相似性搜索与基序识别。
Methods Mol Biol. 1997;70:173-87.
7
Modular arrangement of proteins as inferred from analysis of homology.从同源性分析推断出的蛋白质模块化排列。
Protein Sci. 1994 Mar;3(3):482-92. doi: 10.1002/pro.5560030314.
8
Homology-based method for identification of protein repeats using statistical significance estimates.基于同源性的蛋白质重复序列识别方法及其统计学显著性估计
J Mol Biol. 2000 May 5;298(3):521-37. doi: 10.1006/jmbi.2000.3684.
9
A novel sequence similarity searching and visualization method based on overlappingly translated nucleic acids: the blastNP.一种基于重叠翻译核酸的新型序列相似性搜索与可视化方法:blastNP。
Med Hypotheses. 2004;62(4):568-74. doi: 10.1016/j.mehy.2003.11.020.
10
A tool for analyzing and annotating genomic sequences.一种用于分析和注释基因组序列的工具。
Genomics. 1997 Nov 15;46(1):37-45. doi: 10.1006/geno.1997.4984.

引用本文的文献

1
Remote homology search with hidden Potts models.使用隐式 Potts 模型进行远程同源搜索。
PLoS Comput Biol. 2020 Nov 30;16(11):e1008085. doi: 10.1371/journal.pcbi.1008085. eCollection 2020 Nov.
2
Evolutionary history of the human multigene families reveals widespread gene duplications throughout the history of animals.人类多基因家族的进化历史揭示了动物历史上广泛的基因重复。
BMC Evol Biol. 2019 Jun 20;19(1):128. doi: 10.1186/s12862-019-1441-0.
3
Identification of a novel potassium channel (GiK) as a potential drug target in : Computational descriptions of binding sites.
鉴定一种新型钾通道(GiK)作为潜在药物靶点:结合位点的计算描述
PeerJ. 2019 Feb 27;7:e6430. doi: 10.7717/peerj.6430. eCollection 2019.
4
Determinants of Base-Pair Substitution Patterns Revealed by Whole-Genome Sequencing of DNA Mismatch Repair Defective .全基因组测序揭示的 DNA 错配修复缺陷. 的碱基对替换模式的决定因素
Genetics. 2018 Aug;209(4):1029-1042. doi: 10.1534/genetics.118.301237. Epub 2018 Jun 15.
5
The Protein Data Bank: Current Status and Future Challenges.蛋白质数据库:现状与未来挑战。
J Res Natl Inst Stand Technol. 1996 May-Jun;101(3):231-241. doi: 10.6028/jres.101.025.
6
Determinants of spontaneous mutation in the bacterium Escherichia coli as revealed by whole-genome sequencing.通过全基因组测序揭示的大肠杆菌自发突变的决定因素。
Proc Natl Acad Sci U S A. 2015 Nov 3;112(44):E5990-9. doi: 10.1073/pnas.1512136112. Epub 2015 Oct 12.
7
Cloning, expression and characterization of glycerol dehydrogenase involved in 2,3-butanediol formation in Serratia marcescens H30.在粘质沙雷氏菌 H30 中参与 2,3-丁二醇形成的甘油脱氢酶的克隆、表达和特性研究。
J Ind Microbiol Biotechnol. 2014 Sep;41(9):1319-27. doi: 10.1007/s10295-014-1472-x. Epub 2014 Jul 1.
8
Interaction of the heterotrimeric G protein alpha subunit SSG-1 of Sporothrix schenckii with proteins related to stress response and fungal pathogenicity using a yeast two-hybrid assay.利用酵母双杂交试验研究申克孢子丝菌异三聚体 G 蛋白α亚基 SSG-1 与应激反应和真菌致病性相关蛋白的相互作用。
BMC Microbiol. 2010 Dec 9;10:317. doi: 10.1186/1471-2180-10-317.
9
A statistical model of protein sequence similarity and function similarity reveals overly-specific function predictions.蛋白质序列相似性和功能相似性的统计模型揭示了过于具体的功能预测。
PLoS One. 2009 Oct 21;4(10):e7546. doi: 10.1371/journal.pone.0007546.
10
Novel features of the polysaccharide-digesting gliding bacterium Flavobacterium johnsoniae as revealed by genome sequence analysis.基因组序列分析揭示了多糖消化滑行细菌黄杆菌的新特征。
Appl Environ Microbiol. 2009 Nov;75(21):6864-75. doi: 10.1128/AEM.01495-09. Epub 2009 Aug 28.