Suppr超能文献

对蛋白质特征序列进行系统搜索。

A systematic search for protein signature sequences.

作者信息

Sheridan R P, Venkataraghavan R

机构信息

Medical Research Division, Lederle Laboratories, American Cyanamid Corp., Pearl River, New York 10965.

出版信息

Proteins. 1992 Sep;14(1):16-28. doi: 10.1002/prot.340140105.

Abstract

Signature sequences are contiguous patterns of amino acids 10-50 residues long that are associated with a particular structure or function in proteins. These may be of three types (by our nomenclature): superfamily signatures, remnant homologies, and motifs. We have performed a systematic search through a database of protein sequences to automatically and preferentially find remnant homologies and motifs. This was accomplished in three steps: 1. We generated a nonredundant sequence database. 2. We used BLAST3 (Altschul and Lipman, Proc. Natl. Acad. Sci. U.S.A. 87:5509-5513, 1990) to generate local pairwise and triplet sequence alignments for every protein in the database vs. every other. 3. We selected "interesting" alignments and grouped them into clusters. We find that most of the clusters contain segments from proteins which share a common structure or function. Many of them correspond to signatures previously noted in the literature. We discuss three previously recognized motifs in detail (FAD/NAD-binding, ATP/GTP-binding, and cytochrome b5-like domains) to demonstrate how the alignments generated by our procedure are consistent with previous work and make structural and functional sense. We also discuss two signatures (for N-acetyltransferases and glycerol-phosphate binding) which to our knowledge have not been previously recognized.

摘要

特征序列是长度为10 - 50个残基的连续氨基酸模式,与蛋白质中的特定结构或功能相关。根据我们的命名法,这些特征序列可分为三种类型:超家族特征序列、残余同源序列和基序。我们通过蛋白质序列数据库进行了系统搜索,以自动且优先地找到残余同源序列和基序。这一过程分三步完成:1. 我们生成了一个非冗余序列数据库。2. 我们使用BLAST3(阿尔茨舒尔和利普曼,《美国国家科学院院刊》87:5509 - 5513,1990)为数据库中的每个蛋白质与其他所有蛋白质生成局部成对和三联体序列比对。3. 我们选择“有趣的”比对并将它们分组为簇。我们发现大多数簇包含来自具有共同结构或功能的蛋白质的片段。其中许多与文献中先前指出的特征序列相对应。我们详细讨论了三个先前已识别的基序(FAD/NAD结合、ATP/GTP结合和细胞色素b5样结构域),以说明我们的程序生成的比对如何与先前的工作一致并具有结构和功能意义。我们还讨论了两个据我们所知先前未被识别的特征序列(用于N - 乙酰转移酶和甘油磷酸结合)。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验