• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于蛋白质折叠识别的3D-1D替换矩阵,其包含序列的预测二级结构。

A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence.

作者信息

Rice D W, Eisenberg D

机构信息

UCLA-DOE Laboratory of Structural Biology and Molecular Medicine, Molecular Biology Institute, UCLA, Los Angeles, CA 90095-1570, USA.

出版信息

J Mol Biol. 1997 Apr 11;267(4):1026-38. doi: 10.1006/jmbi.1997.0924.

DOI:10.1006/jmbi.1997.0924
PMID:9135128
Abstract

In protein fold recognition, a probe amino acid sequence is compared to a library of representative folds of known structure to identify a structural homolog. In cases where the probe and its homolog have clear sequence similarity, traditional residue substitution matrices have been used to predict the structural similarity. In cases where the probe is sequentially distant from its homolog, we have developed a (7 x 3 x 2 x 7 x 3) 3D-1D substitution matrix (called H3P2), calculated from a database of 119 structural pairs. Members of each pair share a similar fold, but have sequence identity less than 30%. Each probe sequence position is defined by one of seven residue classes and three secondary structure classes. Each homologous fold position is defined by one of seven residue classes, three secondary structure classes, and two burial classes. Thus the matrix is five-dimensional and contains 7 x 3 x 2 x 7 x 3 = 882 elements or 3D-1D scores. The first step in assigning a probe sequence to its homologous fold is the prediction of the three-state (helix, strand, coil) secondary structure of the probe; here we use the profile based neural network prediction of secondary structure (PHD) program. Then a dynamic programming algorithm uses the H3P2 matrix to align the probe sequence with structures in a representative fold library. To test the effectiveness of the H3P2 matrix a challenging, fold class diverse, and cross-validated benchmark assessment is used to compare the H3P2 matrix to the GONNET, PAM250, BLOSUM62 and a secondary structure only substitution matrix. For distantly related sequences the H3P2 matrix detects more homologous structures at higher reliabilities than do these other substitution matrices, based on sensitivity versus specificity plots (or SENS-SPEC plots). The added efficacy of the H3P2 matrix arises from its information on the statistical preferences for various sequence-structure environment combinations from very distantly related proteins. It introduces the predicted secondary structure information from a sequence into fold recognition in a statistical way that normalizes the inherent correlations between residue type, secondary structure and solvent accessibility.

摘要

在蛋白质折叠识别中,将一条探测氨基酸序列与已知结构的代表性折叠文库进行比较,以识别结构同源物。在探测序列与其同源物具有明显序列相似性的情况下,传统的残基替换矩阵已被用于预测结构相似性。在探测序列与其同源物在序列上距离较远的情况下,我们开发了一种(7×3×2×7×3)三维-一维替换矩阵(称为H3P2),它是根据119个结构对的数据库计算得出的。每对结构的成员具有相似的折叠,但序列同一性小于30%。每个探测序列位置由七种残基类别和三种二级结构类别之一定义。每个同源折叠位置由七种残基类别、三种二级结构类别和两种埋藏类别之一定义。因此,该矩阵是五维的,包含7×3×2×7×3 = 882个元素或三维-一维得分。将探测序列与其同源折叠进行匹配的第一步是预测探测序列的三态(螺旋、链、无规卷曲)二级结构;这里我们使用基于轮廓的神经网络二级结构预测(PHD)程序。然后,一种动态规划算法使用H3P2矩阵将探测序列与代表性折叠文库中的结构进行比对。为了测试H3P2矩阵的有效性,使用了一个具有挑战性、折叠类别多样且经过交叉验证的基准评估,将H3P2矩阵与GONNET、PAM250、BLOSUM62以及仅基于二级结构的替换矩阵进行比较。对于远缘相关序列,基于敏感性与特异性图(或SENS-SPEC图),H3P2矩阵比其他这些替换矩阵能以更高的可靠性检测到更多的同源结构。H3P2矩阵额外的有效性源于其关于来自远缘相关蛋白质的各种序列-结构环境组合的统计偏好信息。它以一种统计方式将来自序列的预测二级结构信息引入折叠识别,从而对残基类型、二级结构和溶剂可及性之间的内在相关性进行归一化。

相似文献

1
A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence.一种用于蛋白质折叠识别的3D-1D替换矩阵,其包含序列的预测二级结构。
J Mol Biol. 1997 Apr 11;267(4):1026-38. doi: 10.1006/jmbi.1997.0924.
2
Protein fold recognition by prediction-based threading.基于预测穿线法的蛋白质折叠识别
J Mol Biol. 1997 Jul 18;270(3):471-80. doi: 10.1006/jmbi.1997.1101.
3
DPANN: improved sequence to structure alignments following fold recognition.DPANN:折叠识别后改进的序列到结构比对。
Proteins. 2004 Aug 15;56(3):528-38. doi: 10.1002/prot.20144.
4
Use of residue pairs in protein sequence-sequence and sequence-structure alignments.残基对在蛋白质序列-序列和序列-结构比对中的应用。
Protein Sci. 2000 Aug;9(8):1576-88. doi: 10.1110/ps.9.8.1576.
5
The 1.7 A crystal structure of BPI: a study of how two dissimilar amino acid sequences can adopt the same fold.杀菌/通透性增加蛋白(BPI)的1.7埃晶体结构:关于两种不同氨基酸序列如何能形成相同折叠方式的研究。
J Mol Biol. 2000 Jun 16;299(4):1019-34. doi: 10.1006/jmbi.2000.3805.
6
Alignment and searching for common protein folds using a data bank of structural templates.利用结构模板数据库进行比对并寻找常见蛋白质折叠。
J Mol Biol. 1993 Jun 5;231(3):735-52. doi: 10.1006/jmbi.1993.1323.
7
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
8
Protein fold recognition using sequence-derived predictions.利用序列衍生预测进行蛋白质折叠识别。
Protein Sci. 1996 May;5(5):947-55. doi: 10.1002/pro.5560050516.
9
Protein structure mining using a structural alphabet.使用结构字母表进行蛋白质结构挖掘。
Proteins. 2008 May 1;71(2):920-37. doi: 10.1002/prot.21776.
10
Protein fold recognition by mapping predicted secondary structures.通过映射预测的二级结构进行蛋白质折叠识别。
J Mol Biol. 1996 Jun 14;259(3):349-65. doi: 10.1006/jmbi.1996.0325.

引用本文的文献

1
Exploring amino acid functions in a deep mutational landscape.探索深度突变景观中的氨基酸功能。
Mol Syst Biol. 2021 Jul;17(7):e10305. doi: 10.15252/msb.202110305.
2
Substitution scoring matrices for proteins - An overview.蛋白质替换评分矩阵——概述。
Protein Sci. 2020 Nov;29(11):2150-2163. doi: 10.1002/pro.3954. Epub 2020 Oct 12.
3
Comparative and evolutionary analyses of the divergence of plant oligosaccharyltransferase STT3 isoforms.植物寡糖基转移酶STT3亚型差异的比较与进化分析
FEBS Open Bio. 2020 Mar;10(3):468-483. doi: 10.1002/2211-5463.12804. Epub 2020 Feb 19.
4
From local structure to a global framework: recognition of protein folds.从局部结构到全局框架:蛋白质折叠的识别
J R Soc Interface. 2014 Apr 16;11(95):20131147. doi: 10.1098/rsif.2013.1147. Print 2014 Jun 6.
5
Improvement in low-homology template-based modeling by employing a model evaluation method with focus on topology.通过采用侧重于拓扑结构的模型评估方法改进基于低同源性模板的建模。
PLoS One. 2014 Feb 26;9(2):e89935. doi: 10.1371/journal.pone.0089935. eCollection 2014.
6
Incorporation of local structural preference potential improves fold recognition.局部结构偏好势的纳入提高了折叠识别的性能。
PLoS One. 2011 Feb 18;6(2):e17215. doi: 10.1371/journal.pone.0017215.
7
Aligning protein sequence and analysing substitution pattern using a class-specific matrix.使用特定类别矩阵对齐蛋白质序列并分析取代模式。
J Biosci. 2010 Jun;35(2):295-314. doi: 10.1007/s12038-010-0033-3.
8
(PS)2-v2: template-based protein structure prediction server.(PS)2-v2:基于模板的蛋白质结构预测服务器。
BMC Bioinformatics. 2009 Oct 31;10:366. doi: 10.1186/1471-2105-10-366.
9
Sequence context-specific profiles for homology searching.用于同源性搜索的序列上下文特定概况。
Proc Natl Acad Sci U S A. 2009 Mar 10;106(10):3770-5. doi: 10.1073/pnas.0810767106. Epub 2009 Feb 20.
10
Improved scoring function for comparative modeling using the M4T method.使用M4T方法改进比较建模的评分函数。
J Struct Funct Genomics. 2009 Mar;10(1):95-9. doi: 10.1007/s10969-008-9044-9. Epub 2008 Nov 5.