• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

α-螺旋结构变异的特征与序列预测。

Characterization and sequence prediction of structural variations in α-helix.

机构信息

Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai 600 036, India.

出版信息

BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S20. doi: 10.1186/1471-2105-12-S1-S20.

DOI:10.1186/1471-2105-12-S1-S20
PMID:21342550
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3044275/
Abstract

BACKGROUND

The structure conservation in various α-helix subclasses reveals the sequence and context dependent factors causing distortions in the α-helix. The sequence-structure relationship in these subclasses can be used to predict structural variations in α-helix purely based on its sequence. We train support vector machine(SVM) with dot product kernel function to discriminate between regular α-helix and non-regular α-helices purely based on the sequences, which are represented with various overall and position specific propensities of amino acids.

RESULTS

We characterize the structural distortions in five α-helix subclasses. The sequence structure correlation in the subclasses reveals that the increased propensity of proline, histidine, serine, aspartic acid and aromatic amino acids are responsible for the distortions in regular α-helix. The N-terminus of regular α-helix prefers neutral and acidic polar amino acids, while the C-terminus prefers basic polar amino acid. Proline is preferred in the first turn of regular α-helix, while it is preferred to produce kinked and curved subclasses. The SVM discriminates between regular α-helix and the rest with precision of 80.97% and recall of 88.05%.

CONCLUSIONS

The correlation between structural variation in helices and their sequences is manifested by the performance of SVM based on sequence features. The results presented here are useful for computational design of helices. The results are also useful for prediction of structural perturbations in helix sequence purely based on its sequence.

摘要

背景

各种α-螺旋亚类的结构保守性揭示了导致α-螺旋扭曲的序列和上下文相关因素。这些亚类中的序列-结构关系可用于仅基于序列预测α-螺旋的结构变化。我们使用带有内积核函数的支持向量机(SVM)来区分规则α-螺旋和不规则α-螺旋,这些序列使用各种总体和位置特异性氨基酸倾向来表示。

结果

我们描述了五个α-螺旋亚类的结构扭曲。亚类中的序列结构相关性表明,脯氨酸、组氨酸、丝氨酸、天冬氨酸和芳香族氨基酸的倾向增加是导致规则α-螺旋扭曲的原因。规则α-螺旋的 N 端偏爱中性和酸性极性氨基酸,而 C 端偏爱碱性极性氨基酸。脯氨酸优先出现在规则α-螺旋的第一转,而它优先产生扭曲和弯曲的亚类。SVM 区分规则α-螺旋和其余的精度为 80.97%,召回率为 88.05%。

结论

基于序列特征的 SVM 的性能表明了螺旋结构变化与其序列之间的相关性。这里呈现的结果可用于螺旋的计算设计。这些结果也可用于仅基于序列预测螺旋序列的结构扰动。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff30/3044275/cf487a80bfab/1471-2105-12-S1-S20-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff30/3044275/e97e6410b528/1471-2105-12-S1-S20-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff30/3044275/cf487a80bfab/1471-2105-12-S1-S20-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff30/3044275/e97e6410b528/1471-2105-12-S1-S20-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff30/3044275/cf487a80bfab/1471-2105-12-S1-S20-2.jpg

相似文献

1
Characterization and sequence prediction of structural variations in α-helix.α-螺旋结构变异的特征与序列预测。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S20. doi: 10.1186/1471-2105-12-S1-S20.
2
Gaussian mixture modeling of alpha-helix subclasses: structure and sequence variations.α-螺旋亚类的高斯混合建模:结构与序列变异
Pac Symp Biocomput. 2006:291-302.
3
Structural and sequence characteristics of long alpha helices in globular proteins.球状蛋白质中长α螺旋的结构和序列特征。
Biophys J. 1996 Sep;71(3):1574-86. doi: 10.1016/S0006-3495(96)79360-8.
4
omega-Helices in proteins.蛋白质中的 ω-螺旋。
Protein J. 2010 May;29(4):242-9. doi: 10.1007/s10930-010-9245-5.
5
Amino acid propensities are position-dependent throughout the length of alpha-helices.在整个α螺旋长度上,氨基酸倾向是依赖于位置的。
J Mol Biol. 2004 Apr 9;337(5):1195-205. doi: 10.1016/j.jmb.2004.02.004.
6
Geometrical and sequence characteristics of alpha-helices in globular proteins.球状蛋白质中α螺旋的几何和序列特征。
Biophys J. 1998 Oct;75(4):1935-44. doi: 10.1016/S0006-3495(98)77634-9.
7
Sequence and structure patterns in proteins from an analysis of the shortest helices: implications for helix nucleation.通过对最短螺旋的分析得出的蛋白质序列和结构模式:对螺旋成核的影响
J Mol Biol. 2003 Feb 7;326(1):273-91. doi: 10.1016/s0022-2836(02)01338-4.
8
Amino acid propensities for secondary structures are influenced by the protein structural class.二级结构的氨基酸倾向受蛋白质结构类别的影响。
Biochem Biophys Res Commun. 2006 Apr 7;342(2):441-51. doi: 10.1016/j.bbrc.2006.01.159. Epub 2006 Feb 8.
9
PATSIM: Prediction and analysis of protein sequences using hybrid Knuth-Morris Pratt (KMP) and Boyer-Moore (BM) algorithm.PATSIM:使用混合的克努特-莫里斯-普拉特(KMP)算法和博耶-摩尔(BM)算法对蛋白质序列进行预测和分析。
Gene. 2018 May 30;657:50-59. doi: 10.1016/j.gene.2018.02.069. Epub 2018 Mar 6.
10
SVM-based method for protein structural class prediction using secondary structural content and structural information of amino acids.基于支持向量机的蛋白质结构类预测方法,该方法利用二级结构含量和氨基酸的结构信息。
J Bioinform Comput Biol. 2011 Aug;9(4):489-502. doi: 10.1142/s0219720011005422.

引用本文的文献

1
DNA movies and panspermia.DNA 电影与生源说。
Life (Basel). 2011 Oct 20;1(1):9-18. doi: 10.3390/life1010009.

本文引用的文献

1
Gaussian mixture modeling of alpha-helix subclasses: structure and sequence variations.α-螺旋亚类的高斯混合建模:结构与序列变异
Pac Symp Biocomput. 2006:291-302.
2
A geometric invariant-based framework for the analysis of protein conformational space.一种基于几何不变量的蛋白质构象空间分析框架。
Bioinformatics. 2005 Sep 15;21(18):3622-8. doi: 10.1093/bioinformatics/bti621. Epub 2005 Aug 11.
3
Amino acid propensities are position-dependent throughout the length of alpha-helices.在整个α螺旋长度上,氨基酸倾向是依赖于位置的。
J Mol Biol. 2004 Apr 9;337(5):1195-205. doi: 10.1016/j.jmb.2004.02.004.
4
The spectrum kernel: a string kernel for SVM protein classification.谱核:一种用于支持向量机蛋白质分类的字符串核。
Pac Symp Biocomput. 2002:564-75.
5
Predicting the sub-cellular location of proteins from text using support vector machines.使用支持向量机从文本中预测蛋白质的亚细胞定位。
Pac Symp Biocomput. 2002:374-85. doi: 10.1142/9789812799623_0035.
6
Knowledge-based analysis of microarray gene expression data by using support vector machines.利用支持向量机对微阵列基因表达数据进行基于知识的分析。
Proc Natl Acad Sci U S A. 2000 Jan 4;97(1):262-7. doi: 10.1073/pnas.97.1.262.
7
The ASTRAL compendium for protein structure and sequence analysis.用于蛋白质结构和序列分析的ASTRAL汇编
Nucleic Acids Res. 2000 Jan 1;28(1):254-6. doi: 10.1093/nar/28.1.254.
8
Dissecting alpha-helices: position-specific analysis of alpha-helices in globular proteins.剖析α-螺旋:球状蛋白质中α-螺旋的位置特异性分析。
Proteins. 1998 Jun 1;31(4):460-76. doi: 10.1002/(sici)1097-0134(19980601)31:4<460::aid-prot12>3.0.co;2-d.
9
Design of helix ends. Amino acid preferences, hydrogen bonding and electrostatic interactions.螺旋末端的设计。氨基酸偏好、氢键和静电相互作用。
Int J Pept Protein Res. 1993 May;41(5):499-511.
10
N- and C-capping preferences for all 20 amino acids in alpha-helical peptides.α-螺旋肽中所有20种氨基酸的N端和C端封端偏好性。
Protein Sci. 1995 Jul;4(7):1325-36. doi: 10.1002/pro.5560040708.