• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RAPHAEL:螺线管蛋白结构的识别、周期性和插入分配。

RAPHAEL: recognition, periodicity and insertion assignment of solenoid protein structures.

机构信息

Department of Biology, University of Padua, Viale G. Colombo 3, 35131 Padova, Italy.

出版信息

Bioinformatics. 2012 Dec 15;28(24):3257-64. doi: 10.1093/bioinformatics/bts550. Epub 2012 Sep 8.

DOI:10.1093/bioinformatics/bts550
PMID:22962341
Abstract

MOTIVATION

Repeat proteins form a distinct class of structures where folding is greatly simplified. Several classes have been defined, with solenoid repeats of periodicity between ca. 5 and 40 being the most challenging to detect. Such proteins evolve quickly and their periodicity may be rapidly hidden at sequence level. From a structural point of view, finding solenoids may be complicated by the presence of insertions or multiple domains. To the best of our knowledge, no automated methods are available to characterize solenoid repeats from structure.

RESULTS

Here we introduce RAPHAEL, a novel method for the detection of solenoids in protein structures. It reliably solves three problems of increasing difficulty: (1) recognition of solenoid domains, (2) determination of their periodicity and (3) assignment of insertions. RAPHAEL uses a geometric approach mimicking manual classification, producing several numeric parameters that are optimized for maximum performance. The resulting method is very accurate, with 89.5% of solenoid proteins and 97.2% of non-solenoid proteins correctly classified. RAPHAEL periodicities have a Spearman correlation coefficient of 0.877 against the manually established ones. A baseline algorithm for insertion detection in identified solenoids has a Q(2) value of 79.8%, suggesting room for further improvement. RAPHAEL finds 1931 highly confident repeat structures not previously annotated as solenoids in the Protein Data Bank records.

摘要

动机

重复蛋白质形成了一个独特的结构类别,其中折叠过程大大简化。已经定义了几个类别,其中周期性约为 5 到 40 的螺旋重复是最具挑战性的。这些蛋白质进化迅速,其周期性可能在序列水平上迅速隐藏。从结构的角度来看,找到螺旋可能会因为存在插入或多个结构域而变得复杂。据我们所知,目前还没有自动化的方法可以从结构上描述螺旋重复。

结果

在这里,我们引入了 RAPHAEL,这是一种用于检测蛋白质结构中螺旋的新方法。它可靠地解决了三个越来越困难的问题:(1)识别螺旋结构域,(2)确定其周期性,(3)分配插入。RAPHAEL 使用一种几何方法模拟手动分类,生成几个数字参数,这些参数经过优化,以获得最佳性能。该方法非常准确,89.5%的螺旋蛋白和 97.2%的非螺旋蛋白被正确分类。RAPHAEL 的周期性与手动建立的周期性具有 0.877 的斯皮尔曼相关系数。在已识别的螺旋结构中用于插入检测的基线算法的 Q(2)值为 79.8%,表明还有进一步改进的空间。RAPHAEL 在蛋白质数据库记录中发现了 1931 个以前未注释为螺旋的高度置信度的重复结构。

相似文献

1
RAPHAEL: recognition, periodicity and insertion assignment of solenoid protein structures.RAPHAEL:螺线管蛋白结构的识别、周期性和插入分配。
Bioinformatics. 2012 Dec 15;28(24):3257-64. doi: 10.1093/bioinformatics/bts550. Epub 2012 Sep 8.
2
REPETITA: detection and discrimination of the periodicity of protein solenoid repeats by discrete Fourier transform.REPETITA:通过离散傅里叶变换检测和辨别蛋白质螺线管重复序列的周期性
Bioinformatics. 2009 Jun 15;25(12):i289-95. doi: 10.1093/bioinformatics/btp232.
3
ConSole: using modularity of contact maps to locate solenoid domains in protein structures.ConSole:利用接触图的模块性定位蛋白质结构中的螺旋域。
BMC Bioinformatics. 2014 Apr 27;15:119. doi: 10.1186/1471-2105-15-119.
4
Revealing aperiodic aspects of solenoid proteins from sequence information.从序列信息中揭示螺线管蛋白的非周期性特征。
Bioinformatics. 2016 Sep 15;32(18):2776-82. doi: 10.1093/bioinformatics/btw319. Epub 2016 Jun 9.
5
Solenoid and non-solenoid protein recognition using stationary wavelet packet transform.利用定态小波包变换识别电磁阀和非电磁阀蛋白。
Bioinformatics. 2010 Sep 15;26(18):i467-73. doi: 10.1093/bioinformatics/btq371.
6
Beta-rolls, beta-helices, and other beta-solenoid proteins.β-折叠、β-螺旋及其他β-螺线管蛋白。
Adv Protein Chem. 2006;73:55-96. doi: 10.1016/S0065-3233(06)73003-0.
7
Autoregressive and iterative hidden Markov models for periodicity detection and solenoid structure recognition in protein sequences.用于蛋白质序列周期性检测和螺线管结构识别的自回归和迭代隐马尔可夫模型。
IEEE J Biomed Health Inform. 2013 Mar;17(2):436-41. doi: 10.1109/JBHI.2012.2235852.
8
Identification of repetitive units in protein structures with ReUPred.使用ReUPred鉴定蛋白质结构中的重复单元。
Amino Acids. 2016 Jun;48(6):1391-400. doi: 10.1007/s00726-016-2187-2. Epub 2016 Feb 22.
9
When protein folding is simplified to protein coiling: the continuum of solenoid protein structures.当蛋白质折叠简化为蛋白质卷曲时:螺线管蛋白质结构的连续体。
Trends Biochem Sci. 2000 Oct;25(10):509-15. doi: 10.1016/s0968-0004(00)01667-4.
10
Functional and genomic analyses of alpha-solenoid proteins.α-螺旋蛋白的功能和基因组分析。
PLoS One. 2013 Nov 21;8(11):e79894. doi: 10.1371/journal.pone.0079894. eCollection 2013.

引用本文的文献

1
RepeatsDB in 2025: expanding annotations of structured tandem repeats proteins on AlphaFoldDB.2025年的重复序列数据库:在AlphaFoldDB上扩展结构化串联重复序列蛋白的注释
Nucleic Acids Res. 2025 Jan 6;53(D1):D575-D581. doi: 10.1093/nar/gkae965.
2
Evolutionary Study of Protein Short Tandem Repeats in Protein Families.蛋白质家族中蛋白质短串联重复的进化研究。
Biomolecules. 2023 Jul 13;13(7):1116. doi: 10.3390/biom13071116.
3
Quantitative analysis of visual codewords of a protein distance matrix.蛋白质距离矩阵的视觉码词的定量分析。
PLoS One. 2022 Feb 4;17(2):e0263566. doi: 10.1371/journal.pone.0263566. eCollection 2022.
4
Search for Highly Divergent Tandem Repeats in Amino Acid Sequences.搜索氨基酸序列中的高度变异串联重复。
Int J Mol Sci. 2021 Jul 1;22(13):7096. doi: 10.3390/ijms22137096.
5
RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures.2021 年的 RepeatsDB:改进了蛋白质串联重复结构的数据并扩展了分类。
Nucleic Acids Res. 2021 Jan 8;49(D1):D452-D457. doi: 10.1093/nar/gkaa1097.
6
Novel Descriptors and Digital Signal Processing- Based Method for Protein Sequence Activity Relationship Study.新型描述符和基于数字信号处理的蛋白质序列活性关系研究方法。
Int J Mol Sci. 2019 Nov 11;20(22):5640. doi: 10.3390/ijms20225640.
7
MemSTATS: A Benchmark Set of Membrane Protein Symmetries and Pseudosymmetries.MemSTATS:一个膜蛋白对称和拟对称基准数据集。
J Mol Biol. 2020 Jan 17;432(2):597-604. doi: 10.1016/j.jmb.2019.09.020. Epub 2019 Oct 16.
8
Analyzing the symmetrical arrangement of structural repeats in proteins with CE-Symm.使用 CE-Symm 分析蛋白质结构重复的对称排列。
PLoS Comput Biol. 2019 Apr 22;15(4):e1006842. doi: 10.1371/journal.pcbi.1006842. eCollection 2019 Apr.
9
A machine learning approach for reliable prediction of amino acid interactions and its application in the directed evolution of enantioselective enzymes.一种用于可靠预测氨基酸相互作用的机器学习方法及其在对映选择性酶定向进化中的应用。
Sci Rep. 2018 Nov 13;8(1):16757. doi: 10.1038/s41598-018-35033-y.
10
RepeatsDB 2.0: improved annotation, classification, search and visualization of repeat protein structures.RepeatsDB 2.0:改进了重复蛋白结构的注释、分类、搜索和可视化。
Nucleic Acids Res. 2017 Jan 4;45(D1):D308-D312. doi: 10.1093/nar/gkw1136. Epub 2016 Nov 29.