• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用ReUPred鉴定蛋白质结构中的重复单元。

Identification of repetitive units in protein structures with ReUPred.

作者信息

Hirsh Layla, Piovesan Damiano, Paladin Lisanna, Tosatto Silvio C E

机构信息

Department of Biomedical Sciences, University of Padua, Padua, Italy.

Department of Engineering, Pontificia Universidad Católica del Perú, Lima, Perú

出版信息

Amino Acids. 2016 Jun;48(6):1391-400. doi: 10.1007/s00726-016-2187-2. Epub 2016 Feb 22.

DOI:10.1007/s00726-016-2187-2
PMID:26898549
Abstract

Over the last decade, numerous studies have demonstrated the fundamental importance of tandem repeat (TR) proteins in many biological processes. A plethora of new repeat structures have also been solved. The recently published RepeatsDB provides information on TR proteins. However, a detailed structural characterization of repetitive elements is largely missing, as repeat unit annotation is manually curated and currently covers only 3 % of the bona fide TR proteins. Repeat Protein Unit Predictor (ReUPred) is a novel method for the fast automatic prediction of repeat units and repeat classification using an extensive Structure Repeat Unit Library (SRUL) derived from RepeatsDB. ReUPred uses an iterative structural search against the SRUL to find repetitive units. On a test set of solenoid proteins, ReUPred is able to correctly detect 92 % of the proteins. Unlike previous methods, it is also able to correctly classify solenoid repeats in 89 % of cases. It also outperforms two recent state-of-the-art methods for the repeat unit identification problem. The accurate prediction of repeat units increases the number of annotated repeat units by an order of magnitude compared to the sequence-based Pfam classification. ReUPred is implemented in Python for Linux and freely available from the URL: http://protein.bio.unipd.it/reupred/ .

摘要

在过去十年中,大量研究已证明串联重复(TR)蛋白在许多生物过程中具有至关重要的作用。众多新的重复结构也已得到解析。最近发布的RepeatsDB提供了有关TR蛋白的信息。然而,由于重复单元注释是人工整理的,目前仅涵盖3%的真正TR蛋白,因此对重复元件的详细结构表征在很大程度上缺失。重复蛋白单元预测器(ReUPred)是一种新颖的方法,它使用从RepeatsDB衍生而来的广泛的结构重复单元库(SRUL),快速自动预测重复单元并进行重复分类。ReUPred对SRUL进行迭代结构搜索以找到重复单元。在一组螺线管蛋白测试集中,ReUPred能够正确检测出92%的蛋白。与先前的方法不同,它在89%的情况下也能够正确分类螺线管重复。在重复单元识别问题上,它也优于最近的两种最先进方法。与基于序列的Pfam分类相比,重复单元的准确预测使注释的重复单元数量增加了一个数量级。ReUPred用Python为Linux实现,可从以下网址免费获取:http://protein.bio.unipd.it/reupred/ 。

相似文献

1
Identification of repetitive units in protein structures with ReUPred.使用ReUPred鉴定蛋白质结构中的重复单元。
Amino Acids. 2016 Jun;48(6):1391-400. doi: 10.1007/s00726-016-2187-2. Epub 2016 Feb 22.
2
RepeatsDB 2.0: improved annotation, classification, search and visualization of repeat protein structures.RepeatsDB 2.0:改进了重复蛋白结构的注释、分类、搜索和可视化。
Nucleic Acids Res. 2017 Jan 4;45(D1):D308-D312. doi: 10.1093/nar/gkw1136. Epub 2016 Nov 29.
3
RepeatsDB-lite: a web server for unit annotation of tandem repeat proteins.RepeatsDB-lite:串联重复蛋白单位注释的网络服务器。
Nucleic Acids Res. 2018 Jul 2;46(W1):W402-W407. doi: 10.1093/nar/gky360.
4
RepeatsDB: a database of tandem repeat protein structures.RepeatsDB:串联重复蛋白结构数据库。
Nucleic Acids Res. 2014 Jan;42(Database issue):D352-7. doi: 10.1093/nar/gkt1175. Epub 2013 Dec 5.
5
Comparison of protein repeat classifications based on structure and sequence families.基于结构和序列家族的蛋白质重复分类比较。
Biochem Soc Trans. 2015 Oct;43(5):832-7. doi: 10.1042/BST20150079.
6
RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures.2021 年的 RepeatsDB:改进了蛋白质串联重复结构的数据并扩展了分类。
Nucleic Acids Res. 2021 Jan 8;49(D1):D452-D457. doi: 10.1093/nar/gkaa1097.
7
REPETITA: detection and discrimination of the periodicity of protein solenoid repeats by discrete Fourier transform.REPETITA:通过离散傅里叶变换检测和辨别蛋白质螺线管重复序列的周期性
Bioinformatics. 2009 Jun 15;25(12):i289-95. doi: 10.1093/bioinformatics/btp232.
8
Daisy: An integrated repeat protein curation service.黛西:一个综合的重复蛋白注释服务。
J Struct Biol. 2023 Dec;215(4):108033. doi: 10.1016/j.jsb.2023.108033. Epub 2023 Oct 3.
9
Identifying tandem Ankyrin repeats in protein structures.在蛋白质结构中识别串联锚蛋白重复序列。
BMC Bioinformatics. 2014 Dec 30;15(1):6599. doi: 10.1186/s12859-014-0440-9.
10
PRIGSA: protein repeat identification by graph spectral analysis.PRIGSA:通过图谱谱分析进行蛋白质重复序列鉴定
J Bioinform Comput Biol. 2014 Dec;12(6):1442009. doi: 10.1142/S0219720014420098.

引用本文的文献

1
Quantitative analysis of visual codewords of a protein distance matrix.蛋白质距离矩阵的视觉码词的定量分析。
PLoS One. 2022 Feb 4;17(2):e0263566. doi: 10.1371/journal.pone.0263566. eCollection 2022.
2
DbStRiPs: Database of structural repeats in proteins.DbStRiPs:蛋白质结构重复数据库。
Protein Sci. 2022 Jan;31(1):23-36. doi: 10.1002/pro.4052. Epub 2021 Mar 6.
3
RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures.2021 年的 RepeatsDB:改进了蛋白质串联重复结构的数据并扩展了分类。
Nucleic Acids Res. 2021 Jan 8;49(D1):D452-D457. doi: 10.1093/nar/gkaa1097.
4
MemSTATS: A Benchmark Set of Membrane Protein Symmetries and Pseudosymmetries.MemSTATS:一个膜蛋白对称和拟对称基准数据集。
J Mol Biol. 2020 Jan 17;432(2):597-604. doi: 10.1016/j.jmb.2019.09.020. Epub 2019 Oct 16.
5
Analyzing the symmetrical arrangement of structural repeats in proteins with CE-Symm.使用 CE-Symm 分析蛋白质结构重复的对称排列。
PLoS Comput Biol. 2019 Apr 22;15(4):e1006842. doi: 10.1371/journal.pcbi.1006842. eCollection 2019 Apr.
6
Secreted Cysteine-Rich Repeat Proteins "SCREPs": A Novel Multi-Domain Architecture.分泌型富含半胱氨酸重复蛋白(SCREPs):一种新型多结构域架构
Front Pharmacol. 2018 Nov 20;9:1333. doi: 10.3389/fphar.2018.01333. eCollection 2018.
7
RepeatsDB-lite: a web server for unit annotation of tandem repeat proteins.RepeatsDB-lite:串联重复蛋白单位注释的网络服务器。
Nucleic Acids Res. 2018 Jul 2;46(W1):W402-W407. doi: 10.1093/nar/gky360.
8
RepeatsDB 2.0: improved annotation, classification, search and visualization of repeat protein structures.RepeatsDB 2.0:改进了重复蛋白结构的注释、分类、搜索和可视化。
Nucleic Acids Res. 2017 Jan 4;45(D1):D308-D312. doi: 10.1093/nar/gkw1136. Epub 2016 Nov 29.