RepeatsDB: a database of tandem repeat protein structures.

Affiliations

Department of Biomedical Sciences, University of Padua, 35131 Padova, Italy; Department of Biological Chemistry, Universidad de Buenos Aires, Buenos Aires C1428EGA, Argentina; Department of Information Engineering, University of Padua, 35121 Padova, Italy; Department of Biosciences, COMSATS Institute of Information Technology, Sahiwal, Pakistan; Centre de Recherches de Biochimie Macromoléculaire, CNRS, 34293 Montpellier Cedex 5, France; and Institut de Biologie Computationnelle, 34293 Montpellier Cedex 5, France.

Publication information

Nucleic Acids Res. 2014 Jan;42(Database issue):D352-7. doi: 10.1093/nar/gkt1175. Epub 2013 Dec 5.

DOI: 10.1093/nar/gkt1175
PMID: 24311564
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3964956/

Abstract

RepeatsDB (http://repeatsdb.bio.unipd.it/) is a database of annotated tandem repeat protein structures. Tandem repeats pose a difficult problem for the analysis of protein structures, as the underlying sequence can be highly degenerate. Several repeat types have been studied over the years, but their annotation was done on a case-by-case basis, making large-scale analysis difficult. We developed RepeatsDB to fill this gap. Using state-of-the-art repeat detection methods and manual curation, we systematically annotated the Protein Data Bank, predicting 10,745 repeat structures. In all, 2,797 structures were classified according to a recently proposed classification schema, which was expanded to accommodate new findings. In addition, detailed annotations were performed on a subset of 321 proteins. These annotations record the start and end positions of the repeat regions and units. RepeatsDB is an ongoing effort to systematically classify and annotate structural protein repeats in a consistent way. It lets users access and download high-quality datasets either interactively or programmatically through web services.
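
The abstract describes detailed annotations as start and end positions for repeat regions and their units. As a minimal sketch of how such an annotation might be held in memory, the Python below uses a hypothetical `RepeatRegion` class. The class name, its fields, and the coordinates are assumptions made for illustration; they are not the RepeatsDB schema or its web-service format.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class RepeatRegion:
    """Hypothetical record: one repeat region and the units inside it."""
    start: int  # first residue of the region (inclusive)
    end: int    # last residue of the region (inclusive)
    units: List[Tuple[int, int]] = field(default_factory=list)  # (start, end) per unit

    def validate(self) -> bool:
        """Check that units are ordered, non-overlapping, and inside the region."""
        previous_end = self.start - 1
        for unit_start, unit_end in self.units:
            if unit_start > unit_end or unit_start <= previous_end or unit_end > self.end:
                return False
            previous_end = unit_end
        return self.start <= self.end

    def unit_lengths(self) -> List[int]:
        """Length in residues of each unit, counting both endpoints."""
        return [unit_end - unit_start + 1 for unit_start, unit_end in self.units]


# Made-up coordinates for illustration only, not taken from RepeatsDB.
region = RepeatRegion(start=10, end=109, units=[(10, 42), (43, 75), (76, 109)])
print(region.validate(), region.unit_lengths())
```

Storing units as inclusive residue ranges keeps the sketch close to the "start and end positions" wording of the abstract, and the validation step allows gaps between consecutive units while rejecting overlaps.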

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/499d/3964956/531c5db42221/gkt1175f1p.jpg
