• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

低复杂度区域对蛋白质结构测定的影响。

Effect of low-complexity regions on protein structure determination.

作者信息

Bannen Ryan M, Bingman Craig A, Phillips George N

机构信息

Department of Biochemistry, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI 53711, USA.

出版信息

J Struct Funct Genomics. 2007 Dec;8(4):217-26. doi: 10.1007/s10969-008-9039-6. Epub 2008 Feb 27.

DOI:10.1007/s10969-008-9039-6
PMID:18302007
Abstract

It has been previously shown that protein sequences containing a quasi-repetitive assortment of amino acids are common in genomes and databases such as Swiss-Prot but are under-represented in the structure-based Protein Data Bank (PDB). Structural genomics groups have been using the absence of these "low-complexity" sequences for several years as a way to select proteins that have a good chance of successful structure determination. In this study, we examine the data deposited in the PDB as well as the available data from structural genomics groups in TargetDB and PepcDB to reveal interesting trends that could be taken into consideration when using low-complexity sequences as part of the target selection process.

摘要

先前的研究表明,含有准重复氨基酸排列的蛋白质序列在基因组和诸如瑞士蛋白质数据库(Swiss-Prot)等数据库中很常见,但在基于结构的蛋白质数据库(PDB)中代表性不足。结构基因组学团队多年来一直利用这些“低复杂性”序列的缺失来选择有很大机会成功确定结构的蛋白质。在本研究中,我们检查了PDB中存入的数据以及TargetDB和PepcDB中结构基因组学团队的现有数据,以揭示在将低复杂性序列用作目标选择过程的一部分时可考虑的有趣趋势。

相似文献

1
Effect of low-complexity regions on protein structure determination.低复杂度区域对蛋白质结构测定的影响。
J Struct Funct Genomics. 2007 Dec;8(4):217-26. doi: 10.1007/s10969-008-9039-6. Epub 2008 Feb 27.
2
TargetDB: a target registration database for structural genomics projects.TargetDB:一个用于结构基因组学项目的靶点登记数据库。
Bioinformatics. 2004 Nov 1;20(16):2860-2. doi: 10.1093/bioinformatics/bth300. Epub 2004 May 6.
3
Intrinsic disorder in the Protein Data Bank.蛋白质数据库中的内在无序状态。
J Biomol Struct Dyn. 2007 Feb;24(4):325-42. doi: 10.1080/07391102.2007.10507123.
4
A graphical approach to tracking and reporting target status in structural genomics.一种用于在结构基因组学中跟踪和报告目标状态的图形化方法。
J Struct Funct Genomics. 2007 Dec;8(4):209-16. doi: 10.1007/s10969-007-9037-0. Epub 2008 Jan 31.
5
SSMap: a new UniProt-PDB mapping resource for the curation of structural-related information in the UniProt/Swiss-Prot Knowledgebase.SSMap:一种用于在UniProt/Swiss-Prot知识库中整理结构相关信息的新型UniProt-PDB映射资源。
BMC Bioinformatics. 2008 Sep 23;9:391. doi: 10.1186/1471-2105-9-391.
6
Identification of putative domain linkers by a neural network - application to a large sequence database.通过神经网络识别假定的结构域连接子——应用于大型序列数据库
BMC Bioinformatics. 2006 Jun 27;7:323. doi: 10.1186/1471-2105-7-323.
7
PSST-2.0: Protein Data Bank Sequence Search Tool.PSST-2.0:蛋白质数据库序列搜索工具。
Appl Bioinformatics. 2005;4(2):141-5. doi: 10.2165/00822942-200504020-00008.
8
Using Dali for structural comparison of proteins.使用Dali进行蛋白质的结构比较。
Curr Protoc Bioinformatics. 2006 Jul;Chapter 5:Unit 5.5. doi: 10.1002/0471250953.bi0505s14.
9
ProtBuD: a database of biological unit structures of protein families and superfamilies.ProtBuD:蛋白质家族和超家族的生物单元结构数据库。
Bioinformatics. 2006 Dec 1;22(23):2876-82. doi: 10.1093/bioinformatics/btl490. Epub 2006 Oct 2.
10
GENIUS II: a high-throughput database system for linking ORFs in complete genomes to known protein three-dimensional structures.GENIUS II:一个用于将完整基因组中的开放阅读框与已知蛋白质三维结构相链接的高通量数据库系统。
Bioinformatics. 2004 Mar 1;20(4):596-8. doi: 10.1093/bioinformatics/btg478. Epub 2004 Jan 29.

引用本文的文献

1
Protein G-quadruplex interactions and their effects on phase transitions and protein aggregation.蛋白 G-四链体相互作用及其对相变和蛋白质聚集的影响。
Nucleic Acids Res. 2024 May 8;52(8):4702-4722. doi: 10.1093/nar/gkae229.
2
Protein G-quadruplex interactions and their effects on phase transitions and protein aggregation.蛋白质G-四链体相互作用及其对相变和蛋白质聚集的影响。
bioRxiv. 2024 Mar 20:2023.09.21.558871. doi: 10.1101/2023.09.21.558871.
3
Glycogen Synthase Kinase-3 Isoform Variants and Their Inhibitory Phosphorylation in Human Testes and Spermatozoa.

本文引用的文献

1
Addressing the intrinsic disorder bottleneck in structural proteomics.解决结构蛋白质组学中的内在无序瓶颈问题。
Proteins. 2005 May 15;59(3):444-53. doi: 10.1002/prot.20446.
2
Protein biophysical properties that correlate with crystallization success in Thermotoga maritima: maximum clustering strategy for structural genomics.与嗜热栖热菌结晶成功相关的蛋白质生物物理特性:结构基因组学的最大聚类策略。
J Mol Biol. 2004 Dec 3;344(4):977-91. doi: 10.1016/j.jmb.2004.09.076.
3
A new algorithm for detecting low-complexity regions in protein sequences.
糖原合酶激酶-3亚型变体及其在人类睾丸和精子中的抑制性磷酸化
World J Mens Health. 2023 Jan;41(1):215-226. doi: 10.5534/wjmh.220108. Epub 2022 Aug 16.
4
Understanding and identifying amino acid repeats.理解和识别氨基酸重复序列。
Brief Bioinform. 2014 Jul;15(4):582-91. doi: 10.1093/bib/bbt003.
5
The first missense mutation of NHS gene in a Tunisian family with clinical features of NHS syndrome including cardiac anomaly.NHS 基因在一个具有 NHS 综合征临床特征(包括心脏异常)的突尼斯家族中的首个错义突变。
Eur J Hum Genet. 2011 Aug;19(8):851-6. doi: 10.1038/ejhg.2011.52. Epub 2011 May 11.
6
Structural biology of TRP channels.TRP 通道的结构生物学。
Adv Exp Med Biol. 2011;704:1-23. doi: 10.1007/978-94-007-0265-3_1.
7
Genome-wide evidence for selection acting on single amino acid repeats.全基因组证据表明选择作用于单个氨基酸重复。
Genome Res. 2010 Jun;20(6):755-60. doi: 10.1101/gr.101246.109. Epub 2010 Jan 7.
8
The Center for Eukaryotic Structural Genomics.真核生物结构基因组学中心
J Struct Funct Genomics. 2009 Apr;10(2):165-79. doi: 10.1007/s10969-008-9057-4. Epub 2009 Jan 8.
一种用于检测蛋白质序列中低复杂性区域的新算法。
Bioinformatics. 2005 Jan 15;21(2):160-70. doi: 10.1093/bioinformatics/bth497. Epub 2004 Aug 27.
4
Mining the structural genomics pipeline: identification of protein properties that affect high-throughput experimental analysis.挖掘结构基因组学流程:识别影响高通量实验分析的蛋白质特性。
J Mol Biol. 2004 Feb 6;336(1):115-30. doi: 10.1016/j.jmb.2003.11.053.
5
Protein disorder prediction: implications for structural proteomics.蛋白质无序预测:对结构蛋白质组学的影响。
Structure. 2003 Nov;11(11):1453-9. doi: 10.1016/j.str.2003.10.002.
6
GlobPlot: Exploring protein sequences for globularity and disorder.GlobPlot:探索蛋白质序列的球状性和无序性。
Nucleic Acids Res. 2003 Jul 1;31(13):3701-8. doi: 10.1093/nar/gkg519.
7
A novel complexity measure for comparative analysis of protein sequences from complete genomes.一种用于完整基因组蛋白质序列比较分析的新型复杂性度量方法。
J Biomol Struct Dyn. 2003 Apr;20(5):657-68. doi: 10.1080/07391102.2003.10506882.
8
Abundance and distributions of eukaryote protein simple sequences.真核生物蛋白质简单序列的丰度和分布
Mol Cell Proteomics. 2002 Dec;1(12):983-95. doi: 10.1074/mcp.m200032-mcp200.
9
Loopy proteins appear conserved in evolution.环状蛋白在进化过程中似乎是保守的。
J Mol Biol. 2002 Sep 6;322(1):53-64. doi: 10.1016/s0022-2836(02)00736-2.
10
Simple sequences are rare in the Protein Data Bank.简单序列在蛋白质数据库中很罕见。
Proteins. 2002 Jul 1;48(1):134-40. doi: 10.1002/prot.10150.