• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

简单序列在蛋白质数据库中很罕见。

Simple sequences are rare in the Protein Data Bank.

作者信息

Huntley Melanie A, Golding G Brian

机构信息

Department of Biology, McMaster University, Hamilton, Ontario, Canada.

出版信息

Proteins. 2002 Jul 1;48(1):134-40. doi: 10.1002/prot.10150.

DOI:10.1002/prot.10150
PMID:12012345
Abstract

A simple sequence is abundant in the proteins that have been sequenced to date. But unusual protein features, such as a simple sequence, are not present in the same high frequency within structural databases. A subset of these simple sequences, a group with a highly repetitive nature has been shown to be abundant in eukaryotes but not in prokaryotes. In this study, an examination of the eukaryotic proteins in the Protein Data Bank (PDB) has revealed a large deficiency of low complexity, highly repetitive protein repeats. Through simulated databases of similar samples of eukaryotic proteins taken from the National Center for Biotechnology Information (NCBI) database, it is shown that the PDB contains a significantly less highly repetitive, simple sequence than artificial databases of similar composition randomly derived from NCBI. When the structural data for those few PDB sequences that did contain a highly repetitive simple sequence is examined in detail, it is found that in most cases the tertiary structure is unknown for the regions consisting of a simple sequence. This lack of a simple sequence both in the PDB database and in the structural information suggests that this type of simple sequence may produce disordered structures that make structural characterization difficult.

摘要

在迄今为止已测序的蛋白质中,简单序列很丰富。但是,诸如简单序列这样的不寻常蛋白质特征在结构数据库中的出现频率并不相同。这些简单序列的一个子集,即具有高度重复性的一组序列,已被证明在真核生物中很丰富,而在原核生物中则不然。在这项研究中,对蛋白质数据库(PDB)中的真核生物蛋白质进行检查后发现,低复杂性、高度重复的蛋白质重复序列存在很大不足。通过从美国国立生物技术信息中心(NCBI)数据库获取的类似真核生物蛋白质样本的模拟数据库表明,PDB中高度重复的简单序列明显少于从NCBI随机导出的类似组成的人工数据库。当详细检查那些确实包含高度重复简单序列的少数PDB序列的结构数据时,发现在大多数情况下,由简单序列组成的区域的三级结构是未知的。PDB数据库和结构信息中都缺乏简单序列,这表明这种类型的简单序列可能会产生无序结构,从而使结构表征变得困难。

相似文献

1
Simple sequences are rare in the Protein Data Bank.简单序列在蛋白质数据库中很罕见。
Proteins. 2002 Jul 1;48(1):134-40. doi: 10.1002/prot.10150.
2
Intrinsic disorder in the Protein Data Bank.蛋白质数据库中的内在无序状态。
J Biomol Struct Dyn. 2007 Feb;24(4):325-42. doi: 10.1080/07391102.2007.10507123.
3
Protein simple sequence conservation.蛋白质简单序列保守性
Proteins. 2004 Mar 1;54(4):629-38. doi: 10.1002/prot.10623.
4
NRL-3D: a sequence-structure database derived from the protein data bank (PDB) and searchable within the PIR environment.NRL - 3D:一个源自蛋白质数据库(PDB)且可在PIR环境中进行搜索的序列结构数据库。
Protein Seq Data Anal. 1990 Oct;3(5):387-405.
5
Comparison of sequence and structure-based datasets for nonredundant structural data mining.用于非冗余结构数据挖掘的基于序列和结构的数据集比较。
Proteins. 2005 Sep 1;60(4):577-83. doi: 10.1002/prot.20505.
6
A comparative view at comprehensive information resources on three-dimensional structures of biological macro-molecules.关于生物大分子三维结构综合信息资源的比较研究。
Brief Funct Genomic Proteomic. 2007 Sep;6(3):220-39. doi: 10.1093/bfgp/elm020. Epub 2007 Oct 23.
7
The ConSurf-HSSP database: the mapping of evolutionary conservation among homologs onto PDB structures.ConSurf-HSSP数据库:同源物间进化保守性在蛋白质数据银行(PDB)结构上的映射。
Proteins. 2005 Feb 15;58(3):610-7. doi: 10.1002/prot.20305.
8
PSSARD (2.0): a database server for making flexible queries relating amino acid sequences to main-chain secondary structure conformations for proteins of known three-dimensional structure and certain useful applications.PSSARD(2.0):一个数据库服务器,用于对已知三维结构的蛋白质的氨基酸序列与主链二级结构构象进行灵活查询以及一些有用的应用。
Int J Biol Macromol. 2007 Jun 1;41(1):109-13. doi: 10.1016/j.ijbiomac.2006.10.006. Epub 2006 Nov 17.
9
NdPASA: a novel pairwise protein sequence alignment algorithm that incorporates neighbor-dependent amino acid propensities.NdPASA:一种整合了邻域依赖氨基酸倾向的新型双序列蛋白质序列比对算法。
Proteins. 2005 Feb 15;58(3):628-37. doi: 10.1002/prot.20359.
10
SSMap: a new UniProt-PDB mapping resource for the curation of structural-related information in the UniProt/Swiss-Prot Knowledgebase.SSMap:一种用于在UniProt/Swiss-Prot知识库中整理结构相关信息的新型UniProt-PDB映射资源。
BMC Bioinformatics. 2008 Sep 23;9:391. doi: 10.1186/1471-2105-9-391.

引用本文的文献

1
Low-complexity regions in fungi display functional groups and are depleted in positively charged amino acids.真菌中的低复杂性区域呈现出功能基团,并且带正电荷的氨基酸含量较低。
NAR Genom Bioinform. 2025 Feb 27;7(1):lqaf014. doi: 10.1093/nargab/lqaf014. eCollection 2025 Mar.
2
Reviewing the Structure-Function Paradigm in Polyglutamine Disorders: A Synergistic Perspective on Theoretical and Experimental Approaches.综述多聚谷氨酰胺疾病的结构-功能范式:理论和实验方法的协同视角。
Int J Mol Sci. 2024 Jun 20;25(12):6789. doi: 10.3390/ijms25126789.
3
Caffeine improves mitochondrial dysfunction in the white matter of neonatal rats with hypoxia-ischemia through deacetylation: a proteomic analysis of lysine acetylation.
咖啡因通过去乙酰化改善缺氧缺血新生大鼠白质中的线粒体功能障碍:赖氨酸乙酰化的蛋白质组学分析
Front Mol Neurosci. 2024 Apr 30;17:1394886. doi: 10.3389/fnmol.2024.1394886. eCollection 2024.
4
Evolution of Transcript Abundance is Influenced by Indels in Protein Low Complexity Regions.转录本丰度的进化受蛋白质低复杂度区域插入缺失的影响。
J Mol Evol. 2024 Apr;92(2):153-168. doi: 10.1007/s00239-024-10158-z. Epub 2024 Mar 14.
5
Pervasive, conserved secondary structure in highly charged protein regions.高度带电蛋白质区域中普遍存在的保守二级结构。
PLoS Comput Biol. 2023 Oct 16;19(10):e1011565. doi: 10.1371/journal.pcbi.1011565. eCollection 2023 Oct.
6
Paradoxes of Cellular SUMOylation Regulation: A Role of Biomolecular Condensates?细胞 SUMOylation 调控的悖论:生物分子凝聚物的作用?
Pharmacol Rev. 2023 Sep;75(5):979-1006. doi: 10.1124/pharmrev.122.000784. Epub 2023 May 3.
7
Low Complexity Regions in Proteins and DNA are Poorly Correlated.蛋白质和 DNA 中的低复杂度区域相关性差。
Mol Biol Evol. 2023 Apr 4;40(4). doi: 10.1093/molbev/msad084.
8
Interaction modules that impart specificity to disordered protein.赋予无序蛋白特异性的相互作用模块。
Trends Biochem Sci. 2023 May;48(5):477-490. doi: 10.1016/j.tibs.2023.01.004. Epub 2023 Feb 6.
9
Functional Tuning of Intrinsically Disordered Regions in Human Proteins by Composition Bias.通过组成偏见对人类蛋白质中的无规卷曲区域进行功能调节。
Biomolecules. 2022 Oct 15;12(10):1486. doi: 10.3390/biom12101486.
10
Lineage-specific protein repeat expansions and contractions reveal malleable regions of immune genes.谱系特异性蛋白重复扩展和收缩揭示了免疫基因的可塑区域。
Genes Immun. 2022 Nov;23(7):218-234. doi: 10.1038/s41435-022-00186-4. Epub 2022 Oct 6.