• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质折叠模式背后的通用建筑概念。

Universal Architectural Concepts Underlying Protein Folding Patterns.

作者信息

Konagurthu Arun S, Subramanian Ramanan, Allison Lloyd, Abramson David, Stuckey Peter J, Garcia de la Banda Maria, Lesk Arthur M

机构信息

Department of Data Science and Artificial Intelligence, Faculty of Information Technology, Monash University, Clayton, VIC, Australia.

Research Computing Center, University of Queensland, Brisbane, QLD, Australia.

出版信息

Front Mol Biosci. 2021 Apr 30;7:612920. doi: 10.3389/fmolb.2020.612920. eCollection 2020.

DOI:10.3389/fmolb.2020.612920
PMID:33996891
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8120156/
Abstract

What is the architectural "basis set" of the observed universe of protein structures? Using information-theoretic inference, we answer this question with a dictionary of 1,493 substructures-called -typically at a subdomain level, based on an unbiased subset of known protein structures. Each represents a topologically conserved assembly of helices and strands that make contact. Any protein structure can be dissected into instances of concepts from this dictionary. We dissected the Protein Data Bank and completely inventoried all the concept instances. This yields many insights, including correlations between concepts and catalytic activities or binding sites, useful for rational drug design; local amino-acid sequence-structure correlations, useful for structure prediction methods; and information supporting the recognition and exploration of evolutionary relationships, useful for structural studies. An interactive site, Proçodic, at http://lcb.infotech.monash.edu.au/prosodic (click), provides access to and navigation of the entire dictionary of concepts and their usages, and all associated information. This report is part of a continuing programme with the goal of elucidating fundamental principles of protein architecture, in the spirit of the work of Cyrus Chothia.

摘要

蛋白质结构观测宇宙的架构“基集”是什么?运用信息论推理,我们基于已知蛋白质结构的无偏子集,用一个包含1493个亚结构(通常在亚结构域水平)的字典回答了这个问题。每个亚结构代表相互接触的螺旋和链的拓扑保守组合。任何蛋白质结构都可以分解为由这个字典中的概念实例构成。我们剖析了蛋白质数据库并完整编目了所有概念实例。这带来了许多见解,包括概念与催化活性或结合位点之间的相关性,这对合理药物设计有用;局部氨基酸序列与结构的相关性,这对结构预测方法有用;以及支持识别和探索进化关系的信息,这对结构研究有用。一个交互式网站Proçodic,网址为http://lcb.infotech.monash.edu.au/prosodic (点击进入),提供对整个概念字典及其用法以及所有相关信息的访问和导航。本报告是一个持续项目的一部分,其目标是以赛勒斯·乔西亚工作的精神阐明蛋白质架构的基本原理。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/071a1fe8e7bc/fmolb-07-612920-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/62e6e2daf7ab/fmolb-07-612920-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/82b4b20232a4/fmolb-07-612920-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/34d27e47122f/fmolb-07-612920-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/e6b43b1654e2/fmolb-07-612920-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/e14d0ea43fa9/fmolb-07-612920-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/27f4931a08d9/fmolb-07-612920-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/cec95a17c28b/fmolb-07-612920-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/071a1fe8e7bc/fmolb-07-612920-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/62e6e2daf7ab/fmolb-07-612920-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/82b4b20232a4/fmolb-07-612920-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/34d27e47122f/fmolb-07-612920-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/e6b43b1654e2/fmolb-07-612920-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/e14d0ea43fa9/fmolb-07-612920-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/27f4931a08d9/fmolb-07-612920-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/cec95a17c28b/fmolb-07-612920-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a623/8120156/071a1fe8e7bc/fmolb-07-612920-g008.jpg

相似文献

1
Universal Architectural Concepts Underlying Protein Folding Patterns.蛋白质折叠模式背后的通用建筑概念。
Front Mol Biosci. 2021 Apr 30;7:612920. doi: 10.3389/fmolb.2020.612920. eCollection 2020.
2
Information-Theoretic Inference of an Optimal Dictionary of Protein Supersecondary Structures.蛋白质超二级结构最优字典的信息论推断
Methods Mol Biol. 2019;1958:123-131. doi: 10.1007/978-1-4939-9161-7_6.
3
Statistical inference of protein structural alignments using information and compression.利用信息与压缩技术对蛋白质结构比对进行统计推断
Bioinformatics. 2017 Apr 1;33(7):1005-1013. doi: 10.1093/bioinformatics/btw757.
4
A new statistical framework to assess structural alignment quality using information compression.一种使用信息压缩来评估结构比对质量的新统计框架。
Bioinformatics. 2014 Sep 1;30(17):i512-8. doi: 10.1093/bioinformatics/btu460.
5
The molecular interactions that stabilize RNA tertiary structure: RNA motifs, patterns, and networks.稳定 RNA 三级结构的分子相互作用:RNA 基序、模式和网络。
Acc Chem Res. 2011 Dec 20;44(12):1302-11. doi: 10.1021/ar200098t. Epub 2011 Sep 7.
6
Cataloging topologies of protein folding patterns.蛋白质折叠模式的拓扑结构编目。
J Mol Recognit. 2010 Mar-Apr;23(2):253-7. doi: 10.1002/jmr.1006.
7
Contact patterns between helices and strands of sheet define protein folding patterns.螺旋与片层中的链之间的接触模式决定了蛋白质的折叠模式。
Proteins. 2007 Mar 1;66(4):869-76. doi: 10.1002/prot.21241.
8
Conserved key amino acid positions (CKAAPs) derived from the analysis of common substructures in proteins.通过对蛋白质中常见亚结构的分析得出的保守关键氨基酸位置(CKAAPs)。
Proteins. 2001 Feb 1;42(2):148-63. doi: 10.1002/1097-0134(20010201)42:2<148::aid-prot20>3.0.co;2-r.
9
Statistical compression of protein sequences and inference of marginal probability landscapes over competing alignments using finite state models and Dirichlet priors.使用有限状态模型和狄利克雷先验概率对蛋白质序列进行统计压缩,并对竞争比对进行边缘概率景观推断。
Bioinformatics. 2019 Jul 15;35(14):i360-i369. doi: 10.1093/bioinformatics/btz368.
10
Interactive Universal Design Kiosks: Explanations About Social Inclusion Features in Architectural Design.交互式通用设计信息亭:关于建筑设计中社会包容特征的阐释
Stud Health Technol Inform. 2016;229:26-8.

引用本文的文献

1
Protein Fold Usages in Ribosomes: Another Glance to the Past.核糖体中蛋白质折叠的用途:另一种回顾过去的方式。
Int J Mol Sci. 2024 Aug 13;25(16):8806. doi: 10.3390/ijms25168806.
2
How much metagenome data is needed for protein structure prediction: The advantages of targeted approach from the ecological and evolutionary perspectives.蛋白质结构预测需要多少宏基因组数据:从生态和进化角度看靶向方法的优势
Imeta. 2022 Mar 6;1(1):e9. doi: 10.1002/imt2.9. eCollection 2022 Mar.
3
Multilevel evolution shapes the function of NB-LRR encoding genes in plant innate immunity.

本文引用的文献

1
Improved protein structure prediction using potentials from deep learning.利用深度学习势进行蛋白质结构预测的改进。
Nature. 2020 Jan;577(7792):706-710. doi: 10.1038/s41586-019-1923-7. Epub 2020 Jan 15.
2
I-TASSER gateway: A protein structure and function prediction server powered by XSEDE.I-TASSER网关:一个由XSEDE提供支持的蛋白质结构与功能预测服务器。
Future Gener Comput Syst. 2019 Oct;99:73-85. doi: 10.1016/j.future.2019.04.011. Epub 2019 Apr 9.
3
SWISS-MODEL: homology modelling of protein structures and complexes.SWISS-MODEL:蛋白质结构和复合物的同源建模。
多级进化塑造了植物先天免疫中NB-LRR编码基因的功能。
Front Plant Sci. 2022 Oct 27;13:1007288. doi: 10.3389/fpls.2022.1007288. eCollection 2022.
4
Alt a 1 Promotes Allergic Asthma Through TLR4-Alveolar Macrophages.Alt a 1 通过 TLR4-肺泡巨噬细胞促进变应性哮喘。
Front Immunol. 2022 Jun 30;13:877383. doi: 10.3389/fimmu.2022.877383. eCollection 2022.
5
Missense Mutations Modify the Conformational Ensemble of the -Synuclein Monomer Which Exhibits a Two-Phase Characteristic.错义突变改变了具有两相特征的α-突触核蛋白单体的构象集。
Front Mol Biosci. 2021 Nov 29;8:786123. doi: 10.3389/fmolb.2021.786123. eCollection 2021.
6
Switching an active site helix in dihydrofolate reductase reveals limits to subdomain modularity.在二氢叶酸还原酶中切换活性位点螺旋揭示了亚结构域模块性的限制。
Biophys J. 2021 Nov 2;120(21):4738-4750. doi: 10.1016/j.bpj.2021.09.032. Epub 2021 Sep 25.
Nucleic Acids Res. 2018 Jul 2;46(W1):W296-W303. doi: 10.1093/nar/gky427.
4
Diversity and motif conservation in protein 3D structural landscape: exploration by a new multivariate simulation method.蛋白质三维结构景观中的多样性与基序保守性:一种新的多变量模拟方法的探索
J Mol Model. 2018 Mar 2;24(4):76. doi: 10.1007/s00894-018-3614-y.
5
Combining co-evolution and secondary structure prediction to improve fragment library generation.结合共进化和二级结构预测来改进片段库生成。
Bioinformatics. 2018 Jul 1;34(13):2219-2227. doi: 10.1093/bioinformatics/bty084.
6
Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths.在对不同长度的重复使用的蛋白质片段的分析中揭示了复杂的进化足迹。
Proc Natl Acad Sci U S A. 2017 Oct 31;114(44):11703-11708. doi: 10.1073/pnas.1707642114. Epub 2017 Oct 19.
7
Statistical inference of protein structural alignments using information and compression.利用信息与压缩技术对蛋白质结构比对进行统计推断
Bioinformatics. 2017 Apr 1;33(7):1005-1013. doi: 10.1093/bioinformatics/btw757.
8
SCOPe: Manual Curation and Artifact Removal in the Structural Classification of Proteins - extended Database.SCOPe:蛋白质结构分类中的人工整理与伪迹去除——扩展数据库
J Mol Biol. 2017 Feb 3;429(3):348-355. doi: 10.1016/j.jmb.2016.11.023. Epub 2016 Nov 30.
9
ECOD: new developments in the evolutionary classification of domains.ECOD:结构域进化分类的新进展
Nucleic Acids Res. 2017 Jan 4;45(D1):D296-D302. doi: 10.1093/nar/gkw1137. Epub 2016 Nov 29.
10
Tertiary alphabet for the observable protein structural universe.可观测蛋白质结构宇宙的三级字母表。
Proc Natl Acad Sci U S A. 2016 Nov 22;113(47):E7438-E7447. doi: 10.1073/pnas.1607178113. Epub 2016 Nov 3.