• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大肠杆菌的从头结构预测:迈向全基因组蛋白质结构建模与折叠分类

Ab Initio structure prediction for Escherichia coli: towards genome-wide protein structure modeling and fold assignment.

作者信息

Xu Dong, Zhang Yang

机构信息

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.

出版信息

Sci Rep. 2013;3:1895. doi: 10.1038/srep01895.

DOI:10.1038/srep01895
PMID:23719418
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3667494/
Abstract

Genome-wide protein structure prediction and structure-based function annotation have been a long-term goal in molecular biology but not yet become possible due to difficulties in modeling distant-homology targets. We developed a hybrid pipeline combining ab initio folding and template-based modeling for genome-wide structure prediction applied to the Escherichia coli genome. The pipeline was tested on 43 known sequences, where QUARK-based ab initio folding simulation generated models with TM-score 17% higher than that by traditional comparative modeling methods. For 495 unknown hard sequences, 72 are predicted to have a correct fold (TM-score > 0.5) and 321 have a substantial portion of structure correctly modeled (TM-score > 0.35). 317 sequences can be reliably assigned to a SCOP fold family based on structural analogy to existing proteins in PDB. The presented results, as a case study of E. coli, represent promising progress towards genome-wide structure modeling and fold family assignment using state-of-the-art ab initio folding algorithms.

摘要

全基因组蛋白质结构预测和基于结构的功能注释一直是分子生物学的长期目标,但由于对远源同源靶点进行建模存在困难,目前尚未实现。我们开发了一种混合流程,将从头折叠和基于模板的建模相结合,用于全基因组结构预测,并应用于大肠杆菌基因组。该流程在43个已知序列上进行了测试,基于QUARK的从头折叠模拟生成的模型,其TM分数比传统比较建模方法高出17%。对于495个未知的难处理序列,预计有72个具有正确的折叠(TM分数>0.5),321个的大部分结构被正确建模(TM分数>0.35)。基于与PDB中现有蛋白质的结构相似性,317个序列可以可靠地归类到一个SCOP折叠家族。作为大肠杆菌的案例研究,所展示的结果代表了在使用最先进的从头折叠算法进行全基因组结构建模和折叠家族归类方面取得的有前景的进展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ae7/3667494/ba5d71dee411/srep01895-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ae7/3667494/cdd04f4f8150/srep01895-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ae7/3667494/5ae29b570891/srep01895-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ae7/3667494/bd86ee81c438/srep01895-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ae7/3667494/929c0fb22f24/srep01895-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ae7/3667494/ba5d71dee411/srep01895-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ae7/3667494/cdd04f4f8150/srep01895-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ae7/3667494/5ae29b570891/srep01895-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ae7/3667494/bd86ee81c438/srep01895-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ae7/3667494/929c0fb22f24/srep01895-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ae7/3667494/ba5d71dee411/srep01895-f7.jpg

相似文献

1
Ab Initio structure prediction for Escherichia coli: towards genome-wide protein structure modeling and fold assignment.大肠杆菌的从头结构预测:迈向全基因组蛋白质结构建模与折叠分类
Sci Rep. 2013;3:1895. doi: 10.1038/srep01895.
2
Integration of QUARK and I-TASSER for Ab Initio Protein Structure Prediction in CASP11.在蛋白质结构预测关键评估(CASP11)中用于从头蛋白质结构预测的QUARK和I-TASSER整合
Proteins. 2016 Sep;84 Suppl 1(Suppl 1):76-86. doi: 10.1002/prot.24930. Epub 2015 Sep 23.
3
Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10.I-TASSER与QUARK在蛋白质结构预测关键评估第10轮(CASP10)中用于基于模板和从头开始的蛋白质结构预测的相互作用
Proteins. 2014 Feb;82 Suppl 2(0 2):175-87. doi: 10.1002/prot.24341. Epub 2013 Aug 31.
4
Template-based protein structure prediction in CASP11 and retrospect of I-TASSER in the last decade.CASP11中基于模板的蛋白质结构预测及I-TASSER在过去十年的回顾。
Proteins. 2016 Sep;84 Suppl 1(Suppl 1):233-46. doi: 10.1002/prot.24918. Epub 2015 Sep 18.
5
Automated protein structure modeling in CASP9 by I-TASSER pipeline combined with QUARK-based ab initio folding and FG-MD-based structure refinement.CASP9 中通过 I-TASSER 流水线结合 QUARK 基于从头折叠和 FG-MD 基于结构精修的自动化蛋白质结构建模。
Proteins. 2011;79 Suppl 10(Suppl 10):147-60. doi: 10.1002/prot.23111. Epub 2011 Aug 23.
6
Improving fragment-based ab initio protein structure assembly using low-accuracy contact-map predictions.利用低精度接触图预测改进基于片段的从头蛋白质结构组装。
Nat Commun. 2021 Aug 18;12(1):5011. doi: 10.1038/s41467-021-25316-w.
7
Template-based and free modeling of I-TASSER and QUARK pipelines using predicted contact maps in CASP12.在蛋白质结构预测技术评估第12轮(CASP12)中,基于模板以及I-TASSER和QUARK流程的自由建模,并使用预测的接触图。
Proteins. 2018 Mar;86 Suppl 1(Suppl 1):136-151. doi: 10.1002/prot.25414. Epub 2017 Nov 14.
8
A homology/ab initio hybrid algorithm for sampling near-native protein conformations.一种同源/从头算混合算法,用于采样接近天然蛋白质构象。
J Comput Chem. 2013 Aug 15;34(22):1925-36. doi: 10.1002/jcc.23339. Epub 2013 Jun 3.
9
Fast and accurate Ab Initio Protein structure prediction using deep learning potentials.使用深度学习势能进行快速准确的从头开始蛋白质结构预测。
PLoS Comput Biol. 2022 Sep 16;18(9):e1010539. doi: 10.1371/journal.pcbi.1010539. eCollection 2022 Sep.
10
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。
Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.

引用本文的文献

1
Recent Progress of Protein Tertiary Structure Prediction.蛋白质三级结构预测的最新进展。
Molecules. 2024 Feb 13;29(4):832. doi: 10.3390/molecules29040832.
2
Enhancement of novel Endo-polygalacturonase expression in Rhodotorula mucilaginosa PY18: insights from mutagenesis and molecular docking.增强罗伦隐球酵母 PY18 新型内切多聚半乳糖醛酸酶的表达:诱变和分子对接的见解。
Microb Cell Fact. 2023 Dec 8;22(1):252. doi: 10.1186/s12934-023-02253-5.
3
Development of anti-PD-L1 antibody based on structure prediction of AlphaFold2.基于 AlphaFold2 结构预测开发的抗 PD-L1 抗体。

本文引用的文献

1
Toward optimal fragment generations for ab initio protein structure assembly.为从头开始的蛋白质结构组装生成最佳片段。
Proteins. 2013 Feb;81(2):229-39. doi: 10.1002/prot.24179. Epub 2012 Oct 16.
2
Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field.从头开始使用连续结构片段和优化的基于知识的力场进行蛋白质结构组装。
Proteins. 2012 Jul;80(7):1715-35. doi: 10.1002/prot.24065. Epub 2012 Apr 13.
3
Improving the physical realism and structural accuracy of protein models by a two-step atomic-level energy minimization.
Front Immunol. 2023 Oct 24;14:1275999. doi: 10.3389/fimmu.2023.1275999. eCollection 2023.
4
Proteome-wide landscape of solubility limits in a bacterial cell.细菌细胞中可溶性限制的蛋白质组全景。
Sci Rep. 2022 Apr 21;12(1):6547. doi: 10.1038/s41598-022-10427-1.
5
Highly accurate protein structure prediction for the human proteome.高精准度的人类蛋白质组蛋白结构预测。
Nature. 2021 Aug;596(7873):590-596. doi: 10.1038/s41586-021-03828-1. Epub 2021 Jul 22.
6
Toward the solution of the protein structure prediction problem.朝着解决蛋白质结构预测问题的方向努力。
J Biol Chem. 2021 Jul;297(1):100870. doi: 10.1016/j.jbc.2021.100870. Epub 2021 Jun 11.
7
Protein structure, amino acid composition and sequence determine proteome vulnerability to oxidation-induced damage.蛋白质结构、氨基酸组成和序列决定了蛋白质组对氧化诱导损伤的易感性。
EMBO J. 2020 Dec 1;39(23):e104523. doi: 10.15252/embj.2020104523. Epub 2020 Oct 19.
8
Selenoprotein N is an endoplasmic reticulum calcium sensor that links luminal calcium levels to a redox activity.硒蛋白 N 是内质网钙传感器,可将腔内腔内钙水平与氧化还原活性联系起来。
Proc Natl Acad Sci U S A. 2020 Sep 1;117(35):21288-21298. doi: 10.1073/pnas.2003847117. Epub 2020 Aug 17.
9
Machine learning applied to enzyme turnover numbers reveals protein structural correlates and improves metabolic models.机器学习在酶周转率中的应用揭示了蛋白质结构相关性,并改进了代谢模型。
Nat Commun. 2018 Dec 7;9(1):5252. doi: 10.1038/s41467-018-07652-6.
10
Evolution of gene knockout strains of E. coli reveal regulatory architectures governed by metabolism.大肠杆菌基因敲除株的进化揭示了受代谢控制的调控结构。
Nat Commun. 2018 Sep 18;9(1):3796. doi: 10.1038/s41467-018-06219-9.
通过两步原子水平能量最小化提高蛋白质模型的物理真实性和结构准确性。
Biophys J. 2011 Nov 16;101(10):2525-34. doi: 10.1016/j.bpj.2011.10.024. Epub 2011 Nov 15.
4
Assessment of template based protein structure predictions in CASP9.评估基于模板的蛋白质结构预测在 CASP9 中的表现。
Proteins. 2011;79 Suppl 10:37-58. doi: 10.1002/prot.23177. Epub 2011 Oct 15.
5
CASP9 assessment of free modeling target predictions.CASP9 对自由建模靶标预测的评估。
Proteins. 2011;79 Suppl 10(Suppl 10):59-73. doi: 10.1002/prot.23181. Epub 2011 Oct 14.
6
I-TASSER: a unified platform for automated protein structure and function prediction.I-TASSER:一个用于自动化蛋白质结构和功能预测的统一平台。
Nat Protoc. 2010 Apr;5(4):725-38. doi: 10.1038/nprot.2010.5. Epub 2010 Mar 25.
7
How significant is a protein structure similarity with TM-score = 0.5?蛋白质结构相似度 TM 值为 0.5 有多大意义?
Bioinformatics. 2010 Apr 1;26(7):889-95. doi: 10.1093/bioinformatics/btq066. Epub 2010 Feb 17.
8
Generating triangulated macromolecular surfaces by Euclidean Distance Transform.通过欧几里得距离变换生成三角化大分子表面。
PLoS One. 2009 Dec 2;4(12):e8140. doi: 10.1371/journal.pone.0008140.
9
Evaluation of template-based models in CASP8 with standard measures.基于模板的模型在 CASP8 中的评估与标准度量。
Proteins. 2009;77 Suppl 9(0 9):18-28. doi: 10.1002/prot.22561.
10
Progress and challenges in protein structure prediction.蛋白质结构预测的进展与挑战
Curr Opin Struct Biol. 2008 Jun;18(3):342-8. doi: 10.1016/j.sbi.2008.02.004. Epub 2008 Apr 22.