通过迭代TASSER模拟对小蛋白质进行从头建模。

Ab initio modeling of small proteins by iterative TASSER simulations.

作者信息

Wu Sitao, Skolnick Jeffrey, Zhang Yang

机构信息

Center for Bioinformatics and Department of Molecular Bioscience, University of Kansas, Lawrence, KS 66047, USA.

出版信息

BMC Biol. 2007 May 8;5:17. doi: 10.1186/1741-7007-5-17.

DOI:10.1186/1741-7007-5-17

PMID:17488521

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1878469/

Abstract

BACKGROUND

Predicting 3-dimensional protein structures from amino-acid sequences is an important unsolved problem in computational structural biology. The problem becomes relatively easier if close homologous proteins have been solved, as high-resolution models can be built by aligning target sequences to the solved homologous structures. However, for sequences without similar folds in the Protein Data Bank (PDB) library, the models have to be predicted from scratch. Progress in the ab initio structure modeling is slow. The aim of this study was to extend the TASSER (threading/assembly/refinement) method for the ab initio modeling and examine systemically its ability to fold small single-domain proteins.

RESULTS

We developed I-TASSER by iteratively implementing the TASSER method, which is used in the folding test of three benchmarks of small proteins. First, data on 16 small proteins (< 90 residues) were used to generate I-TASSER models, which had an average Calpha-root mean square deviation (RMSD) of 3.8A, with 6 of them having a Calpha-RMSD < 2.5A. The overall result was comparable with the all-atomic ROSETTA simulation, but the central processing unit (CPU) time by I-TASSER was much shorter (150 CPU days vs. 5 CPU hours). Second, data on 20 small proteins (< 120 residues) were used. I-TASSER folded four of them with a Calpha-RMSD < 2.5A. The average Calpha-RMSD of the I-TASSER models was 3.9A, whereas it was 5.9A using TOUCHSTONE-II software. Finally, 20 non-homologous small proteins (< 120 residues) were taken from the PDB library. An average Calpha-RMSD of 3.9A was obtained for the third benchmark, with seven cases having a Calpha-RMSD < 2.5A.

CONCLUSION

Our simulation results show that I-TASSER can consistently predict the correct folds and sometimes high-resolution models for small single-domain proteins. Compared with other ab initio modeling methods such as ROSETTA and TOUCHSTONE II, the average performance of I-TASSER is either much better or is similar within a lower computational time. These data, together with the significant performance of automated I-TASSER server (the Zhang-Server) in the 'free modeling' section of the recent Critical Assessment of Structure Prediction (CASP)7 experiment, demonstrate new progresses in automated ab initio model generation. The I-TASSER server is freely available for academic users http://zhang.bioinformatics.ku.edu/I-TASSER.

摘要

背景

从氨基酸序列预测三维蛋白质结构是计算结构生物学中一个重要的未解决问题。如果已解析出相近的同源蛋白质，那么该问题相对会变得容易一些，因为通过将目标序列与已解析的同源结构进行比对，可以构建高分辨率模型。然而，对于蛋白质数据库（PDB）库中没有相似折叠结构的序列，模型必须从头开始预测。从头开始的结构建模进展缓慢。本研究的目的是扩展TASSER（穿线法/组装/优化）方法用于从头建模，并系统地检验其折叠小单结构域蛋白质的能力。

结果

我们通过迭代实施TASSER方法开发了I-TASSER，该方法用于三个小蛋白质基准测试的折叠测试。首先，使用16个小蛋白质（<90个残基）的数据生成I-TASSER模型，其平均Cα-均方根偏差（RMSD）为3.8Å，其中6个的Cα-RMSD<2.5Å。总体结果与全原子ROSETTA模拟相当，但I-TASSER的中央处理器（CPU）时间要短得多（150个CPU日对5个CPU小时）。其次，使用20个小蛋白质（<120个残基）的数据。I-TASSER将其中4个折叠为Cα-RMSD<2.5Å。I-TASSER模型的平均Cα-RMSD为3.9Å，而使用TOUCHSTONE-II软件时为5.9Å。最后，从PDB库中选取20个非同源小蛋白质（<120个残基）。第三个基准测试获得的平均Cα-RMSD为3.9Å，其中7个案例的Cα-RMSD<2.5Å。

结论

我们的模拟结果表明，I-TASSER能够一致地预测小单结构域蛋白质的正确折叠，有时还能预测出高分辨率模型。与其他从头建模方法如ROSETTA和TOUCHSTONE II相比，I-TASSER的平均性能要么好得多，要么在更短的计算时间内与之相似。这些数据，连同自动化I-TASSER服务器（Zhang-Server）在最近的蛋白质结构预测关键评估（CASP）7实验的“自由建模”部分中的显著表现，证明了在自动化从头模型生成方面取得的新进展。I-TASSER服务器可供学术用户免费使用，网址为http://zhang.bioinformatics.ku.edu/I-TASSER。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e911/1878469/1e433f7ad86f/1741-7007-5-17-1.jpg

相似文献

Ab initio modeling of small proteins by iterative TASSER simulations.

BMC Biol. 2007 May 8;5:17. doi: 10.1186/1741-7007-5-17.

Analysis of TASSER-based CASP7 protein structure prediction results.

Proteins. 2007;69 Suppl 8:90-7. doi: 10.1002/prot.21649.

Integration of QUARK and I-TASSER for Ab Initio Protein Structure Prediction in CASP11.

Proteins. 2016 Sep;84 Suppl 1(Suppl 1):76-86. doi: 10.1002/prot.24930. Epub 2015 Sep 23.

I-TASSER server for protein 3D structure prediction.

BMC Bioinformatics. 2008 Jan 23;9:40. doi: 10.1186/1471-2105-9-40.

Template-based modeling and free modeling by I-TASSER in CASP7.

Proteins. 2007;69 Suppl 8:108-17. doi: 10.1002/prot.21702.

Template-based protein structure prediction in CASP11 and retrospect of I-TASSER in the last decade.

Proteins. 2016 Sep;84 Suppl 1(Suppl 1):233-46. doi: 10.1002/prot.24918. Epub 2015 Sep 18.

Automated protein structure modeling in CASP9 by I-TASSER pipeline combined with QUARK-based ab initio folding and FG-MD-based structure refinement.

Proteins. 2011;79 Suppl 10(Suppl 10):147-60. doi: 10.1002/prot.23111. Epub 2011 Aug 23.

Benchmarking of TASSER in the ab initio limit.

Proteins. 2007 Jul 1;68(1):48-56. doi: 10.1002/prot.21392.

TASSER-Lite: an automated tool for protein comparative modeling.

Biophys J. 2006 Dec 1;91(11):4180-90. doi: 10.1529/biophysj.106.084293. Epub 2006 Sep 8.

Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10.

Proteins. 2014 Feb;82 Suppl 2(0 2):175-87. doi: 10.1002/prot.24341. Epub 2013 Aug 31.

引用本文的文献

Myonectin and metabolic health: a systematic review.

Front Endocrinol (Lausanne). 2025 Jul 16;16:1557142. doi: 10.3389/fendo.2025.1557142. eCollection 2025.

Chemosensory Receptors in Vertebrates: Structure and Computational Modeling Insights.

Int J Mol Sci. 2025 Jul 10;26(14):6605. doi: 10.3390/ijms26146605.

Highly dynamic and sensitive NEMOer calcium indicators for imaging ER calcium signals in excitable cells.

Nat Commun. 2025 Apr 11;16(1):3472. doi: 10.1038/s41467-025-58705-6.

Repurposing thioridazine as a potential CD2068 inhibitor to mitigate antibiotic resistance in infection.

Comput Struct Biotechnol J. 2025 Mar 1;27:887-895. doi: 10.1016/j.csbj.2025.02.036. eCollection 2025.

Designing of a multiepitope-based vaccine against echinococcosis utilizing the potent Ag5 antigen: Immunoinformatics and simulation approaches.

PLoS One. 2025 Feb 12;20(2):e0310510. doi: 10.1371/journal.pone.0310510. eCollection 2025.

-targeted AI-driven vaccines: a paradigm shift in gastric cancer prevention.

Front Immunol. 2024 Nov 28;15:1500921. doi: 10.3389/fimmu.2024.1500921. eCollection 2024.

Binding structures of SERF1a with NT17-polyQ peptides of huntingtin exon 1 revealed by SEC-SWAXS, NMR and molecular simulation.

IUCrJ. 2024 Sep 1;11(Pt 5):849-858. doi: 10.1107/S2052252524006341.

An mRNA vaccine for pancreatic cancer designed by applying in silico immunoinformatics and reverse vaccinology approaches.

PLoS One. 2024 Jul 8;19(7):e0305413. doi: 10.1371/journal.pone.0305413. eCollection 2024.

How much metagenome data is needed for protein structure prediction: The advantages of targeted approach from the ecological and evolutionary perspectives.

Imeta. 2022 Mar 6;1(1):e9. doi: 10.1002/imt2.9. eCollection 2022 Mar.

Construction of an aerolysin-based multi-epitope vaccine against an machine learning and artificial intelligence-supported approach.

Front Immunol. 2024 Mar 1;15:1369890. doi: 10.3389/fimmu.2024.1369890. eCollection 2024.

本文引用的文献

On the origin and highly likely completeness of single-domain protein structures.

Proc Natl Acad Sci U S A. 2006 Feb 21;103(8):2605-10. doi: 10.1073/pnas.0509379103. Epub 2006 Feb 14.

TASSER: an automated method for the prediction of protein tertiary structures in CASP6.

Proteins. 2005;61 Suppl 7:91-98. doi: 10.1002/prot.20724.

Assessment of predictions submitted for the CASP6 comparative modeling category.

Proteins. 2005;61 Suppl 7:27-45. doi: 10.1002/prot.20720.

Toward high-resolution de novo structure prediction for small proteins.

Science. 2005 Sep 16;309(5742):1868-71. doi: 10.1126/science.1113801.

Prediction of solvent accessibility and sites of deleterious mutations from protein sequence.

Nucleic Acids Res. 2005 Jun 3;33(10):3193-9. doi: 10.1093/nar/gki633. Print 2005.

TM-align: a protein structure alignment algorithm based on the TM-score.

Nucleic Acids Res. 2005 Apr 22;33(7):2302-9. doi: 10.1093/nar/gki524. Print 2005.

The protein structure prediction problem could be solved using the current PDB library.

Proc Natl Acad Sci U S A. 2005 Jan 25;102(4):1029-34. doi: 10.1073/pnas.0407152101. Epub 2005 Jan 14.

Ab initio prediction of the three-dimensional structure of a de novo designed protein: a double-blind case study.

Proteins. 2005 Feb 15;58(3):560-70. doi: 10.1002/prot.20338.

Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments.

Proteins. 2005 Feb 1;58(2):321-8. doi: 10.1002/prot.20308.

Scoring function for automated assessment of protein structure template quality.

Proteins. 2004 Dec 1;57(4):702-10. doi: 10.1002/prot.20264.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过迭代TASSER模拟对小蛋白质进行从头建模。

Ab initio modeling of small proteins by iterative TASSER simulations.

作者信息

Wu Sitao, Skolnick Jeffrey, Zhang Yang

机构信息

Center for Bioinformatics and Department of Molecular Bioscience, University of Kansas, Lawrence, KS 66047, USA.