• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过将深度学习接触图与 I-TASSER 组装模拟相结合来折叠非同源蛋白质。

Folding non-homologous proteins by coupling deep-learning contact maps with I-TASSER assembly simulations.

机构信息

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.

These authors contributed equally.

出版信息

Cell Rep Methods. 2021 Jul 26;1(3). doi: 10.1016/j.crmeth.2021.100014. Epub 2021 Jun 21.

DOI:10.1016/j.crmeth.2021.100014
PMID:34355210
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8336924/
Abstract

Structure prediction for proteins lacking homologous templates in the Protein Data Bank (PDB) remains a significant unsolved problem. We developed a protocol, C-I-TASSER, to integrate interresidue contact maps from deep neural-network learning with the cutting-edge I-TASSER fragment assembly simulations. Large-scale benchmark tests showed that C-I-TASSER can fold more than twice the number of non-homologous proteins than the I-TASSER, which does not use contacts. When applied to a folding experiment on 8,266 unsolved Pfam families, C-I-TASSER successfully folded 4,162 domain families, including 504 folds that are not found in the PDB. Furthermore, it created correct folds for 85% of proteins in the SARS-CoV-2 genome, despite the quick mutation rate of the virus and sparse sequence profiles. The results demonstrated the critical importance of coupling whole-genome and metagenome-based evolutionary information with optimal structure assembly simulations for solving the problem of non-homologous protein structure prediction.

摘要

在蛋白质数据库 (PDB) 中缺乏同源模板的蛋白质结构预测仍然是一个未解决的重大问题。我们开发了一种名为 C-I-TASSER 的协议,将来自深度神经网络学习的残基间接触图与最先进的 I-TASSER 片段组装模拟相结合。大规模基准测试表明,C-I-TASSER 可以折叠比不使用接触信息的 I-TASSER 多两倍的非同源蛋白质。当应用于 8266 个未解决的 Pfam 家族的折叠实验时,C-I-TASSER 成功折叠了 4162 个结构域家族,其中包括 504 个在 PDB 中未发现的折叠。此外,它为 SARS-CoV-2 基因组中的 85%的蛋白质创建了正确的折叠,尽管病毒的快速突变率和稀疏的序列特征。这些结果表明,将基于全基因组和宏基因组的进化信息与最佳结构组装模拟相结合对于解决非同源蛋白质结构预测问题至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/15d809e7093f/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/304ba67c2d6d/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/72deb77cd2c9/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/e3d8fdbb3741/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/c54eabf3357e/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/d45a2e0b9806/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/392d3985c51e/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/15d809e7093f/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/304ba67c2d6d/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/72deb77cd2c9/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/e3d8fdbb3741/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/c54eabf3357e/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/d45a2e0b9806/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/392d3985c51e/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa87/9017242/15d809e7093f/gr6.jpg

相似文献

1
Folding non-homologous proteins by coupling deep-learning contact maps with I-TASSER assembly simulations.通过将深度学习接触图与 I-TASSER 组装模拟相结合来折叠非同源蛋白质。
Cell Rep Methods. 2021 Jul 26;1(3). doi: 10.1016/j.crmeth.2021.100014. Epub 2021 Jun 21.
2
Deep-learning contact-map guided protein structure prediction in CASP13.深度学习接触图指导的 CASP13 蛋白质结构预测。
Proteins. 2019 Dec;87(12):1149-1164. doi: 10.1002/prot.25792. Epub 2019 Aug 14.
3
Ab initio modeling of small proteins by iterative TASSER simulations.通过迭代TASSER模拟对小蛋白质进行从头建模。
BMC Biol. 2007 May 8;5:17. doi: 10.1186/1741-7007-5-17.
4
I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction.I-TASSER-MTD:一个基于深度学习的多领域蛋白质结构和功能预测平台。
Nat Protoc. 2022 Oct;17(10):2326-2353. doi: 10.1038/s41596-022-00728-0. Epub 2022 Aug 5.
5
Template-based and free modeling of I-TASSER and QUARK pipelines using predicted contact maps in CASP12.在蛋白质结构预测技术评估第12轮(CASP12)中,基于模板以及I-TASSER和QUARK流程的自由建模,并使用预测的接触图。
Proteins. 2018 Mar;86 Suppl 1(Suppl 1):136-151. doi: 10.1002/prot.25414. Epub 2017 Nov 14.
6
NMR data-driven structure determination using NMR-I-TASSER in the CASD-NMR experiment.在CASD-NMR实验中使用NMR-I-TASSER进行基于核磁共振数据驱动的结构测定。
J Biomol NMR. 2015 Aug;62(4):511-25. doi: 10.1007/s10858-015-9914-y. Epub 2015 Mar 4.
7
Integration of QUARK and I-TASSER for Ab Initio Protein Structure Prediction in CASP11.在蛋白质结构预测关键评估(CASP11)中用于从头蛋白质结构预测的QUARK和I-TASSER整合
Proteins. 2016 Sep;84 Suppl 1(Suppl 1):76-86. doi: 10.1002/prot.24930. Epub 2015 Sep 23.
8
Improving fragment-based ab initio protein structure assembly using low-accuracy contact-map predictions.利用低精度接触图预测改进基于片段的从头蛋白质结构组装。
Nat Commun. 2021 Aug 18;12(1):5011. doi: 10.1038/s41467-021-25316-w.
9
Protein structure prediction using deep learning distance and hydrogen-bonding restraints in CASP14.使用深度学习距离和氢键约束进行 CASP14 中的蛋白质结构预测。
Proteins. 2021 Dec;89(12):1734-1751. doi: 10.1002/prot.26193. Epub 2021 Aug 7.
10
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.

引用本文的文献

1
Genome-wide identification, characterization and evolutionary analysis of the pyrroline-5-carboxylate synthetase (P5CS), succinic semialdehyde dehydrogenase (SSADH), and dehydrin (DHN) genes in Solanum lycopersicum under drought stress.干旱胁迫下番茄中吡咯啉-5-羧酸合成酶(P5CS)、琥珀酸半醛脱氢酶(SSADH)和脱水素(DHN)基因的全基因组鉴定、特征分析及进化分析
BMC Plant Biol. 2025 Aug 9;25(1):1060. doi: 10.1186/s12870-025-07057-w.
2
A cryptic START domain regulates deeply conserved transcription factors.一个隐秘的起始结构域调控深度保守的转录因子。
bioRxiv. 2025 Aug 1:2025.07.29.667167. doi: 10.1101/2025.07.29.667167.
3

本文引用的文献

1
Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.通过深度残差卷积网络从一组共进化矩阵中推导高精度蛋白质接触图。
PLoS Comput Biol. 2021 Mar 26;17(3):e1008865. doi: 10.1371/journal.pcbi.1008865. eCollection 2021 Mar.
2
Virtual Screening of Human Class-A GPCRs Using Ligand Profiles Built on Multiple Ligand-Receptor Interactions.基于多种配体-受体相互作用构建配体特征对人 A 类 GPCR 的虚拟筛选
J Mol Biol. 2020 Aug 7;432(17):4872-4890. doi: 10.1016/j.jmb.2020.07.003. Epub 2020 Jul 9.
3
FASPR: an open-source tool for fast and accurate protein side-chain packing.
Phenotypic dichotomy in Crotalus durissus ruruima venom and potential consequences for clinical management of snakebite envenomations.
杜氏响尾蛇鲁里马亚种毒液的表型二分法及其对蛇咬伤中毒临床管理的潜在影响。
PLoS Negl Trop Dis. 2025 Aug 1;19(8):e0013296. doi: 10.1371/journal.pntd.0013296. eCollection 2025 Aug.
4
Chemosensory Receptors in Vertebrates: Structure and Computational Modeling Insights.脊椎动物的化学感受器:结构与计算建模见解
Int J Mol Sci. 2025 Jul 10;26(14):6605. doi: 10.3390/ijms26146605.
5
Search continues: Exploring immunoinformatics platforms for designing an effective multiepitope malaria vaccine candidate.搜索仍在继续:探索用于设计有效多表位疟疾疫苗候选物的免疫信息学平台。
BioTechnologia (Pozn). 2025 Jun 30;106(2):151-168. doi: 10.5114/bta/204528. eCollection 2025.
6
Impact of SARS-CoV-2 Variant NSP6 on Pathogenicity: Genetic Analysis and Cell Biology.严重急性呼吸综合征冠状病毒2变体NSP6对致病性的影响:遗传分析与细胞生物学
Curr Issues Mol Biol. 2025 May 14;47(5):361. doi: 10.3390/cimb47050361.
7
Bioinformatic Analysis of WNT Family Proteins.WNT家族蛋白的生物信息学分析
Bioinform Biol Insights. 2025 Jul 15;19:11779322251353347. doi: 10.1177/11779322251353347. eCollection 2025.
8
The Structural Basis of Binding Stability and Selectivity of Sarolaner Enantiomers for RDL Receptors.沙罗芦那胺对映体与RDL受体结合稳定性和选择性的结构基础
Molecules. 2025 Jun 26;30(13):2756. doi: 10.3390/molecules30132756.
9
Artificial Intelligence (AI)-Based Protein Structure Prediction and Analysis.基于人工智能(AI)的蛋白质结构预测与分析
Methods Mol Biol. 2025;2952:39-57. doi: 10.1007/978-1-0716-4690-8_3.
10
Predicted conformations of 5-HT3 receptor ion channels are modified by subunit D.5-羟色胺3型受体离子通道的预测构象被D亚基修饰。
Comput Struct Biotechnol J. 2025 May 29;27:2394-2402. doi: 10.1016/j.csbj.2025.05.048. eCollection 2025.
FASPR:一种用于快速准确的蛋白质侧链包装的开源工具。
Bioinformatics. 2020 Jun 1;36(12):3758-3765. doi: 10.1093/bioinformatics/btaa234.
4
Improved protein structure prediction using potentials from deep learning.利用深度学习势进行蛋白质结构预测的改进。
Nature. 2020 Jan;577(7792):706-710. doi: 10.1038/s41586-019-1923-7. Epub 2020 Jan 15.
5
Improved protein structure prediction using predicted interresidue orientations.利用预测的残基间取向改进蛋白质结构预测。
Proc Natl Acad Sci U S A. 2020 Jan 21;117(3):1496-1503. doi: 10.1073/pnas.1914677117. Epub 2020 Jan 2.
6
DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins.DeepMSA:构建深度多重序列比对以改进远距离同源蛋白质的接触预测和折叠识别。
Bioinformatics. 2020 Apr 1;36(7):2105-2112. doi: 10.1093/bioinformatics/btz863.
7
Detecting distant-homology protein structures by aligning deep neural-network based contact maps.通过对齐基于深度神经网络的接触图来检测远程同源蛋白结构。
PLoS Comput Biol. 2019 Oct 17;15(10):e1007411. doi: 10.1371/journal.pcbi.1007411. eCollection 2019 Oct.
8
Critical assessment of methods of protein structure prediction (CASP)-Round XIII.蛋白质结构预测方法的关键评估(CASP)-第十三轮。
Proteins. 2019 Dec;87(12):1011-1020. doi: 10.1002/prot.25823. Epub 2019 Oct 23.
9
Assessing the accuracy of contact predictions in CASP13.评估 CASP13 中接触预测的准确性。
Proteins. 2019 Dec;87(12):1058-1068. doi: 10.1002/prot.25819. Epub 2019 Oct 24.
10
HH-suite3 for fast remote homology detection and deep protein annotation.HH-suite3 用于快速远程同源检测和深度蛋白质注释。
BMC Bioinformatics. 2019 Sep 14;20(1):473. doi: 10.1186/s12859-019-3019-7.