• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用二级结构、更高距离阈值下的接触以及非接触改进蛋白质结构重建。

Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts.

作者信息

Adhikari Badri, Cheng Jianlin

机构信息

Department of Mathematics and Computer Science, University of Missouri-St.Louis, St. Louis, MO, 63121, USA.

Department of Electrical Engineering & Computer Science, Informatics Institute, University of Missouri, Columbia, MO, 65211, USA.

出版信息

BMC Bioinformatics. 2017 Aug 29;18(1):380. doi: 10.1186/s12859-017-1807-5.

DOI:10.1186/s12859-017-1807-5
PMID:28851269
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5576353/
Abstract

BACKGROUND

Residue-residue contacts are key features for accurate de novo protein structure prediction. For the optimal utilization of these predicted contacts in folding proteins accurately, it is important to study the challenges of reconstructing protein structures using true contacts. Because contact-guided protein modeling approach is valuable for predicting the folds of proteins that do not have structural templates, it is necessary for reconstruction studies to focus on hard-to-predict protein structures.

RESULTS

Using a data set consisting of 496 structural domains released in recent CASP experiments and a dataset of 150 representative protein structures, in this work, we discuss three techniques to improve the reconstruction accuracy using true contacts - adding secondary structures, increasing contact distance thresholds, and adding non-contacts. We find that reconstruction using secondary structures and contacts can deliver accuracy higher than using full contact maps. Similarly, we demonstrate that non-contacts can improve reconstruction accuracy not only when the used non-contacts are true but also when they are predicted. On the dataset consisting of 150 proteins, we find that by simply using low ranked predicted contacts as non-contacts and adding them as additional restraints, can increase the reconstruction accuracy by 5% when the reconstructed models are evaluated using TM-score.

CONCLUSIONS

Our findings suggest that secondary structures are invaluable companions of contacts for accurate reconstruction. Confirming some earlier findings, we also find that larger distance thresholds are useful for folding many protein structures which cannot be folded using the standard definition of contacts. Our findings also suggest that for more accurate reconstruction using predicted contacts it is useful to predict contacts at higher distance thresholds (beyond 8 Å) and predict non-contacts.

摘要

背景

残基-残基接触是准确进行蛋白质从头结构预测的关键特征。为了在准确折叠蛋白质时最佳利用这些预测接触,研究使用真实接触重建蛋白质结构所面临的挑战很重要。由于接触引导的蛋白质建模方法对于预测没有结构模板的蛋白质折叠很有价值,因此重建研究有必要聚焦于难以预测的蛋白质结构。

结果

在这项工作中,我们使用由最近的蛋白质结构预测技术关键评估(CASP)实验中发布的496个结构域组成的数据集以及150个代表性蛋白质结构的数据集,讨论了三种使用真实接触提高重建准确性的技术——添加二级结构、增加接触距离阈值以及添加非接触。我们发现使用二级结构和接触进行重建可以提供比使用完整接触图更高的准确性。同样,我们证明非接触不仅在使用的非接触是真实的情况下,而且在它们是预测的情况下都可以提高重建准确性。在由150种蛋白质组成的数据集上,我们发现通过简单地将低排名的预测接触用作非接触并将它们作为额外的约束添加,当使用TM分数评估重建模型时,可以将重建准确性提高5%。

结论

我们的研究结果表明,二级结构是准确重建中接触的宝贵辅助。证实了一些早期的发现,我们还发现更大的距离阈值对于折叠许多使用标准接触定义无法折叠的蛋白质结构很有用。我们的研究结果还表明,为了使用预测接触进行更准确的重建,在更高的距离阈值(超过8埃)预测接触并预测非接触是有用的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/03440eccef72/12859_2017_1807_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/d8f981c75e11/12859_2017_1807_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/5c820aa59355/12859_2017_1807_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/4777fdd828e8/12859_2017_1807_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/0126cdc13a8a/12859_2017_1807_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/ecdefba3f7fb/12859_2017_1807_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/4df46cda382a/12859_2017_1807_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/6aa4dd37a6b8/12859_2017_1807_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/c63eeddbcf02/12859_2017_1807_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/b10d67bd9e94/12859_2017_1807_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/03440eccef72/12859_2017_1807_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/d8f981c75e11/12859_2017_1807_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/5c820aa59355/12859_2017_1807_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/4777fdd828e8/12859_2017_1807_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/0126cdc13a8a/12859_2017_1807_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/ecdefba3f7fb/12859_2017_1807_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/4df46cda382a/12859_2017_1807_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/6aa4dd37a6b8/12859_2017_1807_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/c63eeddbcf02/12859_2017_1807_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/b10d67bd9e94/12859_2017_1807_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81c1/5576353/03440eccef72/12859_2017_1807_Fig10_HTML.jpg

相似文献

1
Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts.利用二级结构、更高距离阈值下的接触以及非接触改进蛋白质结构重建。
BMC Bioinformatics. 2017 Aug 29;18(1):380. doi: 10.1186/s12859-017-1807-5.
2
CONFOLD: Residue-residue contact-guided ab initio protein folding.CONFOLD:基于残基-残基接触引导的从头算蛋白质折叠。
Proteins. 2015 Aug;83(8):1436-49. doi: 10.1002/prot.24829. Epub 2015 Jun 6.
3
Distance-based reconstruction of protein quaternary structures from inter-chain contacts.基于链间接触的蛋白质四级结构的距离重建。
Proteins. 2022 Mar;90(3):720-731. doi: 10.1002/prot.26269. Epub 2021 Nov 2.
4
Hybridized distance- and contact-based hierarchical structure modeling for folding soluble and membrane proteins.用于可溶性和膜蛋白折叠的杂交距离和接触基层级结构建模。
PLoS Comput Biol. 2021 Feb 23;17(2):e1008753. doi: 10.1371/journal.pcbi.1008753. eCollection 2021 Feb.
5
CONFOLD2: improved contact-driven ab initio protein structure modeling.CONFOLD2:改进的接触驱动从头蛋白质结构建模。
BMC Bioinformatics. 2018 Jan 25;19(1):22. doi: 10.1186/s12859-018-2032-6.
6
CoinFold: a web server for protein contact prediction and contact-assisted protein folding.CoinFold:用于蛋白质接触预测和接触辅助蛋白质折叠的网络服务器。
Nucleic Acids Res. 2016 Jul 8;44(W1):W361-6. doi: 10.1093/nar/gkw307. Epub 2016 Apr 25.
7
Effective inter-residue contact definitions for accurate protein fold recognition.用于准确蛋白质折叠识别的有效残基间接触定义。
BMC Bioinformatics. 2012 Nov 9;13:292. doi: 10.1186/1471-2105-13-292.
8
Predicted residue-residue contacts can help the scoring of 3D models.预测的残基-残基接触可以帮助 3D 模型的评分。
Proteins. 2010 Jun;78(8):1980-91. doi: 10.1002/prot.22714.
9
Exploring the effects of sparse restraints on protein structure prediction.探索稀疏约束对蛋白质结构预测的影响。
Proteins. 2018 Feb;86(2):248-262. doi: 10.1002/prot.25438. Epub 2017 Dec 19.
10
Inter-residue interactions in protein folding and stability.蛋白质折叠与稳定性中的残基间相互作用。
Prog Biophys Mol Biol. 2004 Oct;86(2):235-77. doi: 10.1016/j.pbiomolbio.2003.09.003.

引用本文的文献

1
Combining High-Pressure NMR and Geometrical Sampling to Obtain a Full Topological Description of Protein Folding Landscapes: Application to the Folding of Two MAX Effectors from .结合高压 NMR 和几何采样获得蛋白质折叠景观的完整拓扑描述:应用于. 两个 MAX 效应蛋白的折叠。
Int J Mol Sci. 2022 May 13;23(10):5461. doi: 10.3390/ijms23105461.
2
Connective Tissue Growth Factor: From Molecular Understandings to Drug Discovery.结缔组织生长因子:从分子认识到药物发现
Front Cell Dev Biol. 2020 Oct 29;8:593269. doi: 10.3389/fcell.2020.593269. eCollection 2020.
3
A fully open-source framework for deep learning protein real-valued distances.

本文引用的文献

1
COMSAT: Residue contact prediction of transmembrane proteins based on support vector machines and mixed integer linear programming.COMSAT:基于支持向量机和混合整数线性规划的跨膜蛋白残基接触预测
Proteins. 2016 Mar;84(3):332-48. doi: 10.1002/prot.24979. Epub 2016 Jan 20.
2
Accurate contact predictions using covariation techniques and machine learning.使用共变技术和机器学习进行准确的接触预测。
Proteins. 2016 Sep;84 Suppl 1(Suppl Suppl 1):145-51. doi: 10.1002/prot.24863. Epub 2015 Aug 14.
3
An Improved Integration of Template-Based and Template-Free Protein Structure Modeling Methods and its Assessment in CASP11.
深度学习蛋白质实值距离的完全开源框架。
Sci Rep. 2020 Aug 7;10(1):13374. doi: 10.1038/s41598-020-70181-0.
4
StructureDistiller: Structural relevance scoring identifies the most informative entries of a contact map.结构蒸馏器:结构相关性评分可识别接触图中最具信息量的条目。
Sci Rep. 2019 Dec 6;9(1):18517. doi: 10.1038/s41598-019-55047-4.
5
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。
Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.
6
Application of an interpretable classification model on Early Folding Residues during protein folding.一种可解释分类模型在蛋白质折叠过程中早期折叠残基上的应用。
BioData Min. 2019 Jan 5;12:1. doi: 10.1186/s13040-018-0188-2. eCollection 2019.
7
Characterizing the relation of functional and Early Folding Residues in protein structures using the example of aminoacyl-tRNA synthetases.利用氨酰-tRNA 合成酶的例子来描述蛋白质结构中功能和早期折叠残基的关系。
PLoS One. 2018 Oct 30;13(10):e0206369. doi: 10.1371/journal.pone.0206369. eCollection 2018.
基于模板和无模板蛋白质结构建模方法的改进整合及其在CASP11中的评估
Protein Pept Lett. 2015;22(7):586-93. doi: 10.2174/0929866522666150520145717.
4
CONFOLD: Residue-residue contact-guided ab initio protein folding.CONFOLD:基于残基-残基接触引导的从头算蛋白质折叠。
Proteins. 2015 Aug;83(8):1436-49. doi: 10.1002/prot.24829. Epub 2015 Jun 6.
5
MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.MetaPSICOV:结合协同进化方法用于精确预测蛋白质中的接触和长程氢键
Bioinformatics. 2015 Apr 1;31(7):999-1006. doi: 10.1093/bioinformatics/btu791. Epub 2014 Nov 26.
6
SSpro/ACCpro 5: almost perfect prediction of protein secondary structure and relative solvent accessibility using profiles, machine learning and structural similarity.SSpro/ACCpro 5:利用序列谱、机器学习和结构相似性对蛋白质二级结构和相对溶剂可及性进行近乎完美的预测。
Bioinformatics. 2014 Sep 15;30(18):2592-7. doi: 10.1093/bioinformatics/btu352. Epub 2014 May 24.
7
SCOPe: Structural Classification of Proteins--extended, integrating SCOP and ASTRAL data and classification of new structures.SCOPe:蛋白质结构分类——扩展版,整合了 SCOP 和 ASTRAL 数据以及新结构的分类。
Nucleic Acids Res. 2014 Jan;42(Database issue):D304-9. doi: 10.1093/nar/gkt1240. Epub 2013 Dec 3.
8
HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment.HHblits:通过 HMM-HMM 比对进行快速迭代的蛋白质序列搜索。
Nat Methods. 2011 Dec 25;9(2):173-5. doi: 10.1038/nmeth.1818.
9
PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments.PSICOV:使用基于稀疏逆协方差估计的大型多重序列比对进行精确结构接触预测。
Bioinformatics. 2012 Jan 15;28(2):184-90. doi: 10.1093/bioinformatics/btr638. Epub 2011 Nov 17.
10
Blurring contact maps of thousands of proteins: what we can learn by reconstructing 3D structure.模糊数千种蛋白质的接触图谱:通过重建 3D 结构我们可以学到什么。
BioData Min. 2011 Jan 13;4(1):1. doi: 10.1186/1756-0381-4-1.