Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction.

Author information

Department of Statistics, University of Oxford, Oxford OX1 3LB, UK.

Department of Informatics, UCB Pharma, Slough SL1 3WE, UK.

Publication information

Bioinformatics. 2018 Apr 1;34(7):1132-1140. doi: 10.1093/bioinformatics/btx722.

DOI: 10.1093/bioinformatics/btx722

PMID: 29136098

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6030820/

Abstract

MOTIVATION

Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resources. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work showing that many proteins fold cotranslationally.

RESULTS

We investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo protein structure prediction. We observed that our sequential approach converges when fewer than 20 000 decoys have been produced, fewer than commonly expected. Using our software, SAINT2, we also compared the run time and quality of models produced in a sequential fashion against a standard, non-sequential approach. Sequential prediction produces an individual decoy 1.5-2.5 times faster than non-sequential prediction. When considering the quality of the best model, sequential prediction led to a better model being produced for 31 out of 41 soluble protein validation cases and for 18 out of 24 transmembrane protein cases. Correct models (TM-Score > 0.5) were produced for 29 of these cases by the sequential mode and for only 22 by the non-sequential mode. Our comparison reveals that a sequential search strategy can be used to drastically reduce the computational time of de novo protein structure prediction and to improve its accuracy.
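To make the contrast between the two search modes concrete, here is a minimal toy sketch in Python. It is not SAINT2's implementation: the three-letter state alphabet, the `make_toy_library` helper, the mismatch score, and the acceptance rule (always accept a move that does not worsen the score, occasionally accept a worse one) are all assumptions chosen for illustration. The sequential mode grows the chain one residue at a time from one terminus and applies fragment replacements only to the part built so far; the non-sequential mode applies the same number of moves to a full-length chain.

```python
import random

STATES = "HEC"  # toy per-residue states (helix, strand, coil); an assumption


def make_toy_library(target, frag_len, n_decoys, rng):
    """Hypothetical fragment library: for each start position, the matching
    piece of the target plus a few random decoy fragments."""
    library = []
    for start in range(len(target) - frag_len + 1):
        frags = [list(target[start:start + frag_len])]
        for _ in range(n_decoys):
            frags.append([rng.choice(STATES) for _ in range(frag_len)])
        library.append(frags)
    return library


def assemble(length, library, frag_len, score, rng, sequential,
             total_moves=2000, p_worse=0.02):
    """Toy fragment assembly; both modes receive the same move budget."""
    # Sequential mode splits the budget across growth stages.
    stages = length - frag_len + 1 if sequential else 1
    moves_per_stage = max(1, total_moves // stages)
    # Sequential mode starts from a short stub; non-sequential from full length.
    chain = ["C"] * (frag_len if sequential else length)
    current = score(chain)
    while True:
        for _ in range(moves_per_stage):
            # Paste a fragment somewhere in the part of the chain built so far.
            start = rng.randrange(len(chain) - frag_len + 1)
            trial = list(chain)
            trial[start:start + frag_len] = rng.choice(library[start])
            trial_score = score(trial)
            # Assumed acceptance rule: keep non-worsening moves,
            # and rarely keep a worsening one.
            if trial_score <= current or rng.random() < p_worse:
                chain, current = trial, trial_score
        if len(chain) == length:
            return chain, current
        chain.append("C")  # extend the chain by one residue
        current = score(chain)


rng = random.Random(0)
target = [rng.choice(STATES) for _ in range(40)]
library = make_toy_library(target, frag_len=5, n_decoys=9, rng=rng)


def mismatches(chain):
    """Toy score: mismatches against the target over the residues built so far."""
    return sum(a != b for a, b in zip(chain, target))


for mode in (True, False):
    chain, final = assemble(len(target), library, 5, mismatches, rng, sequential=mode)
    print("sequential" if mode else "non-sequential", "mismatches:", final)
```

Because early moves in the sequential mode act on a short chain, each move in this toy is cheaper to evaluate than a move on the full-length chain. The toy only illustrates the structure of the two search modes and says nothing about the run times or model quality reported above.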

AVAILABILITY AND IMPLEMENTATION

Data are available for download from: http://opig.stats.ox.ac.uk/resources. SAINT2 is available for download from: https://github.com/sauloho/SAINT2.

CONTACT

saulo.deoliveira@dtc.ox.ac.uk.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
