Suppr超能文献

我们需要训练一个预测器需要多少个 3D 结构?

How many 3D structures do we need to train a predictor?

机构信息

Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Athens 15701, Greece.

出版信息

Genomics Proteomics Bioinformatics. 2009 Sep;7(3):128-37. doi: 10.1016/S1672-0229(08)60041-8.

Abstract

It has been shown that the progress in the determination of membrane protein structure grows exponentially, with approximately the same growth rate as that of the water-soluble proteins. In order to investigate the effect of this, on the performance of prediction algorithms for both alpha-helical and beta-barrel membrane proteins, we conducted a prospective study based on historical records. We trained separate hidden Markov models with different sized training sets and evaluated their performance on topology prediction for the two classes of transmembrane proteins. We show that the existing top-scoring algorithms for predicting the transmembrane segments of alpha-helical membrane proteins perform slightly better than that of beta-barrel outer membrane proteins in all measures of accuracy. With the same rationale, a meta-analysis of the performance of the secondary structure prediction algorithms indicates that existing algorithmic techniques cannot be further improved by just adding more non-homologous sequences to the training sets. The upper limit for secondary structure prediction is estimated to be no more than 70% and 80% of correctly predicted residues for single sequence based methods and multiple sequence based ones, respectively. Therefore, we should concentrate our efforts on utilizing new techniques for the development of even better scoring predictors.

摘要

已经表明,膜蛋白结构测定的进展呈指数级增长,其增长率与水溶性蛋白大致相同。为了研究这对预测算法性能的影响,我们基于历史记录进行了一项前瞻性研究。我们使用不同大小的训练集训练了单独的隐马尔可夫模型,并在两类跨膜蛋白的拓扑预测上评估了它们的性能。我们表明,现有的预测α-螺旋跨膜蛋白跨膜片段的顶级算法在所有准确性度量上的表现均略优于β-桶外膜蛋白。基于同样的原理,对二级结构预测算法性能的荟萃分析表明,通过仅向训练集中添加更多非同源序列,现有算法技术无法进一步提高。基于单序列的方法和基于多个序列的方法的二级结构预测的上限估计分别不超过正确预测残基的 70%和 80%。因此,我们应该集中精力利用新技术开发更好的评分预测器。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e31e/5054404/eaf6b5a144b0/gr1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验