• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种新型结构位置特异性评分矩阵,用于预测蛋白质二级结构。

A novel structural position-specific scoring matrix for the prediction of protein secondary structures.

机构信息

Department of Chemistry, Tongji University, Shanghai 200092, China.

出版信息

Bioinformatics. 2012 Jan 1;28(1):32-9. doi: 10.1093/bioinformatics/btr611. Epub 2011 Nov 7.

DOI:10.1093/bioinformatics/btr611
PMID:22065541
Abstract

MOTIVATION

The precise prediction of protein secondary structure is of key importance for the prediction of 3D structure and biological function. Although the development of many excellent methods over the last few decades has allowed the achievement of prediction accuracies of up to 80%, progress seems to have reached a bottleneck, and further improvements in accuracy have proven difficult.

RESULTS

We propose for the first time a structural position-specific scoring matrix (SPSSM), and establish an unprecedented database of 9 million sequences and their SPSSMs. This database, when combined with a purpose-designed BLAST tool, provides a novel prediction tool: SPSSMPred. When the SPSSMPred was validated on a large dataset (10,814 entries), the Q3 accuracy of the protein secondary structure prediction was 93.4%. Our approach was tested on the two latest EVA sets; accuracies of 82.7 and 82.0% were achieved, far higher than can be achieved using other predictors. For further evaluation, we tested our approach on newly determined sequences (141 entries), and obtained an accuracy of 89.6%. For a set of low-homology proteins (40 entries), the SPSSMPred still achieved a Q3 value of 84.6%.

AVAILABILITY

The SPSSMPred server is available at http://cal.tongji.edu.cn/SPSSMPred/

CONTACT

lith@tongji.edu.cn

摘要

动机

准确预测蛋白质二级结构对于预测 3D 结构和生物功能至关重要。尽管在过去几十年中开发了许多优秀的方法,已经能够达到高达 80%的预测精度,但似乎已经达到了一个瓶颈,进一步提高精度变得困难。

结果

我们首次提出了一种结构位置特异性评分矩阵(SPSSM),并建立了一个前所未有的包含 900 万序列及其 SPSSM 的数据库。该数据库与专门设计的 BLAST 工具结合,提供了一种新的预测工具:SPSSMPred。当在一个大型数据集(10814 个条目)上验证 SPSSMPred 时,蛋白质二级结构预测的 Q3 精度为 93.4%。我们的方法在最新的两个 EVA 数据集上进行了测试,分别实现了 82.7%和 82.0%的精度,远高于其他预测器所能达到的精度。为了进一步评估,我们在新确定的序列(141 个条目)上测试了我们的方法,获得了 89.6%的精度。对于一组低同源性的蛋白质(40 个条目),SPSSMPred 仍然能够达到 Q3 值为 84.6%。

可用性

SPSSMPred 服务器可在 http://cal.tongji.edu.cn/SPSSMPred/ 上使用。

联系人

lith@tongji.edu.cn

相似文献

1
A novel structural position-specific scoring matrix for the prediction of protein secondary structures.一种新型结构位置特异性评分矩阵,用于预测蛋白质二级结构。
Bioinformatics. 2012 Jan 1;28(1):32-9. doi: 10.1093/bioinformatics/btr611. Epub 2011 Nov 7.
2
SPSSM8: an accurate approach for predicting eight-state secondary structures of proteins.SPSSM8:一种准确预测蛋白质 8 态二级结构的方法。
Biochimie. 2013 Dec;95(12):2460-4. doi: 10.1016/j.biochi.2013.09.007. Epub 2013 Sep 18.
3
Improving protein secondary structure prediction based on short subsequences with local structure similarity.基于局部结构相似性的短序列提高蛋白质二级结构预测。
BMC Genomics. 2010 Dec 2;11 Suppl 4(Suppl 4):S4. doi: 10.1186/1471-2164-11-S4-S4.
4
Prediction of protein structural classes for low-homology sequences based on predicted secondary structure.基于预测的二级结构预测低同源序列的蛋白质结构类别。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-11-S1-S9.
5
MUPRED: a tool for bridging the gap between template based methods and sequence profile based methods for protein secondary structure prediction.MUPRED:一种弥合基于模板的方法与基于序列轮廓的方法在蛋白质二级结构预测方面差距的工具。
Proteins. 2007 Feb 15;66(3):664-70. doi: 10.1002/prot.21177.
6
Prediction of protein structural classes for low-similarity sequences using reduced PSSM and position-based secondary structural features.使用简化的位置特异性得分矩阵(PSSM)和基于位置的二级结构特征预测低相似性序列的蛋白质结构类别。
Gene. 2015 Jan 10;554(2):241-8. doi: 10.1016/j.gene.2014.10.037. Epub 2014 Oct 24.
7
A novel method for protein secondary structure prediction using dual-layer SVM and profiles.一种使用双层支持向量机和轮廓进行蛋白质二级结构预测的新方法。
Proteins. 2004 Mar 1;54(4):738-43. doi: 10.1002/prot.10634.
8
EVA: Evaluation of protein structure prediction servers.EVA:蛋白质结构预测服务器评估
Nucleic Acids Res. 2003 Jul 1;31(13):3311-5. doi: 10.1093/nar/gkg619.
9
A protein structural classes prediction method based on PSI-BLAST profile.一种基于PSI-BLAST序列谱的蛋白质结构类预测方法。
J Theor Biol. 2014 Jul 21;353:19-23. doi: 10.1016/j.jtbi.2014.02.034. Epub 2014 Mar 4.
10
A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction.基于二级结构的位置特异性评分矩阵在提高蛋白质二级结构预测中的应用。
PLoS One. 2021 Jul 28;16(7):e0255076. doi: 10.1371/journal.pone.0255076. eCollection 2021.

引用本文的文献

1
Boosting the accuracy of protein secondary structure prediction through nearest neighbor search and method hybridization.通过最近邻搜索和方法杂交提高蛋白质二级结构预测的准确性。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i317-i325. doi: 10.1093/bioinformatics/btaa336.
2
Sixty-five years of the long march in protein secondary structure prediction: the final stretch?蛋白质二级结构预测的长征:终章?
Brief Bioinform. 2018 May 1;19(3):482-494. doi: 10.1093/bib/bbw129.
3
NMRDSP: an accurate prediction of protein shape strings from NMR chemical shifts and sequence data.
NMRDSP:基于核磁共振化学位移和序列数据对蛋白质形状字符串进行准确预测
PLoS One. 2013 Dec 23;8(12):e83532. doi: 10.1371/journal.pone.0083532. eCollection 2013.
4
DomHR: accurately identifying domain boundaries in proteins using a hinge region strategy.使用铰链区策略准确识别蛋白质的结构域边界。
PLoS One. 2013 Apr 11;8(4):e60559. doi: 10.1371/journal.pone.0060559. Print 2013.
5
Predicting turns in proteins with a unified model.用统一模型预测蛋白质的转折。
PLoS One. 2012;7(11):e48389. doi: 10.1371/journal.pone.0048389. Epub 2012 Nov 7.
6
DSP: a protein shape string and its profile prediction server.DSP:蛋白质形状字符串及其轮廓预测服务器。
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W298-302. doi: 10.1093/nar/gks361. Epub 2012 May 2.
7
Retrieving backbone string neighbors provides insights into structural modeling of membrane proteins.提取骨干字符串邻居为膜蛋白的结构建模提供了深入了解。
Mol Cell Proteomics. 2012 Jul;11(7):M111.016808. doi: 10.1074/mcp.M111.016808. Epub 2012 Mar 13.