Suppr超能文献

基于二级结构的位置特异性评分矩阵在提高蛋白质二级结构预测中的应用。

A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction.

机构信息

Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan.

Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan.

出版信息

PLoS One. 2021 Jul 28;16(7):e0255076. doi: 10.1371/journal.pone.0255076. eCollection 2021.

Abstract

Protein secondary structure prediction (SSP) has a variety of applications; however, there has been relatively limited improvement in accuracy for years. With a vision of moving forward all related fields, we aimed to make a fundamental advance in SSP. There have been many admirable efforts made to improve the machine learning algorithm for SSP. This work thus took a step back by manipulating the input features. A secondary structure element-based position-specific scoring matrix (SSE-PSSM) is proposed, based on which a new set of machine learning features can be established. The feasibility of this new PSSM was evaluated by rigid independent tests with training and testing datasets sharing <25% sequence identities. In all experiments, the proposed PSSM outperformed the traditional amino acid PSSM. This new PSSM can be easily combined with the amino acid PSSM, and the improvement in accuracy was remarkable. Preliminary tests made by combining the SSE-PSSM and well-known SSP methods showed 2.0% and 5.2% average improvements in three- and eight-state SSP accuracies, respectively. If this PSSM can be integrated into state-of-the-art SSP methods, the overall accuracy of SSP may break the current restriction and eventually bring benefit to all research and applications where secondary structure prediction plays a vital role during development. To facilitate the application and integration of the SSE-PSSM with modern SSP methods, we have established a web server and standalone programs for generating SSE-PSSM available at http://10.life.nctu.edu.tw/SSE-PSSM.

摘要

蛋白质二级结构预测(SSP)有多种应用;然而,多年来其准确性的提高相对有限。为了推动所有相关领域的发展,我们旨在推动 SSP 取得根本进展。已经有许多令人钦佩的努力来改进 SSP 的机器学习算法。这项工作因此通过操作输入特征退了一步。基于此,提出了一种基于二级结构元素的位置特定评分矩阵(SSE-PSSM),在此基础上可以建立一组新的机器学习特征。通过使用共享 <25%序列同一性的训练集和测试集进行严格的独立测试,评估了这种新 PSSM 的可行性。在所有实验中,所提出的 PSSM 均优于传统的氨基酸 PSSM。这种新的 PSSM 可以很容易地与氨基酸 PSSM 结合,并且准确性的提高非常显著。通过结合 SSE-PSSM 和著名的 SSP 方法进行的初步测试表明,在三状态和八状态 SSP 准确性方面分别提高了 2.0%和 5.2%。如果这种 PSSM 可以集成到最先进的 SSP 方法中,那么 SSP 的整体准确性可能会打破当前的限制,并最终使所有在开发过程中二级结构预测起着重要作用的研究和应用受益。为了促进 SSE-PSSM 与现代 SSP 方法的应用和集成,我们建立了一个网络服务器和独立程序,用于生成 SSE-PSSM,可在 http://10.life.nctu.edu.tw/SSE-PSSM 上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db51/8318245/24f08733e604/pone.0255076.g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验