• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于一级结构相似性的蛋白质二级结构预测新混合编码方法。

A new hybrid coding for protein secondary structure prediction based on primary structure similarity.

作者信息

Li Zhong, Wang Jing, Zhang Shunpu, Zhang Qifeng, Wu Wuming

机构信息

College of Science, Zhejiang Sci-Tech University, Hangzhou 30018, China.

College of Science, Zhejiang Sci-Tech University, Hangzhou 30018, China.

出版信息

Gene. 2017 Jun 30;618:8-13. doi: 10.1016/j.gene.2017.03.011. Epub 2017 Mar 16.

DOI:10.1016/j.gene.2017.03.011
PMID:28322997
Abstract

The coding pattern of protein can greatly affect the prediction accuracy of protein secondary structure. In this paper, a novel hybrid coding method based on the physicochemical properties of amino acids and tendency factors is proposed for the prediction of protein secondary structure. The principal component analysis (PCA) is first applied to the physicochemical properties of amino acids to construct a 3-bit-code, and then the 3 tendency factors of amino acids are calculated to generate another 3-bit-code. Two 3-bit-codes are fused to form a novel hybrid 6-bit-code. Furthermore, we make a geometry-based similarity comparison of the protein primary structure between the reference set and the test set before the secondary structure prediction. We finally use the support vector machine (SVM) to predict those amino acids which are not detected by the primary structure similarity comparison. Experimental results show that our method achieves a satisfactory improvement in accuracy in the prediction of protein secondary structure.

摘要

蛋白质的编码模式会极大地影响蛋白质二级结构的预测准确性。本文提出了一种基于氨基酸理化性质和趋势因子的新型混合编码方法用于蛋白质二级结构预测。首先将主成分分析(PCA)应用于氨基酸的理化性质以构建一个3位编码,然后计算氨基酸的3个趋势因子以生成另一个3位编码。两个3位编码融合形成一个新型混合6位编码。此外,在二级结构预测之前,我们对参考集和测试集之间的蛋白质一级结构进行基于几何的相似性比较。最后我们使用支持向量机(SVM)来预测那些未被一级结构相似性比较检测到的氨基酸。实验结果表明,我们的方法在蛋白质二级结构预测的准确性方面取得了令人满意的提高。

相似文献

1
A new hybrid coding for protein secondary structure prediction based on primary structure similarity.一种基于一级结构相似性的蛋白质二级结构预测新混合编码方法。
Gene. 2017 Jun 30;618:8-13. doi: 10.1016/j.gene.2017.03.011. Epub 2017 Mar 16.
2
Predicting protein secondary structure by a support vector machine based on a new coding scheme.基于一种新编码方案的支持向量机预测蛋白质二级结构
Genome Inform. 2004;15(2):181-90.
3
Accurate prediction of protein structural classes by incorporating predicted secondary structure information into the general form of Chou's pseudo amino acid composition.通过将预测的二级结构信息纳入周的伪氨基酸组成的通用形式,准确预测蛋白质结构类别。
J Theor Biol. 2014 Mar 7;344:12-8. doi: 10.1016/j.jtbi.2013.11.021. Epub 2013 Dec 6.
4
COMSAT: Residue contact prediction of transmembrane proteins based on support vector machines and mixed integer linear programming.COMSAT:基于支持向量机和混合整数线性规划的跨膜蛋白残基接触预测
Proteins. 2016 Mar;84(3):332-48. doi: 10.1002/prot.24979. Epub 2016 Jan 20.
5
HYPROSP II--a knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence.HYPROSP II——一种基于局部预测置信度的用于蛋白质二级结构预测的基于知识的混合方法。
Bioinformatics. 2005 Aug 1;21(15):3227-33. doi: 10.1093/bioinformatics/bti524. Epub 2005 Jun 2.
6
Prediction of microRNA-binding residues in protein using a Laplacian support vector machine based on sequence information.基于序列信息,使用拉普拉斯支持向量机预测蛋白质中的微小RNA结合残基。
J Bioinform Comput Biol. 2018 Jun;16(3):1840009. doi: 10.1142/S0219720018400097. Epub 2018 Feb 4.
7
Highly accurate and consistent method for prediction of helix and strand content from primary protein sequences.一种从蛋白质一级序列预测螺旋和链含量的高度准确且一致的方法。
Artif Intell Med. 2005 Sep-Oct;35(1-2):19-35. doi: 10.1016/j.artmed.2005.02.006.
8
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
9
Prediction of non-classical secreted proteins using informative physicochemical properties.利用信息丰富的物理化学特性预测非经典分泌蛋白。
Interdiscip Sci. 2010 Sep;2(3):263-70. doi: 10.1007/s12539-010-0023-z. Epub 2010 Jul 25.
10
Prediction of protein secondary structure based on residue pairs.基于残基对的蛋白质二级结构预测
J Bioinform Comput Biol. 2004 Jun;2(2):343-52. doi: 10.1142/s0219720004000594.

引用本文的文献

1
Impact of Multi-Factor Features on Protein Secondary Structure Prediction.多因素特征对蛋白质二级结构预测的影响。
Biomolecules. 2024 Sep 13;14(9):1155. doi: 10.3390/biom14091155.
2
SERT-StructNet: Protein secondary structure prediction method based on multi-factor hybrid deep model.SERT-StructNet:基于多因素混合深度模型的蛋白质二级结构预测方法。
Comput Struct Biotechnol J. 2024 Mar 22;23:1364-1375. doi: 10.1016/j.csbj.2024.03.018. eCollection 2024 Dec.
3
Ofatumumab and Granzyme B as immunotoxin against CD20 antigen.奥法木单抗和颗粒酶B作为针对CD20抗原的免疫毒素
In Silico Pharmacol. 2022 Mar 18;10(1):6. doi: 10.1007/s40203-022-00120-6. eCollection 2022.
4
Research on predicting 2D-HP protein folding using reinforcement learning with full state space.基于全状态空间的强化学习预测 2D-HP 蛋白质折叠的研究。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):685. doi: 10.1186/s12859-019-3259-6.
5
A computational method for prediction of xylanase enzymes activity in strains of Bacillus subtilis based on pseudo amino acid composition features.基于伪氨基酸组成特征的枯草芽孢杆菌木聚糖酶活力预测的计算方法。
PLoS One. 2018 Oct 22;13(10):e0205796. doi: 10.1371/journal.pone.0205796. eCollection 2018.
6
Prediction of subcellular location of apoptosis proteins by incorporating PsePSSM and DCCA coefficient based on LFDA dimensionality reduction.基于 LFDA 降维的 PsePSSM 和 DCCA 系数融合预测细胞凋亡蛋白的亚细胞定位。
BMC Genomics. 2018 Jun 19;19(1):478. doi: 10.1186/s12864-018-4849-9.
7
Rama: a machine learning approach for ribosomal protein prediction in plants.拉玛:一种植物核糖体蛋白预测的机器学习方法。
Sci Rep. 2017 Nov 24;7(1):16273. doi: 10.1038/s41598-017-16322-4.