Identifying protein quaternary structural attributes by incorporating physicochemical properties into the general form of Chou's PseAAC via discrete wavelet transform.

Authors

Sun Xing-Yu, Shi Shao-Ping, Qiu Jian-Ding, Suo Sheng-Bao, Huang Shu-Yun, Liang Ru-Ping

Affiliation

Department of Chemistry, Nanchang University, Nanchang 330031, P.R. China.

Publication

Mol Biosyst. 2012 Oct 30;8(12):3178-84. doi: 10.1039/c2mb25280e.

PMID: 22990717

Abstract

In vivo, some proteins exist as monomers and others as oligomers. Oligomers can be further classified into homo-oligomers (formed by identical subunits) and hetero-oligomers (formed by different subunits), and they form the structural components of various biological functions, including cooperative effects, allosteric mechanisms and ion-channel gating. Therefore, with the avalanche of protein sequences generated in the post-genomic era, it is very important for both basic research and the pharmaceutical industry to acquire knowledge about the quaternary structural attributes of proteins of interest. In view of this, a high-throughput method (DWT_DT), a 2-layer approach that fuses the discrete wavelet transform (DWT) and a decision-tree algorithm (DT) with physicochemical features, has been developed to predict protein quaternary structures. The 1st layer assigns a query protein to one of 10 main quaternary structural attributes. The 2nd layer evaluates whether the protein in question is a homo- or hetero-oligomer. The overall accuracy by jackknife test for the 1st layer identification was 89.60%. The overall accuracy of the 2nd layer varies from 88.23% to 100%. The results suggest that this newly developed protocol (DWT_DT) is very promising for predicting quaternary structures with complicated composition.
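
The feature-extraction idea in the abstract (mapping a sequence to physicochemical values and summarizing it with a discrete wavelet transform) can be illustrated with a minimal sketch. The abstract does not say which wavelet, which properties, how many decomposition levels, or which summary statistics were used, so everything below is an assumption chosen for illustration: a hand-written Haar wavelet, the Kyte-Doolittle hydropathy index, 3 levels, and max/min/mean/standard deviation per sub-band. The function names `haar_step`, `summarize`, and `dwt_features` are hypothetical, and the decision-tree layers are not shown.

```python
import math
from statistics import mean, pstdev

# Kyte-Doolittle hydropathy index. ASSUMPTION: the abstract does not name
# the physicochemical properties actually used by DWT_DT.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def haar_step(signal):
    """One level of the Haar DWT; returns (approximation, detail)."""
    if not signal:
        raise ValueError("signal must not be empty")
    if len(signal) % 2:
        # Pad odd-length input by repeating the last sample.
        signal = signal + [signal[-1]]
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def summarize(coeffs):
    """Four summary statistics of one sub-band (an assumed choice)."""
    return [max(coeffs), min(coeffs), mean(coeffs), pstdev(coeffs)]


def dwt_features(sequence, levels=3):
    """Map a protein sequence to a fixed-length feature vector.

    Raises KeyError for residues outside the 20 standard amino acids.
    """
    approx = [HYDROPATHY[aa] for aa in sequence]
    features = []
    for _ in range(levels):
        approx, detail = haar_step(approx)
        features += summarize(detail)   # detail sub-band at this level
    features += summarize(approx)       # final approximation sub-band
    return features


# Hypothetical short sequence, used only to exercise the functions.
features = dwt_features("MKVLAAGIVGLLLAQ")
```

Because each sub-band is reduced to four numbers, the vector length is 4 × (levels + 1) regardless of sequence length, which is what lets sequences of different lengths be fed to a fixed-input classifier such as a decision tree.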
