• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于预测蛋白质二级结构类别的集成快速傅里叶变换模型。

An Integrated-OFFT Model for the Prediction of Protein Secondary Structure Class.

作者信息

Panda Bishnupriya, Majhi Babita, Thakur Abhimanyu

机构信息

Department of Computer Science and Engineering, Institute of Technical Education and Research, Siksha 'O' Anusandhan University, Bhubaneswar, Orissa, India.

Department of Computer Science and Information Technology, Guru Ghashidas Vishwavidyalaya (A Central University), Bilaspur, Chhattisgarh, India.

出版信息

Curr Comput Aided Drug Des. 2019;15(1):45-54. doi: 10.2174/1573409914666180828105228.

DOI:10.2174/1573409914666180828105228
PMID:30152288
Abstract

BACKGROUND

Proteins are the utmost multi-purpose macromolecules, which play a crucial function in many aspects of biological processes. For a long time, sequence arrangement of amino acid has been utilized for the prediction of protein secondary structure. Besides, in major methods for the prediction of protein secondary structure class, the impact of Gaussian noise on sequence representation of amino acids has not been considered until now; which is one of the important constraints for the functionality of a protein.

METHODS

In the present research, the prediction of protein secondary structure class was accomplished by integrated application of Stockwell transformation and Amino Acid Composition (AAC), on equivalent Electron-ion Interaction Potential (EIIP) representation of raw amino acid sequence. The introduced method was evaluated by using 4 benchmark datasets of low sequence homology, namely PDB25, 498, 277, and 204. Furthermore, random forest algorithm together with the out-of-bag error estimate and Support Vector Machine (SVM), using k-fold cross validation demonstrated high feature representation potential of our reported approach.

RESULTS

The overall prediction accuracy for PDB25, 498, 277, and 204 datasets with randomforest classifier was 92.5%, 94.79%, 92.45%, and 88.04% respectively, whereas with SVM, the results were 84.66%, 95.32%, 89.29%, and 84.37% respectively.

CONCLUSION

An integrated-order-function-frequency-time (OFFT) model has been proposed for the prediction of protein secondary structure class. For the first time, we reported the effect of Gaussian noise on the prediction accuracy of protein secondary structure class and proposed a robust integrated- OFFT model, which is effectively noise resistant.

摘要

背景

蛋白质是用途最为广泛的大分子,在生物过程的许多方面发挥着关键作用。长期以来,氨基酸序列排列一直被用于预测蛋白质二级结构。此外,在蛋白质二级结构类别的主要预测方法中,高斯噪声对氨基酸序列表示的影响至今尚未得到考虑;而这是蛋白质功能的重要限制因素之一。

方法

在本研究中,通过将斯托克韦尔变换与氨基酸组成(AAC)综合应用于原始氨基酸序列的等效电子 - 离子相互作用势(EIIP)表示,实现了蛋白质二级结构类别的预测。使用4个低序列同源性的基准数据集,即PDB25、498、277和204,对所提出的方法进行了评估。此外,随机森林算法结合袋外误差估计和支持向量机(SVM),采用k折交叉验证,证明了我们所报道方法具有很高的特征表示潜力。

结果

使用随机森林分类器时,PDB25、498、277和204数据集的总体预测准确率分别为92.5%、94.79%、92.45%和88.04%,而使用SVM时,结果分别为84.66%、95.32%、89.29%和84.37%。

结论

提出了一种用于预测蛋白质二级结构类别的综合阶次 - 函数 - 频率 - 时间(OFFT)模型。我们首次报道了高斯噪声对蛋白质二级结构类别预测准确率的影响,并提出了一种强大的综合OFFT模型,该模型具有有效的抗噪声能力。

相似文献

1
An Integrated-OFFT Model for the Prediction of Protein Secondary Structure Class.一种用于预测蛋白质二级结构类别的集成快速傅里叶变换模型。
Curr Comput Aided Drug Des. 2019;15(1):45-54. doi: 10.2174/1573409914666180828105228.
2
Novel structure-driven features for accurate prediction of protein structural class.用于准确预测蛋白质结构类别的新型结构驱动特征。
Genomics. 2014 Apr;103(4):292-7. doi: 10.1016/j.ygeno.2014.04.002. Epub 2014 Apr 18.
3
Structural class prediction of protein using novel feature extraction method from chaos game representation of predicted secondary structure.利用从预测二级结构的混沌博弈表示中提取的新特征方法对蛋白质进行结构类预测。
J Theor Biol. 2016 Jul 7;400:1-10. doi: 10.1016/j.jtbi.2016.04.011. Epub 2016 Apr 12.
4
SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.支持向量机折叠法:一种用于判别式多类别蛋白质折叠和超家族识别的工具。
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.
5
Prediction of protein structural class using novel evolutionary collocation-based sequence representation.使用基于新型进化搭配的序列表示法预测蛋白质结构类别。
J Comput Chem. 2008 Jul 30;29(10):1596-604. doi: 10.1002/jcc.20918.
6
SVM-based method for protein structural class prediction using secondary structural content and structural information of amino acids.基于支持向量机的蛋白质结构类预测方法,该方法利用二级结构含量和氨基酸的结构信息。
J Bioinform Comput Biol. 2011 Aug;9(4):489-502. doi: 10.1142/s0219720011005422.
7
A novel predictor for protein structural class based on integrated information of the secondary structure sequence.一种基于二级结构序列综合信息的蛋白质结构类新型预测器。
Biochimie. 2014 Aug;103:131-6. doi: 10.1016/j.biochi.2014.05.008. Epub 2014 May 22.
8
A novel Multi-Agent Ada-Boost algorithm for predicting protein structural class with the information of protein secondary structure.一种利用蛋白质二级结构信息预测蛋白质结构类别的新型多智能体Ada-Boost算法。
J Bioinform Comput Biol. 2015 Oct;13(5):1550022. doi: 10.1142/S0219720015500225. Epub 2015 Aug 11.
9
Accurate prediction of protein structural classes by incorporating predicted secondary structure information into the general form of Chou's pseudo amino acid composition.通过将预测的二级结构信息纳入周的伪氨基酸组成的通用形式,准确预测蛋白质结构类别。
J Theor Biol. 2014 Mar 7;344:12-8. doi: 10.1016/j.jtbi.2013.11.021. Epub 2013 Dec 6.
10
High-accuracy prediction of protein structural class for low-similarity sequences based on predicted secondary structure.基于预测的二级结构对低相似度序列进行蛋白质结构类别高精度预测。
Biochimie. 2011 Apr;93(4):710-4. doi: 10.1016/j.biochi.2011.01.001. Epub 2011 Jan 13.

引用本文的文献

1
Predicting liposome formulations by the integrated machine learning and molecular modeling approaches.通过集成机器学习和分子建模方法预测脂质体制剂
Asian J Pharm Sci. 2023 May;18(3):100811. doi: 10.1016/j.ajps.2023.100811. Epub 2023 Apr 16.
2
Deep learning approach for early prediction of COVID-19 mortality using chest X-ray and electronic health records.基于胸部 X 光和电子健康记录的深度学习方法对 COVID-19 死亡率的早期预测。
BMC Bioinformatics. 2023 May 9;24(1):190. doi: 10.1186/s12859-023-05321-0.
3
Prediction of lipid nanoparticles for mRNA vaccines by the machine learning algorithm.
通过机器学习算法预测用于mRNA疫苗的脂质纳米颗粒。
Acta Pharm Sin B. 2022 Jun;12(6):2950-2962. doi: 10.1016/j.apsb.2021.11.021. Epub 2021 Dec 2.
4
Development of machine learning model for diagnostic disease prediction based on laboratory tests.基于实验室检查的机器学习模型在疾病诊断预测中的开发。
Sci Rep. 2021 Apr 7;11(1):7567. doi: 10.1038/s41598-021-87171-5.
5
Polymorphisms and genetic effects of PRLR, MOGAT1, MINPP1 and CHUK genes on milk fatty acid traits in Chinese Holstein.中国荷斯坦奶牛 PRLR、MOGAT1、MINPP1 和 CHUK 基因多态性及其遗传效应与乳脂肪酸性状的关系。
BMC Genet. 2019 Aug 16;20(1):69. doi: 10.1186/s12863-019-0769-1.